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Au these were honored in their generations, and 
were the glory of their times. 


Tikes be of them, that have left a name behind 
them, that their praises might be reported. 


And some there be, which have no memorial; who 
are perished, as though they had never been; and are 
become as though they had never been born; and 
their children after them. 


ECCLESIASTICUS 44:7-9 
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Since many excellent treatises on the history of mathe- 

P t e f a Cc e matics are available, there may seem to be little reason 

for writing another. But most current works are severely 

technical, written by mathematicians for other mathe- 

maticians or for historians of science. Despite the ad- 

mirable scholarship and often clear presentation of these works, they are not especially 

well adapted to the undergraduate classroom. (Perhaps the most notable exception is 

Howard Eves’s popular account, An Introduction to the History of Mathematics.) There 

is a need today for an undergraduate textbook, which is also accessible to the general 

reader interested in the history of mathematics. In the following pages, I have tried to 

give a reasonably full account of how mathematics has developed over the past 5000 

years. Because mathematics is one of the oldest intellectual instruments, it has a long 

story, interwoven with striking personalities and outstanding achievements. This narrative 

is chronological, beginning with the origin of mathematics in the great civilizations of 

antiquity and progressing through the later decades of the twentieth century. The presen- 

tation necessarily becomes less complete for modern times, when the pace of discovery 
has been rapid and the subject matter more technical. 

Considerable prominence has been assigned to the lives of the people responsible 
for progress in the mathematical enterprise. In emphasizing the biographical element, 
I can say only that there is no sphere in which individuals count for more than the 
intellectual life, and that most of the mathematicians cited here really did tower over 
their contemporaries. So that they will stand out as living gures and representatives of 
their day, it is necessary to pause from time to time to consider the social and cultural 
framework in which they lived. I have especially tried to de ne why mathematical activity 
waxed and waned in different periods and in different countries. 

Writers on the history of mathematics tend to be trapped between the desire to 
interject some genuine mathematics into a work and the desire to make the reading 
as painless and pleasant as possible. Believing that any mathematics textbook should 
concern itself primarily with teaching mathematical content, I have favored stressing 
the mathematics. Thus, assorted problems of varying degrees of dif culty have been 
interspersed throughout. Usually these problems typify a particular historical period, 
requiring the procedures of that time. They are an integral part of the text and, in 
working them, you will learn some interesting mathematics as well as history. The level of 
maturity needed for this work is approximately the mathematical background of a college 
junior or senior. Readers with more extensive training in the subject must forgive certain 
explanations that seem unnecessary. 

The title indicates that this book is in no way an encyclopedic enterprise: it does not 
pretend to present all the important mathematical ideas that arose during the vast sweep 
of time it covers. The inevitable limitations of space necessitate illuminating some out- 
standing landmarks instead of casting light of equal brilliance over the whole landscape. 
A certain amount of judgment and self-denial has been exercised, both in choosing 
mathematicians and in treating their contributions. The material that appears here does 
re ect some personal tastes and prejudices. It stands to reason that not everyone will be 
satis ed with the choices. Some readers will raise an eyebrow at the omission of some 
household names of mathematics that have been either passed over in complete silence 
or shown no great hospitality; others will regard the scant treatment of their favorite 
topic as an unpardonable omission. Nevertheless, the path that I have pieced together 
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should provide an adequate explanation of how mathematics came to occupy its position 
as a primary cultural force in Western civilization. The book is published in the modest 
hope that it may stimulate the reader to pursue more elaborate works on the subject. 

Anyone who ranges over such a well-cultivated eld as the history of mathematics 
becomes much indebted to the scholarship of others. The chapter bibliographies represent 
a partial listing of works that in one way or another have helped my command of the 
facts. To the writers and many others of whom no record was kept, I am enormously 
grateful. 


Readers familiar with previous editions of The History of Mathematics 


New to This Edition will nd that this seventh edition maintains the same overall organi- 


Electronic Books 


zation and content. Nevertheless, the preparation of a seventh edition 
has provided the occasion for a variety of small improvements as well 
as several more signi cant ones. 

The most notable difference is an enhanced treatment of American mathematics. 
Section 12.1, for instance, includes the efforts of such early nineteenth-century gures as 
Robert Adrain and Benjamin Banneker. Because the mathematically gifted of the period 
often became observational astronomers, the contributions of Simon Newcomb, George 
William Hill, Albert Michelson, and Maria Mitchell are also recounted. Later sections 
consider the work of more recent mathematicians, such as Oswald Veblen, R. L. Moore, 
Richard Courant, and Walter Feit. 

Another noteworthy difference is the attention now paid to several mathematicians 
passed over in previous editions. Among them are Lazar Carnot, Herman Giinther Grass- 
mann, Andrei Kolmogorov, William Burnside, and Paul Erdés. 

Beyond these modi cations, there are some minor changes: biographies are brought 
up to date and certain numerical information kept current. In addition, an attempt has 
been made to correct errors, both typographical and historical, which crept into the earlier 
editions. 


If you or your students are ready for an alternative version of the 
traditional textbook, McGraw-Hill has partnered with CourseSmart 
and VitalSource to bring you innovative and inexpensive electronic 
textbooks. Students can save up to 50 percent off the cost of a print 
book, reduce their impact on the environment, and gain access to 
powerful Web tools for learning including full text search, notes and highlighting, and 
email tools for sharing notes between classmates. eBooks from McGraw-Hill are smart, 
interactive, searchable, and portable. 
To review complimentary copies or to purchase an eBook, go to either 
www.CourseSmart.com or www. VitalSource.com. 


Many friends, colleagues, and readers—too numerous to mention 


Acknowledgments individually—have been kind enough to forward corrections or to of- 


fer suggestions for the book’s enrichment. My thanks to all for their 

collective contributions. Although not every recommendation was in- 

corporated, all were gratefully received and seriously considered when 
deciding upon alterations. 
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In particular, the advice of the following reviewers was especially helpful in the 
creation of the seventh edition: 


Victor Akatsa, Chicago State University 

Carl FitzGerald, The University of California, San Diego 
Gary Shannon, California State University, Sacramento 
Tomas Smotzer, Youngstown State University 

John Stroyls, Georgia Southwestern State University 


A special debt of thanks is owed my wife, Martha Beck Burton, for providing assis- 
tance throughout the preparation of this edition. Her thoughtful comments signi cantly 
improved the exposition. Finally, I would like to express my appreciation to the staff 
members of McGraw-Hill for their unfailing cooperation during the course of production. 

Any errors that have survived all this generous assistance must be laid at my door. 


D. M. B. 


CHAPTER l 


Early Number Systems and Symbols 


To think the thinkable—that is the mathematician’s aim. 


C.J. KEYSER 
aw : The root of the term mathematics is in the Greek word math- 
1,1 Primitive Counting emata, which was used quite generally in early writings to 
indicate any subject of instruction or study. As learning ad- 
A Sense of Number vanced, it was found convenient to restrict the scope of this 


term to particular elds of knowledge. The Pythagoreans 
are said to have used it to describe arithmetic and geometry; previously, each of these 
subjects had been called by its separate name, with no designation common to both. The 
Pythagoreans’ use of the name would perhaps be a basis for the notion that mathematics 
began in Classical Greece during the years from 600 to 300 B.c. But its history can be 
followed much further back. Three or four thousand years ago, in ancient Egypt and 
Babylonia, there already existed a signi cant body of knowledge that we should describe 
as mathematics. If we take the broad view that mathematics involves the study of issues 
of a quantitative or spatial nature—number, size, order, and form—it is an activity that 
has been present from the earliest days of human experience. In every time and culture, 
there have been people with a compelling desire to comprehend and master the form of 
the natural world around them. To use Alexander Pope’s words, “This mighty maze is 
not without a plan.” 

It is commonly accepted that mathematics originated with the practical problems of 
counting and recording numbers. The birth of the idea of number is so hidden behind 
the veil of countless ages that it is tantalizing to speculate on the remaining evidences of 
early humans’ sense of number. Our remote ancestors of some 20,000 years ago—who 
were quite as clever as we are—must have felt the need to enumerate their livestock, 
tally objects for barter, or mark the passage of days. But the evolution of counting, with 
its spoken number words and written number symbols, was gradual and does not allow 
any determination of precise dates for its stages. 

Anthropologists tell us that there has hardly been a culture, however primitive, that 
has not had some awareness of number, though it might have been as rudimentary as 
the distinction between one and two. Certain Australian aboriginal tribes, for instance, 
counted to two only, with any number larger than two called simply “much” or “many.” 
South American Indians along the tributaries of the Amazon were equally destitute of 
number words. Although they ventured further than the aborigines in being able to count 
to six, they had no independent number names for groups of three, four, ve, or six. In 
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their counting vocabulary, three was called “two-one,” four was “two-two,” and so on. 
A similar system has been reported for the Bushmen of South Africa, who counted to 
ten 10 =2+2+2+2+42) with just two words; beyond ten, the descriptive phrases 
became too long. It is notable that such tribal groups would not willingly trade, say, 
two cows for four pigs, yet had no hesitation in exchanging one cow for two pigs and a 
second cow for another two pigs. 

The earliest and most immediate technique for visibly expressing the idea of number 
is tallying. The idea in tallying is to match the collection to be counted with some easily 
employed set of objects—in the case of our early forebears, these were ngers, shells, 
or stones. Sheep, for instance, could be counted by driving them one by one through 
a narrow passage while dropping a pebble for each. As the ock was gathered in for 
the night, the pebbles were moved from one pile to another until all the sheep had 
been accounted for. On the occasion of a victory, a treaty, or the founding of a village, 
frequently a cairn, or pillar of stones, was erected with one stone for each person present. 

The term tally comes from the French verb tailler, “to cut,” like the English word 
tailor; the root is seen in the Latin faliare, meaning “to cut.” It is also interesting to note 
that the English word write can be traced to the Anglo-Saxon writan, “to scratch,” or 
“to notch.” 

Neither the spoken numbers nor nger tallying have any permanence, although nger 
counting shares the visual quality of written numerals. To preserve the record of any 
count, it was necessary to have other representations. We should recognize as human 
intellectual progress the idea of making a correspondence between the events or objects 
recorded and a series of marks on some suitably permanent material, with one mark 
representing each individual item. The change from counting by assembling collections 
of physical objects to counting by making collections of marks on one object is a long 
step, not only toward abstract number concept, but also toward written communication. 

Counts were maintained by making scratches on stones, by cutting notches in wooden 
sticks or pieces of bone, or by tying knots in strings of different colors or lengths. When 
the numbers of tally marks became too unwieldy to visualize, primitive people arranged 
them in easily recognizable groups such as groups of 5, for the ngers of a hand. It 
is likely that grouping by pairs came rst, soon abandoned in favor of groups of 5, 
10, or 20. The organization of counting by groups was a noteworthy improvement on 
counting by ones. The practice of counting by ves, say, shows a tentative sort of 
progress toward reaching an abstract concept of “ ve” as contrasted with the descriptive 
ideas “ve ngers” or “ve days.” To be sure, it was a timid step in the long journey 
toward detaching the number sequence from the objects being counted. 


Notches as Tally Marks 


Bone artifacts bearing incised markings seem to indicate that the people of the Old 
Stone Age had devised a system of tallying by groups as early as 30,000 B.c. The most 
impressive example is a shinbone from a young wolf, found in Czechoslovakia in 1937; 
about 7 inches long, the bone is engraved with 55 deeply cut notches, more or less equal 
in length, arranged in groups of ve. (Similar recording notations are still used, with 
the strokes bundled in ves, like JT. Voting results in small towns are still counted in 
the manner devised by our remote ancestors.) For many years such notched bones were 
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interpreted as hunting tallies and the incisions were thought to represent kills. A more 
recent theory, however, is that the rst recordings of ancient people were concerned with 
reckoning time. The markings on bones discovered in French cave sites in the late 1880s 
are grouped in sequences of recurring numbers that agree with the numbers of days 
included in successive phases of the moon. One might argue that these incised bones 
represent lunar calendars. 

Another arresting example of an incised bone was unearthed at Ishango along the 
shores of Lake Edward, one of the headwater sources of the Nile. The best archeological 
and geological evidence dates the site to 17,500 B.c., or some 12,000 years before the 
rst settled agrarian communities appeared in the Nile valley. This fossil fragment was 
probably the handle of a tool used for engraving, or tattooing, or even writing in some 
way. It contains groups of notches arranged in three de nite columns; the odd, unbalanced 
composition does not seem to be decorative. In one of the columns, the groups are 
composed of 11, 21, 19, and 9 notches. The underlying pattern may be 10+ 1, 20+ 
1, 20—1, and 10—1. The notches in another column occur in eight groups, in the 
following order: 3, 6, 4, 8, 10, 5, 5, 7. This arrangement seems to suggest an appreciation 
of the concept of duplication, or multiplying by 2. The last column has four groups 
consisting of 11, 13, 17, and 19 individual notches. The pattern here may be fortuitous 
and does not necessarily indicate—as some authorities are wont to infer—a familiarity 
with prime numbers. Because 11 + 13+17+19=60 and 114+21+19+9= 60, it 
might be argued that markings on the prehistoric Ishango bone are related to a lunar 
count, with the rst and third columns indicating two lunar months. 

The use of tally marks to record counts was prominent among the prehistoric peoples 
of the Near East. Archaeological excavations have unearthed a large number of small 
clay objects that had been hardened by re to make them more durable. These handmade 
artifacts occur in a variety of geometric shapes, the most common being circular disks, 
triangles, and cones. The oldest, dating to about 8000 B.c., are incised with sets of parallel 
lines on a plain surface; occasionally, there will be a cluster of circular impressions as if 
punched into the clay by the blunt end of a bone or stylus. Because they go back to the 
time when people rst adopted a settled agricultural life, it is believed that the objects are 
primitive reckoning devices; hence, they have become known as “counters” or “tokens.” 
It is quite likely also that the shapes represent different commodities. For instance, a 
token of a particular type might be used to indicate the number of animals in a herd, 
while one of another kind could count measures of grain. Over several millennia, tokens 
became increasingly complex, with diverse markings and new shapes. Eventually, there 
came to be 16 main forms of tokens. Many were perforated with small holes, allowing 
them to be strung together for safekeeping. The token system of recording information 
went out of favor around 3000 B.c., with the rapid adoption of writing on clay tablets. 

A method of tallying that has been used in many different times and places involves 
the notched stick. Although this device provided one of the earliest forms of keeping 
records, its use was by no means limited to “primitive peoples,” or for that matter, to 
the remote past. The acceptance of tally sticks as promissory notes or bills of exchange 
reached its highest level of development in the British Exchequer tallies, which formed an 
essential part of the government records from the twelfth century onward. In this instance, 
the tallies were at pieces of hazelwood about 6—9 inches long and up to an inch thick. 
Notches of varying sizes and types were cut in the tallies, each notch representing a xed 
amount of money. The width of the cut decided its value. For example, the notch of £1000 
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was as large as the width of a hand; for £100, as large as the thickness of a thumb; and 
for £20, the width of the little nger. When a loan was made the appropriate notches were 
cut and the stick split into two pieces so that the notches appeared in each section. The 
debtor kept one piece and the Exchequer kept the other, so the transaction could easily 
be veried by tting the two halves together and noticing whether the notches coincided 
(whence the expression “our accounts tallied”). Presumably, when the two halves had 
been matched, the Exchequer destroyed its section—either by burning it or by making 
it smooth again by cutting off the notches—but retained the debtor’s section for future 
record. Obstinate adherence to custom kept this wooden accounting system in of cial use 
long after the rise of banking institutions and modern numeration had made its practice 
quaintly obsolete. It took an act of Parliament, which went into effect in 1826, to abolish 
the practice. In 1834, when the long-accumulated tallies were burned in the furnaces that 
heated the House of Lords, the re got out of hand, starting a more general con agration 
that destroyed the old Houses of Parliament. 

The English language has taken note of the peculiar quality of the double tally stick. 
Formerly, if someone lent money to the Bank of England, the amount was cut on a 
tally stick, which was then split. The piece retained by the bank was known as the foil, 
whereas the other half, known as the stock, was given the lender as a receipt for the sum 
of money paid in. Thus, he became a “stockholder” and owned “bank stock” having the 
same worth as paper money issued by the government. When the holder would return, 
the stock was carefully checked and compared against the foil in the bank’s possession; 
if they agreed, the owner’s piece would be redeemed in currency. Hence, a written 
certi cate that was presented for remittance and checked against its security later came 
to be called a “check.” 

Using wooden tallies for records of obligations was common in most European 
countries and continued there until fairly recently. Early in this century, for instance, 
in some remote valleys of Switzerland, “milk sticks” provided evidence of transactions 
among farmers who owned cows in a common herd. Each day the chief herdsman would 
carve a six- or seven-sided rod of ashwood, coloring it with red chalk so that incised 
lines would stand out vividly. Below the personal symbol of each farmer, the herdsman 
marked off the amounts of milk, butter, and cheese yielded by a farmer’s cows. Every 
Sunday after church, all parties would meet and settle the accounts. Tally sticks—in 
particular, double tallies—were recognized as legally valid documents until well into the 
1800s. France’s rst modern code of law, the Code Civil, promulgated by Napoleon in 
1804, contained the provision: 


The tally sticks which match their stocks have the force of contracts between persons who 
are accustomed to declare in this manner the deliveries they have made or received. 


The variety in practical methods of tallying is so great that giving any detailed 
account would be impossible here. But the procedure of counting both days and objects 
by means of knots tied in cords has such a long tradition that it is worth mentioning. The 
device was frequently used in ancient Greece, and we nd reference to it in the work of 
Herodotus ( fth century B.c.). Commenting in his History, he informs us that the Persian 
king Darius handed the Ionians a knotted cord to serve as a calendar: 


The King took a leather thong and tying sixty knots in it called together the Ionian tyrants 
and spoke thus to them: “Untie every day one of the knots; if I do not return before the 
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Three views of a Paleolithic wolfbone used for tallying. (The Illustrated London News 
Picture Library.) 
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last day to which the knots will hold out, then leave your station and return to your several 
homes.” 


The Peruvian Quipus: Knots as Numbers 


In the New World, the number string is best illustrated by the knotted cords, called 
quipus, of the Incas of Peru. They were originally a South American Indian tribe, or 
a collection of kindred tribes, living in the central Andean mountainous highlands. 
Through gradual expansion and warfare, they came to rule a vast empire consisting 
of the coastal and mountain regions of present-day Ecuador, Peru, Bolivia, and the 
northern parts of Chile and Argentina. The Incas became renowned for their engineering 
skills, constructing stone temples and public buildings of a great size. A striking 
accomplishment was their creation of a vast network (as much as 14,000 miles) of roads 
and bridges linking the far- ung parts of the empire. The isolation of the Incas from the 
horrors of the Spanish Conquest ended early in 1532 when 180 conquistadors landed in 
northern Peru. By the end of the year, the invaders had seized the capital city of Cuzco 
and imprisoned the emperor. The Spaniards imposed a way of life on the people that 
within about 40 years would destroy the Inca culture. 

When the Spanish conquerors arrived in the sixteenth century, they observed that 
each city in Peru had an “ofcial of the knots,” who maintained complex accounts by 
means of knots and loops in strands of various colors. Performing duties not unlike 
those of the city treasurer of today, the quipu keepers recorded all of cial transactions 
concerning the land and subjects of the city and submitted the strings to the central 
government in Cuzco. The quipus were important in the Inca Empire, because apart 
from these knots no system of writing was ever developed there. The quipu was made of 
a thick main cord or crossbar to which were attached ner cords of different lengths and 
colors; ordinarily the cords hung down like the strands of a mop. Each of the pendent 
strings represented a certain item to be tallied; one might be used to show the number 
of sheep, for instance, another for goats, and a third for lambs. The knots themselves 
indicated numbers, the values of which varied according to the type of knot used and its 
specic position on the strand. A decimal system was used, with the knot representing 
units placed nearest the bottom, the tens appearing immediately above, then the hundreds, 
and so on; absence of a knot denoted zero. Bunches of cords were tied off by a single 
main thread, a summation cord, whose knots gave the total count for each bunch. The 
range of possibilities for numerical representation in the quipus allowed the Incas to keep 
incredibly detailed administrative records, despite their ignorance of the written word. 
More recent (1872) evidence of knots as a counting device occurs in India; some of 
the Santal headsmen, being illiterate, made knots in strings of four different colors to 
maintain an up-to-date census. 

To appreciate the quipu fully, we should notice the numerical values represented by 
the tied knots. Just three types of knots were used: a_ gure-eight knot standing for 1, a 
long knot denoting one of the values 2 through 9, depending on the number of twists 
in the knot, and a single knot also indicating 1. The gure-eight knot and long knot 
appear only in the lowest (units) position on a cord, while clusters of single knots can 
appear in the other spaced positions. Because pendant cords have the same length, an 
empty position (a value of zero) would be apparent on comparison with adjacent cords. 
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Also, the reappearance of either a gure-eight or long knot would point out that another 
number is being recorded on the same cord. 

Recalling that ascending positions carry place value for successive powers of ten, let 
us suppose that a particular cord contains the following, in order: a long knot with four 
twists, two single knots, an empty space, seven clustered single knots, and one single 
knot. For the Inca, this array would represent the number 


17024 = 4+(2-10)+(0- 107) +(7- 10°) + (1 - 10°). 


Another New World culture that used a place value numeration system was that of 
the ancient Maya. The people occupied a broad expanse of territory embracing southern 
Mexico and parts of what is today Guatemala, El Salvador, and Honduras. The Mayan 
civilization existed for over 2000 years, with the time of its greatest owering being the 
period 300-900 a.p. A distinctive accomplishment was its development of an elaborate 
form of hieroglyphic writing using about 1000 glyphs. The glyphs are sometimes sound 
based and sometimes meaning based: the vast majority of those that have survived have 
yet to be deciphered. After 900 A.p., the Mayan civilization underwent a sudden decline— 
The Great Collapse—as its populous cities were abandoned. The cause of this catastrophic 
exodus is a continuing mystery, despite speculative explanations of natural disasters, 
epidemic diseases, and conquering warfare. What remained of the traditional culture did 
not succumb easily or quickly to the Spanish Conquest, which began shortly after 1500. 
It was a struggle of relentless brutality, stretching over nearly a century, before the last 
unconquered Mayan kingdom fell in 1597. 

The Mayan calendar year was composed of 365 days divided into 18 months of 20 
days each, with a residual period of 5 days. This led to the adoption of a counting system 
based on 20 (a vigesimal system). Numbers were expressed symbolically in two forms. 
The priestly class employed elaborate glyphs of grotesque faces of deities to indicate 
the numbers | through 19. These were used for dates carved in stone, commemorating 
notable events. The common people recorded the same numbers with combinations of 
bars and dots, where a short horizontal bar represented 5 and a dot 1. A particular feature 
was a stylized shell that served as a symbol for zero; this is the earliest known use of a 
mark for that number. 


e ee eee eeee 
0 1 2 4 

e e e eee @eeee 
5 7 9 

e e e eee eeee 
10 11 12 13 14 

e e e eee eeee 
15 16 17 18 19 


The symbols representing numbers larger than 19 were arranged in a vertical column 
with those in each position, moving upward, multiplied by successive powers of 20; that 
is, by 1, 20, 400, 8000, 160,000, and so on. A shell placed in a position would indicate 
the absence of bars and dots there. In particular, the number 20 was expressed by a shell 
at the bottom of the column and a single dot in the second position. For an example 
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Thirteenth-century British Exchequer tallies. (By courtesy of the Society of Antiquaries of 
London.) 


of a number recorded in this system, let us write the symbols horizontally rather than 
vertically, with the smallest value on the left: 


For us, this expression denotes the number 62808, for 


62808 = 8-14 0-204 17-400 +4 7 - 8000. 


Because the Mayan numeration system was developed primarily for calendar reckoning, 
there was a minor variation when carrying out such calculations. The symbol in the 
third position of the column was multiplied by 18-20 rather than by 20-20, the idea 
being that 360 was a better approximation to the length of the year than was 400. The 
place value of each position therefore increased by 20 times the one before; that is, the 
multiples are 1, 20, 360, 7200, 144,000, and so on. Under this adjustment, the value of 
the collection of symbols mentioned earlier would be 


56528 = 8-1+0-20+17-360+7- 7200. 


Over the long sweep of history, it seems clear that progress in devising ef cient 
ways of retaining and conveying numerical information did not take place until primitive 
people abandoned the nomadic life. Incised markings on bone or stone may have been 
adequate for keeping records when human beings were hunters and gatherers, but the food 
producer required entirely new forms of numerical representation. Besides, as a means 
for storing information, groups of markings on a bone would have been intelligible only 
to the person making them, or perhaps to close friends or relatives; thus, the record was 
probably not intended to be used by people separated by great distances. 

Deliberate cultivation of crops, particularly cereal grains, and the domestication of 
animals began, so far as can be judged from present evidence, in the Near East some 
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10,000 years ago. Later experiments in agriculture occurred in China and in the New 
World. A widely held theory is that a climatic change at the end of the last ice age 
provided the essential stimulus for the introduction of food production and a settled 
village existence. As the polar ice cap began to retreat, the rain belt moved northward, 
causing the desiccation of much of the Near East. The increasing scarcity of wild food 
plants and the game on which people had lived forced them, as a condition of survival, 
to change to an agricultural life. It became necessary to count one’s harvest and herd, to 
measure land, and to devise a calendar that would indicate the proper time to plant crops. 
Even at this stage, the need for a means of counting was modest; and tallying techniques, 
although slow and cumbersome, were still adequate for ordinary dealings. But with a 
more secure food supply came the possibility of a considerable increase in population, 
which meant that larger collections of objects had to be enumerated. Repetition of some 
fundamental mark to record a tally led to inconvenient numeral representations, tedious 
to compose and dif cult to interpret. The desire of village, temple, and palace of cials to 
maintain meticulous records (if only for the purposes of systematic taxation) gave further 
impetus to nding new and more re ned means of “ xing” a count in a permanent or 
semipermanent form. 

Thus, it was in the more elaborate life of those societies that rose to power some 
6000 years ago in the broad river valleys of the Nile, the Tigris-Euphrates, the Indus, 
and the Yangtze that special symbols for numbers rst appeared. From these, some of 
our most elementary branches of mathematics arose, because a symbolism that would 
allow expressing large numbers in written numerals was an essential prerequisite for 
computation and measurement. Through a welter of practical experience with number 
symbols, people gradually recognized certain abstract principles; for instance, it was 
discovered that in the fundamental operation of addition, the sum did not depend on the 
order of the summands. Such discoveries were hardly the work of a single individual, 
or even a single culture, but more a slow process of awareness moving toward an 
increasingly abstract way of thinking. 

We shall begin by considering the numeration systems of the important Near Eastern 
civilizations—the Egyptian and the Babylonian—from which sprang the main line of our 
own mathematical development. Number words are found among the word forms of 
the earliest extant writings of these people. Indeed, their use of symbols for numbers, 
detached from an association with the objects to be counted, was a big turning point 
in the history of civilization. It is more than likely to have been a_ rst step in the 
evolution of humans’ supreme intellectual achievement, the art of writing. Because the 
recording of quantities came more easily than the visual symbolization of speech, there 
is unmistakable evidence that the written languages of these ancient cultures grew out of 
their previously written number systems. 


The writing of history, as we understand it, 


Number Recording of the Egyptians is a Greek invention; and foremost among the 
and Greeks early Greek historians was Herodotus. Herodotus 


(circa 485—430 B.C.) was born at Halicarnassus, a 
The History of Herodotus largely Greek settlement on the southwest coast 
of Asia Minor. In early life, he was involved in 
political troubles in his home city and forced to ee in exile to the island of Samos, and 
thence to Athens. From there Herodotus set out on travels whose leisurely character and 
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broad extent indicate that they occupied many years. It is assumed that he made three 
principal journeys, perhaps as a merchant, collecting material and recording his impres- 
sions. In the Black Sea, he sailed all the way up the west coast to the Greek communities 
at the mouth of the Dnieper River, in what is now Ukraine, and then along the south 
coast to the foot of the Caucasus. In Asia Minor, he traversed modern Syria and Iraq 
and traveled down the Euphrates, possibly as far as Babylon. In Egypt, he ascended the 
Nile River from its delta to somewhere near Aswan, exploring the pyramids along the 
way. Around 443 B.c., Herodotus became a citizen of Thurium in southern Italy, a new 
colony planted under Athenian auspices. In Thurium, he seems to have passed the last 
years of his life involved almost entirely in nishing the History of Herodotus, a book 
larger than any Greek prose work before it. The reputation of Herodotus as a historian 
stood high even in his own day. In the absence of numerous copies of books, it is natural 
that a history, like other literary compositions, should have been read aloud at public 
and private gatherings. In Athens, some 20 years before his death, Herodotus recited 
completed portions of his History to admiring audiences and, we are told, was voted an 
unprecedentedly large sum of public money in recognition of the merit of his work. 

Although the story of the Persian Wars provides the connecting link in the History of 
Herodotus, the work is no mere chronicle of carefully recorded events. Almost anything 
that concerned people interested Herodotus, and his History is a vast store of information 
on all manner of details of daily life. He contrived to set before his compatriots a general 
picture of the known world, of its various peoples, of their lands and cities, and of what 
they did and above all why they did it. (A modern historian would probably describe the 
History as a guidebook containing useful sociological and anthropological data, instead 
of a work of history.) The object of his History, as Herodotus conceived it, required 
him to tell all he had heard but not necessarily to accept it all as fact. He atly stated, 
“My job is to report what people say, not to believe it all, and this principle is meant 
to apply to my whole work.” We nd him, accordingly, giving the traditional account 
of an occurrence and then offering his own interpretation or a contradictory one from a 
different source, leaving the reader to choose between versions. One point must be clear: 
Herodotus interpreted the state of the world at his time as a result of change in the past 
and felt that the change could be described. It is this attempt that earned for him, and 
not any of the earlier writers of prose, the honorable title “Father of History.” 

Herodotus took the trouble to describe Egypt at great length, for he seems to have 
been more enthusiastic about the Egyptians than about almost any other people that he 
met. Like most visitors to Egypt, he was distinctly aware of the exceptional nature of the 
climate and the topography along the Nile: “For anyone who sees Egypt, without having 
heard a word about it before, must perceive that Egypt is an acquired country, the gift 
of the river.” This famous passage—often paraphrased to read “Egypt is the gift of the 
Nile”—aptly sums up the great geographical fact about the country. In that sun-soaked, 
rainless climate, the river in over owing its banks each year regularly deposited the rich 
silt washed down from the East African highlands. To the extreme limits of the river’s 
waters there were fertile elds for crops and the pasturage of animals; and beyond that 
the barren desert frontiers stretched in all directions. This was the setting in which that 
literate, complex society known as Egyptian civilization developed. 

The emergence of one of the world’s earliest cultures was essentially a political act. 
Between 3500 and 3100 B.c., the self-sufcient agricultural communities that clung to 
the strip of land bordering the Nile had gradually coalesced into larger units until there 
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The habitable world according to Herodotus. (From Stories from Herodotus by B. Wilson 
and D. Miller. Reproduced by permission of Oxford University Press.) 


were only the two kingdoms of Upper Egypt and Lower Egypt. Then, about 3100 B.c., 
these regions were united by military conquest from the south by a ruler named Menes, an 
elusive gure who stepped forth into history to head the long line of pharaohs. Protected 
from external invasion by the same deserts that isolated her, Egypt was able to develop 
the most stable and longest-lasting of the ancient civilizations. Whereas Greece and 
Rome counted their supremacies by the century, Egypt counted hers by the millennium; 
a well-ordered succession of 32 dynasties stretched from the uni cation of the Upper 
and Lower Kingdoms by Menes to Cleopatra’s encounter with the asp in 31 B.c. Long 
after the apogee of Ancient Egypt, Napoleon was able to exhort his weary veterans with 
the glory of its past. Standing in the shadow of the Great Pyramid of Gizeh, he cried, 
“Soldiers, forty centuries are looking down upon you!” 


Hieroglyphic Representation of Numbers 


As soon as the uni cation of Egypt under a single leader became an accomplished 
fact, a powerful and extensive administrative system began to evolve. The census had 
to be taken, taxes imposed, an army maintained, and so forth, all of which required 
reckoning with relatively large numbers. (One of the years of the Second Dynasty was 
named Year of the Occurrence of the Numbering of all Large and Small Cattle of the 
North and South.) As early as 3500 B.c., the Egyptians had a fully developed number 
system that would allow counting to continue inde nitely with only the introduction from 
time to time of a new symbol. This is borne out by the macehead of King Narmer, one of 
the most remarkable relics of the ancient world, now in a museum at Oxford University. 
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This scene is taken from the great stone macehead of Narmer, which J. E. Quibell discovered at 
Hierakonpolis in 1898. There is a summary of the spoil taken by Narmer during his wars, namely 


“cows, 400,000, bry DBD D Lots, 1,422,000, fo i» DBP ) } 1, and 
captives, 120,000, h Db ) - 


Scene reproduced from the stone macehead of Narmer, giving a summary of the spoil taken by him 
during his wars. (From The Dwellers on the Nile by E. W. Budge, 1977, Dover Publications, N.Y.) 


Near the beginning of the dynastic age, Narmer (who, some authorities suppose, may have 
been the legendary Menes, the rst ruler of the united Egyptian nation) was obliged to 
punish the rebellious Libyans in the western Delta. He left in the temple at Hierakonpolis 
a magnicent slate palette—the famous Narmer Palette—and a ceremonial macehead, 
both of which bear scenes testifying to his victory. The macehead preserves forever the 
of cial record of the king’s accomplishment, for the inscription boasts of the taking of 
120,000 prisoners and a register of captive animals, 400,000 oxen and 1,422,000 goats. 
Another example of the recording of very large numbers at an early stage occurs in 
the Book of the Dead, a collection of religious and magical texts whose principle aim was 
to secure for the deceased a satisfactory afterlife. In one section, which is believed to date 
from the First Dynasty, we read (the Egyptian god Nu is speaking): “I work for you, 0 ye 
spirits, we are in number four millions, six hundred and one thousand, and two hundred.” 
The spectacular emergence of the Egyptian government and administration under 
the pharaohs of the rst two dynasties could not have taken place without a method of 
writing, and we nd such a method both in the elaborate “sacred signs,” or hieroglyphics, 
and in the rapid cursive hand of the accounting scribe. The hieroglyphic system of writing 
is a picture script, in which each character represents a concrete object, the signi cance 
of which may still be recognizable in many cases. In one of the tombs near the Pyramid 
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of Gizeh there have been found hieroglyphic number symbols in which the number one 
is represented by a single vertical stroke, or a picture of a staff, and a kind of horseshoe, 
or heelbone sign M is used as a collective symbol to replace ten separate strokes. In other 
words, the Egyptian system was a decimal one (from the Latin decem, “ten’”), which 
used counting by powers of 10. That 10 is so often found among ancient peoples as a 
base for their number systems is undoubtedly attributable to humans’ ten ngers and to 
our habit of counting on them. For the same reason, a symbol much like our numeral 1 
was almost everywhere used to express the number one. 

Special pictographs were used for each new power of 10 up to 10,000,000: 100 by 
a curved rope, 1000 by a lotus ower, 10,000 by an upright bent nger, 100,000 by a 
tadpole, 1,000,000 by a person holding up two hands as if in great astonishment, and 
10,000,000 by a symbol sometimes conjectured to be a rising sun. 


1 10 100 1000 10,000 100,000 1,000,000 10,000,000 


Po 9 FF Gp y @ 


Other numbers could be expressed by using these symbols additively (that is, the number 
represented by a set of symbols is the sum of the numbers represented by the individual 
symbols), with each character repeated up to nine times. Usually, the direction of writing 
was from right to left, with the larger units listed rst, then the others in order of 
importance. Thus, the scribe would write 


Moor 9 REIL 
[|| [ 


to indicate our number 
1- 100,000 + 4- 10,000 + 2- 1000+ 1-1004+3-104+6-1= 142,136. 


Occasionally, the larger units were written on the left, in which case the symbols were 
turned around to face the direction from which the writing began. Lateral space was 
saved by placing the symbols in two or three rows, one above the other. Because there 
was a different symbol for each power of 10, the value of the number represented was 
not affected by the order of the hieroglyphs within a grouping. For example, 


]92n 998 ier nnn 998 | 


all stood for the number 1232. Thus the Egyptian method of writing numbers was not 
a “positional system’”—a system in which one and the same symbol has a different 
signi cance depending on its position in the numerical representation. 

Addition and subtraction caused little dif culty in the Egyptian number system. For 
addition, it was necessary only to collect symbols and exchange ten like symbols for the 
next higher symbol. This is how the Egyptians would have added, say, 345 and 678: 
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345 
oe Wi “ant 999 
1023 11 n 


(i mann 999 
itl) oann 999 


iit ° mann 9999 
Hii) onnnn 9999 
a nnn 9 
This converted would be 
nit Sn Sg 999 
7999 


and converted again, 


Vu nn x 


Subtraction was performed by the same process in reverse. Sometimes “borrowing” was 
used, wherein a symbol for the large number was exchanged for ten lower-order symbols 
to provide enough for the smaller number to be subtracted, as in the case 


123 
=45 
= Wooan 9 
WWI ann 
\\ a 
which, converted, would be 
WV nnnn 
hui nnnn 
Veil nnn 
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Although the Egyptians had symbols for numbers, they had no generally uniform nota- 
tion for arithmetical operations. In the case of the famous Rhind Papyrus (dating about 
1650 B.c.), the scribe did represent addition and subtraction by the hieroglyphs //\ and 
/\, which resemble the legs of a person coming and going. 


Egyptian Hieratic Numeration 


As long as writing was restricted to inscriptions carved on stone or metal, its scope 
was limited to short records deemed to be outstandingly important. What was needed 
was an easily available, inexpensive material to write on. The Egyptians solved this 
problem with the invention of papyrus. Papyrus was made by cutting thin lengthwise 
strips of the stem of the reedlike papyrus plant, which was abundant in the Nile Delta 
marshes. The sections were placed side by side on a board so as to form a sheet, and 
another layer was added at right angles to the rst. When these were all soaked in water, 
pounded with a mallet, and allowed to dry in the sun, the natural gum of the plant glued 
the sections together. The writing surface was then scraped smooth with a shell until a 
nished sheet (usually 10 to 18 inches wide) resembled coarse brown paper; by pasting 
these sheets together along overlapping edges, the Egyptians could produce strips up to 
100 feet long, which were rolled up when not in use. They wrote with a brushlike pen, 
and ink made of colored earth or charcoal that was mixed with gum or water. Thanks 
not so much to the durability of papyrus as to the exceedingly dry climate of Egypt, 
which prevented mold and mildew, a sizable body of scrolls has been preserved for us 
in a condition otherwise impossible. 

With the introduction of papyrus, further steps in simplifying writing were almost 
inevitable. The rst steps were made largely by the Egyptian priests who developed a 
more rapid, less pictorial style that was better adapted to pen and ink. In this so-called 
“hieratic” (sacred) script, the symbols were written in a cursive, or free-running, hand 
so that at rst sight their forms bore little resemblance to the old hieroglyphs. It can be 
said to correspond to our handwriting as hieroglyphics corresponds to our print. As time 
passed and writing came into general use, even the hieratic proved to be too slow and a 
kind of shorthand known as “demotic” (popular) script arose. Hieratic writing is child’s 
play compared with demotic, which at its worst consists of row upon row of agitated 
commas, each representing a totally different sign. 

In both of these writing forms, numerical representation was still additive, based on 
powers of 10; but the repetitive principle of hieroglyphics was replaced by the device 
of using a single mark to represent a collection of like symbols. This type of notation 
may be called “cipherization.” Five, for instance, was assigned the distinctive mark > 
instead of being indicated by a group of ve vertical strokes. 


1 2 3 4 5 6 7 8 9 10 
Lo = 2 eae 
20 30 40 50 60 70 80 90 100 1000 
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The hieratic system used to represent numbers is as shown in the preceeding table. 
Note that the signs for 1, 10, 100, and 1000 are essentially abbreviations for the pic- 
tographs used earlier. In hieroglyphics, the number 37 had appeared as 


aan MIN 


but in hieratic script it is replaced by the less cumbersome 


NZ 


The larger number of symbols called for in this notation imposed an annoying tax on 
the memory, but the Egyptian scribes no doubt regarded this as justied by its speed 
and conciseness. The idea of ciphering is one of the decisive steps in the development 
of numeration, comparable in signi cance to the Babylonian adoption of the positional 
principle. 


The Greek Alphabetic Numeral System 


Around the fth century B.C., the Greeks of Ionia also developed a ciphered numeral 
system, but with a more extensive set of symbols to be memorized. They ciphered their 
numbers by means of the 24 letters of the ordinary Greek alphabet, augmented by three 
obsolete Phoenician letters (the digamma > for 6, the koppa T for 90, and the sampi 
‘A. for 900). The resulting 27 letters were used as follows. The initial nine letters were 
associated with the numbers from | to 9; the next nine letters represented the rst nine 
integral multiples of 10; the nal nine letters were used for the rst nine integral multiples 
of 100. The following table shows how the letters of the alphabet (including the special 
forms) were arranged for use as numerals. 


la 10¢ 100 p 
2B 20k 200 o 
3y 30 A 300 t 
4§ 40 w 400 v 
5¢é 50 v 500 @ 
695 60 600 y 
Te 70 o 700 yw 
8 n 80 2 800 w 
96 90 900 A 


Because the Ionic system was still a system of additive type, all numbers between | and 
999 could be represented by at most three symbols. The principle is shown by 


wd = 700 + 80 + 4 = 784. 


For larger numbers, the following scheme was used. An accent mark placed to the 
left and below the appropriate unit letter multiplied the corresponding number by 1000; 
thus ,6 represents not 2 but 2000. Tens of thousands were indicated by using a new letter 
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M, from the word myriad (meaning “ten thousand’). The letter M placed either next to 
or below the symbols for a number from 1 to 9999 caused the number to be multiplied 
by 10,000, as with 


56M, or M = 40,000, 
pvM, or M = 1,500,000. 
With these conventions, the Greeks wrote 
tTuEeM ,Bpud = 3,452,144. 


To express still larger numbers, powers of 10,000 were used, the double myriad MM 
denoting (10,000), and so on. 

The symbols were always arranged in the same order, from the highest multiple of 
10 on the left to the lowest on the right, so accent marks sometimes could be omitted 
when the context was clear. The use of the same letter for thousands and units, as in 


560165 = 4234, 


gave the left-hand letter a local place value. To distinguish the numerical meaning of 
letters from their ordinary use in language, the Greeks added an accent at the end or a 
bar extended over them; thus, the number 1085 might appear as 


ye’ «Or aire. 
The system as a whole afforded much economy of writing (whereas the Greek alphabetic 


numerical for 900 is a single letter, the Egyptians had to use the symbol y nine times), 
but it required the mastery of numerous signs. 

Multiplication in Greek alphabetic numerals was performed by beginning with the 
highest order in each factor and forming a sum of partial products. Let us calculate, for 
example, 24 x 53: 


K 6 24 
vy x 53 

ya & 1000 60 
o ip 200 12 


;ao op 1200 72 = 1272 


The idea in multiplying numbers consisting of more than one letter was to write each 
number as a sum of numbers represented by a single letter. Thus, the Greeks began by cal- 
culating 20 x 50 (x by v), then proceeded to 20 x 3 (x by y), then 4 x 50 (6 by v), and - 
nally 4 x 3 (6 by y). This method, called Greek multiplication, corresponds to the modern 
computation 


24 x 53 = (20 + 4)(50 + 3) 
= 20-50+20-34+4-504+4-3 
= 107%. 


The numerical connection in these products is not evident in the letter products, which 
necessitated elaborate multiplication tables. The Greeks had 27 symbols to multiply 
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by each other, so they were obliged to keep track of 729 entirely separate answers. 
The same multiplicity of symbols tended to hide simple relations among numbers; 
where we recognize an even number by its ending in 0, 2, 4, 6, and 8, any one of 
the 27 Greek letters (possibly modied by an accent mark) could represent an even 
number. 

An incidental objection raised against the alphabetic notation is that the juxtaposition 
of words and number expressions using the same symbols led to a form of number 
mysticism known as “gematria.” In gematria, a number is assigned to each letter of the 
alphabet in some way, and the value of a word is the sum of the numbers represented 
by its letters. Two words are then considered somehow related if they add up to the 
same number. This gave rise to the practice of giving names cryptically by citing their 
individual numbers. The most famous number was 666, the “number of the Beast,” 
mentioned in the Bible in the Book of Revelation. (It is probable that it referred to Nero 
Caesar, whose name has this value when written in Hebrew.) A favorite pastime among 
Catholic theologians during the Reformation was devising alphabet schemes in which 
666 was shown to stand for the name Martin Luther, thereby supporting their contention 
that he was the Antichrist. Luther replied in kind; he concocted a system in which 666 
forecast the duration of the papal reign and rejoiced that it was nearing an end. Readers 
of Tolstoy’s War and Peace may recall that “L’Empereur Napoleon” can also be made 
equivalent to the number of the Beast. 

Another number replacement that occurs in early theological writings concerns the 
word amen, which is anv in Greek. These letters have the numerical values 


A(a)=1, M(u)=40, E(m)=8, N(v)=50, 


totaling 99. Thus, in many old editions of the Bible, the number 99 appears at the end 
of a prayer as a substitute for amen. An interesting illustration of gematria is also found 
in the graf ti of Pompeii: “I love her whose number is 545.” 


a 


1. Express each of the given numbers in Egyptian (d) : 7 n - S o) g wy. 


hieroglyphics. 
(a) 1492. 


(b) 1999. 
(c) 12,321. 


(d) 70,807. 
123,456. 3. Perform the indicated operations and express the 
(f) 3,040,279. answers in hieroglyphics. 


2. Write each of these Egyptian numbers in our 


system. 


(a) l 
| 


@) Addis 99 BY 


fanene) 9 and 
sue ee) I 299 ¥1 


0) non [ff C. ) = a 


IH 
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mt a OS (Ef, 
I] f 


(c) Subtract | | | on 9 
I] 


from | | ae 99 


(d) Subtract {{/} PON 999 
| n 


from | | ger 


. Multiply the number below by M (10), expressing the 


result in hieroglyphics. 


eee 


Describe a simple rule for multiplying any Egyptian 
number by 10. 


. Write the Ionian Greek numerals corresponding to 


(a) 396. (d) 24,789. 
(b) 1492. (e) 123,456. 
(c) 1999, (f) 1,234,567. 


. Convert each of these from Ionian Greek numerals to 


our system. 
(a) ,aodd. (c) M /EQVE. 
(b) ,Ba. (d) Oumrm BX LS. 


. Perform the indicated operations, 


(a) Add vé and doy. 

(b) Add oAB and Awa. 

(c) Subtract xwé from ,y dip. 
(d) Multiply ove by 6. 


. Another system of number symbols the Greeks used 
from about 450 to 85 B.C. is known as the “Attic” 
or “Herodianic” (after Herodian, a Byzantine 
grammarian of the second century, who described it). 
In this system, the initial letters of the words for 5 
and the powers of 10 are used to represent the 
corresponding numbers; these are 


10. 
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[" the initial letter of penta, meaning “ ve.” 
A the initial letter of deka, meaning “ten.” 
H the initial letter of hekaton, meaning 
“hundred.” 
x the initial letter of kilo, meaning “thousand.” 
the initial letter of myriad, meaning “ten 
thousand.” 


The letter denoting 5 was combined with other letters 
to get intermediate symbols for 50, 500, 5000, and 
50,000: 


1 5 10 50 100 500 
1000 5000 10,000 


X FM PF 


Other numbers were made up on an additive basis, 
with higher units coming before lower. Thus each 
symbol was repeated not more than four times. An 
example in this numeration system is 


MIF X FAA II 


= 10,000 + 5000 + 1000 + 50 + 20 + 3 


50,000 


= 16,073. 

Write the Attic Greek numerals corresponding to 
(a) 386. (d) 24,789. 

(b) 1492. (e) 74,802. 

(c) 1999, (f) 123,456. 


Convert these from Greek Attic numerals to our 
system. 


@ XX PHHPEL 
0) MFPAAAT IIL 
i) XX PRRRAPAAL 
@ P/MMMPAIL 


Perform the indicated operations and express the 
answers in Attic numerals. 


x PAPI 


and HHPAAAIIIL 


(a) Add 


20 


11. 


A 


(b) Add [ XXXXHHHAAI 
and PHAPAL 

(c) Subtract [> HP#AAAI! 
fom =X X |, 

(d) Multiply HA|| 
by = AAT" 


The Roman numerals, still used for such decorative 
purposes as clock faces and monuments, are 
patterned on the Greek Attic system in having letters 
as symbols for certain multiples of 5 as well as for 
numbers that are powers of 10. The primary symbols 
with their values are 


I Vv Xx L C D M 
1 5 10 50 100 500 1000 


The Roman numeration system is essentially 
additive, with certain subtractive and multiplicative 
features. If the symbols decrease in value from left to 
right, their values are added, as in the example 


MDCCCXXVIII = 1000 + 500 + 300 
+204+5+3 = 1828. 


The representation of numbers that involve 4s and 9s 
is shortened by using a subtractive principle whereby 
a letter for a small unit placed before a unit of higher 
value indicates that the smaller is to be subtracted 
from the larger. For instance, 


CDXCV = (500 — 100) + (100 — 10) + 5 
= 495. 
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Babylonian Cuneiform Script 


Early Number Systems and Symbols 


(This scheme incorporates features of a positional 
system, because IV = 4, whereas VI = 6.) However, 
there were de nite rules: 


I could precede only V or X. 
X could precede only L or C. 
C could precede only D or M. 


In place of new symbols for large numbers, a 
multiplicative device was introduced; a bar drawn 
over the entire symbol multiplied the corresponding 
number by 1000, whereas a double bar meant 
multiplication by 10007. Thus 


XV = 15,000 and XV = 15,000,000. 


Write the Roman numerals corresponding to 


(a) 1492. (d) 74,802. 
(b) 1066. (e) 123,456. 
(c) 1999. (f) 3,040,279. 


Convert each of these from Roman numerals into our 
system. 


(a) CXXIV. (@) DCCIIOCRVE, 
(b) MDLXI. (e) XIX. 


(c) MDCCXLVII. (f) XCXXV. 


Perform the indicated operations and express the 
answers in Roman numerals. 


(a) Add CM and XIX. 

(b) Add MMCLXI and MDCXX. 
(c) Add XXIV and XLVI. 

(d) Subtract XXIII from XXX. 
(e) Subtract CLXI from CCLII. 
(f) Multiply XXXIV by XVI. 


Besides the Egyptian, another culture of an- 
tiquity that exerted a marked in uence on the 
development of mathematics was the Baby- 
lonian. Here the term “Babylonian” is used 
without chronological restrictions to refer to 


those peoples who, many thousands of years ago, occupied the alluvial plain between the 
twin rivers, the Tigris and the Euphrates. The Greeks called this land “Mesopotamia,” 
meaning “the land between the rivers.” Most of it today is part of the modern state of 
Iraq, although both the Tigris and the Euphrates rise in Turkey. Humans stepped over 
the threshold of civilization in this region—and more especially in the lowland marshes 
near the Persian Gulf—about the same time that humans did in Egypt, that is, about 
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3500 B.c. or possibly a little earlier. Although the deserts surrounding Egypt successfully 
protected it against invasions, the open plains of the Tigris-Euphrates valley made it less 
defensible. The early history of Mesopotamia is largely the story of incessant invaders 
who, attracted by the richness of the land, conquered their decadent predecessors, ab- 
sorbed their culture, and then settled into a placid enjoyment of wealth until they were 
themselves overcome by the next wave of intruders. 

Shortly after 3000 B.c., the Babylonians developed a system of writing from 
“pictographs”—a kind of picture writing much like hieroglyphics. But the materials 
chosen for writing imposed special limitations of their own, which soon robbed the pic- 
tographs of any resemblance to the objects they stood for. Whereas the Egyptians used 
pen and ink to keep their records, the Babylonians used _ rst a reed and later a stylus 
with a triangular end. With this they made impressions (rather than scratches) in moist 
clay. Clay dries quickly, so documents had to be relatively short and written all at one 
time, but they were virtually indestructible when baked hard in an oven or by the heat of 
the sun. (Contrast this with the Chinese method, which involved more perishable writing 
material such as bark or bamboo and did not allow keeping permanent evidence of the 
culture’s early attainments.) The sharp edge of a stylus made a vertical stroke () and 
the base made a more or less deep impression (a), so that the combined effect was a 
head-and-tail gure resembling a wedge, or nail (Y). Because the Latin word for “wedge” 
is cuneus, the resulting style of writing has become known as “cuneiform.” 

Cuneiform script was a natural consequence of the choice of clay as a writing 
medium. The stylus did not allow for drawing curved lines, so all pictographic symbols 
had to be composed of wedges oriented in different ways: vertical (Y), horizontal (»—), 
and oblique (4 or ). Another wedge was later added to these three types; it looked 
something like an angle bracket opening to the right (<) and was made by holding the 
stylus so that its sides were inclined to the clay tablet. These four types of wedges had to 
serve for all drawings, because executing others was considered too tiresome for the hand 
or too time-consuming. Unlike hieroglyphics, which remained a picture writing until near 
the end of Egyptian civilization, cuneiform characters were gradually simpli ed until the 
pictographic originals were no longer apparent. The nearest the Babylonians could get to 


the old circle <> representing the sun was aS which was later condensed still further 


to ay, Similarly, the symbol for a sh, which began as ¥ ended up as 1 The net 
effect of cuneiform script seems, to the uninitiated, “like bird tracks in wet sand.” Only 
within the last two centuries has anyone known what the many extant cuneiform writings 
meant, and indeed whether they were writing or simply decoration. 


Deciphering Cuneiform: Grotefend and Rawlinson 


Because there were no colossal temples or monuments to capture the archeological 
imagination (the land is practically devoid of building stone), excavation came later 
to this part of the ancient world than to Egypt. It is estimated that today there are at 
least 400,000 Babylonian clay tablets, generally the size of a hand, scattered among the 
museums of various countries. Of these, some 400 tablets or tablet fragments have been 
identied as having mathematical content. Their decipherment and interpretation have 
gone slowly, owing to the variety of dialects and natural modi cations in the language 
over the intervening several thousand years. 
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The initial step was taken by an obscure German schoolteacher, Georg Friedrich 
Grotefend (1775-1853), of Géttingen, who although well versed in classical Greek, was 
absolutely ignorant of Oriental languages. While drinking with friends, Grotefend wa- 
gered that he could decipher a certain cuneiform inscription from Persepolis provided 
that they would supply him with the previously published literature on the subject. By 
an inspired guess he found the key to reading Persian cuneiform. The prevailing arrange- 
ment of the characters was such that the points of the wedges headed either downward 
or to the right, and the angles formed by the broad wedges consistently opened to the 
right. He assumed that the language’s characters were alphabetic; he then began picking 
out those characters that occurred with the greatest frequency and postulated that these 
were vowels. The most recurrent sign group was assumed to represent the word for 
“king.” These suppositions allowed Grotefend to decipher the title “King of Kings” and 
the names Darius, Xerxes, and Hystapes. Thereafter, he was able to isolate a great many 
individual characters and to read 12 of them correctly. Grotefend thus produced a trans- 
lation that, although it contained numerous errors, gave an adequate idea of the contents. 
In 1802, when Grotefend was only 27 years old, he had his investigations presented to 
the Academy of Science in Gottingen (Grotefend was not allowed to read his own paper). 
But the achievements of this little-known scholar, who neither belonged to the faculty 
of the university nor was even an Orientalist by profession, only evoked ridicule from 
the learned body. Buried in an obscure publication, Grotefend’s brilliant discovery fell 
into oblivion, and decades later cuneiform script had to be deciphered anew. It is one of 
the whims of history that Champollion, the original translator of hieroglyphics, won an 
international reputation, while Georg Grotefend is almost entirely ignored. 

Few chapters in the discovery of the ancient world can rival for interest the copying 
of the monumental rock inscriptions at Behistun by Henry Creswicke Rawlinson 
(1810-1895). Rawlinson, who was an of cer in the Indian Army, became interested in 
cuneiform inscriptions when posted to Persia in 1835 as an advisor to the shah’s troops. 
He learned the language and toured the country extensively, exploring its many antiqui- 
ties. Rawlinson’s attention was soon turned to Behistun, where a towering rock cliff, the 
“Mountain of the Gods,” rises dramatically above an ancient caravan road to Babylon. 
There, in 516 B.c., Darius the Great caused a lasting monument to his accomplishments 
to be engraved on a specially prepared surface measuring 150 feet by 100 feet. The 
inscription is written in thirteen panels in three languages—Old Persian, Elamite, and 
Akkadian (the language of the Babylonians)—all using a cuneiform script. Above the 
ve panels of Persian writing, the artists chiseled a life-size gure in relief of Darius 
receiving the submission of ten rebel leaders who had disputed his right to the throne. 

Although the Behistun Rock has been called by some the Mesopotamian Rosetta 
Stone, the designation is not entirely apt. The Greek text on the Rosetta Stone allowed 
Champollion to proceed from the known to the unknown, whereas all three passages of 
the Behistun trilingual were written in the same unknown cuneiform script. However, 
Old Persian, with its mainly alphabetic script limited to 43 signs, had been the subject of 
serious investigation since the beginning of the nineteenth century. This version of the 
text was ultimately to provide the key of admission into the whole cuneiform world. 

The rst difculty lay in copying the long inscription. It is cut 400 feet above the 
ground on the face of a rock mass that itself rises 1700 feet above the plain. Since the 
stone steps were destroyed after the sculptors nished their work, there was no means of 
ascent. Rawlinson had to construct enormous ladders to get to the inscription and at times 
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had to be suspended by block and tackle in front of the almost precipitous rock face. 
By the end of 1837, he had copied approximately half the 414 lines of Persian text; and 
using methods akin to those Grotefend worked out for himself 35 years earlier, he had 
translated the rst two paragraphs. Rawlinson’s goal was to transcribe every bit of the 
inscription on the Behistun Rock, but unfortunately war broke out between Great Britain 
and Afghanistan in 1839. Rawlinson was transferred to active duty in Afghanistan, where 
he was cut off by siege for the better part of the next two years. The year 1843 again found 
him back in Baghdad, this time as British consul, eager to continue to copy, decipher, and 
interpret the remainder of the Behistun inscription. His complete translation of the Old 
Persian part of the text, along with a copy of all the 263 lines of the Elamite, was published 
in 1846. Next he tackled the third class of cuneiform writing on the monument, the Baby- 
lonian, which was cut on two sides of a ponderous boulder overhanging the Elamite pan- 
els. Despite great danger to life and limb, Rawlinson obtained paper squeezes (casts) of 
112 lines. With the help of the already translated Persian text, which contained numerous 
proper names, he assigned correct values to a total of 246 characters. During this work, he 
discovered an important feature of Babylonian writing, the principle of “polyphony”; that 
is, the same sign could stand for different consonantal sounds, depending on the vowel 
that followed. Thanks to Rawlinson’s remarkable efforts, the cuneiform enigma was 
penetrated, and the vast records of Mesopotamian civilization were now an open book. 


The Babylonian Positional Number System 


From the exhaustive studies of the last half-century, it is apparent that Babylonian 
mathematics was far more highly developed than had hitherto been imagined. The Baby- 
lonians were the only pre-Grecian people who made even a partial use of a positional 
number system. Such systems are based on the notion of place value, in which the value 
of a symbol depends on the position it occupies in the numerical representation. Their 
immense advantage over other systems is that a limited set of symbols suf ces to express 
numbers, no matter how large or small. The Babylonian scale of enumeration was not 
decimal, but sexagesimal (60 as a base), so that every place a “digit” is moved to the left 
increases its value by a factor of 60. When whole numbers are represented in the sexa- 
gesimal system, the last space is reserved for the numbers from | to 59, the next-to-last 
space for the multiples of 60, preceded by multiples of 60°, and so on. For example, the 
Babylonian 3 25 4 might stand for the number 


3-607 + 25-60+4 = 12,304 
and not 
3-10°+25-10+4 = 3254, 


as in our decimal (base 10) system. 

The Babylonian use of the sexagesimal place-value notation was con rmed_ by two 
tablets found in 1854 at Senkerah on the Euphrates by the English geologist W. K. 
Loftus. These tablets, which probably date from the period of Hammurabi (2000 B.c.), 
give the squares of all integers from | to 59 and their cubes as far as that of 32. The 
tablet of squares reads easily up to 7°, or 49. Where we should expect to nd 64, the 
tablet gives 1 4; the only thing that makes sense is to let 1 stand for 60. Following 87, 
the value of 97 is listed as 1 21, implying again that the left digit must represent 60. The 
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same scheme is followed throughout the table until we come to the last entry, which is 
58 1; this cannot but mean 


58 1 = 58-604 1 = 3481 = 59’. 


The disadvantages of Egyptian hieroglyphic numeration are obvious. Representing even 
small numbers might necessitate relatively many symbols (to represent 999, no less than 
27 hieroglyphs were required); and with each new power of 10, a new symbol had to be 
invented. By contrast, the numerical notation of the Babylonians emphasized two-wedge 
characters. The simple upright wedge Y had the value 1 and could be used nine times, 
while the broad sideways wedge < stood for 10 and could be used up to ve times. The 
Babylonians, proceeding along the same lines as the Egyptians, made up all other numbers 
of combinations of these symbols, each represented as often as it was needed. When both 
symbols were used, those indicating tens appeared to the left of those for ones, as in 


44a qT 


Appropriate spacing between tight groups of symbols corresponded to descending 
powers of 60, read from left to right. As an illustration, we have 


Y << rr 44d T <4 
Tr << 


which could be interpreted as 1 - 60° + 28 - 607 + 52 - 60 + 20 = 319,940. The Babylo- 
nians occasionally relieved the awkwardness of their system by using a subtractive sign 


Yt permitted writing such numbers as 19 in the form 20 — 1, 


. Oem f 


instead of using a tens symbol followed by nine units: 


TTT 
S ani 
TTY 


Babylonian positional notation in its earliest development lent itself to con icting 
interpretations because there was no symbol for zero. There was no way to distinguish 
between the numbers 

1-60+24=84 and  1-60°+0-60+24 = 3624, 
since each was represented in cuneiform by 


14a! 


One could only rely on the context to relieve the ambiguity. A gap was often used to 
indicate that a whole sexagesimal place was missing, but this rule was not strictly applied 
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and confusion could result. Someone recopying the tablet might not notice the empty 
space, and would put the gures closer together, thereby altering the value of the number. 
(Only in a positional system must the existence of an empty space be speci ed, so the 
Egyptians did not encounter this problem.) From 300 B.c. on, a separate symbol 


a OD 


called a divider, was introduced to serve as a placeholder, thus indicating an empty space 
between two digits inside a number. With this, the number 84 was readily distinguishable 
from 3624, the latter being represented by 


ra aq 


The confusion was not ended, since the Babylonian divider was used only medially and 
there still existed no symbol to indicate the absence of a digit at the end of a number. 
About A.D. 150, the Alexandrian astronomer Ptolemy began using the omicron (o, the 
rst letter of the Greek ovdev, “nothing”), in the manner of our zero, not only in a 
medial but also in a terminal position. There is no evidence that Ptolemy regarded o as 
a number by itself that could enter into computation with other numbers. 

The absence of zero signs at the ends of numbers meant that there was no way of 
telling whether the lowest place was a unit, a multiple of 60 or 607, or even a multiple 


of 4. The value of the symbol 2 24 (in cuneiform, "7 44977) could be 
2-604 24 = 144. 
But other interpretations are possible, for instance, 
2. 60° + 24 - 60 = 8640, 


or if intended as a fraction, 


Thus, the Babylonians of antiquity never achieved an absolute positional system. 
Their numerical representation expressed the relative order of the digits, and context 
alone decided the magnitude of a sexagesimally written number; since the base was so 
large, it was usually evident what value was intended. To remedy this shortcoming, let us 
agree to use a semicolon to separate integers from fractions, while all other sexagesimal 
places will be separated from one another by commas. With this convention, 25,0,3;30 
and 25,0;3,30 will mean, respectively, 


30 
25- 60° +0-60+3+ = = 90,0035 


and 


3 30 j 
25-60+0+ + oo = 150075. 


Note that neither the semicolon nor the comma had any counterpart in the original 
cuneiform texts. 
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The question how the sexagesimal system originated was posed long ago and has 
received different answers over time. According to Theon of Alexandria, a commentator 
of the fourth century, 60 was among all the numbers the most convenient since it was the 
smallest among all those that had the most divisors, and hence the most easily handled. 
Theon’s point seemed to be that because 60 had a large number of proper divisors, 
namely, 2, 3, 4, 5, 6, 10, 12, 15, 20, and 30, certain useful fractions could be represented 
conveniently. The integers 30, 20 and 15 could represent 1/2, 1/3 and 1/4: 


eee | 
2 60 
i -2 
ee 
3 60 
1 1 
Be” Sos, 
4 60 


Fractions that had nonterminating sexagesimal expansions were approximated by nite 
ones, so that every number presented the form of an integer. The result was a simplicity of 
calculation that eluded the Egyptians, who reduced all their fractions to sums of fractions 
with numerator 1. 

Others attached a “natural” origin to the sexagesimal system; their theory was that 
the early Babylonians reckoned the year at 360 days, and a higher base of 360 was chosen 
rst, then lowered to 60. Perhaps the most satisfactory explanation is that it evolved from 
the merger between two peoples of whom one had adopted the decimal system, whereas 
the other brought with them a 6-system, affording the advantage of being divisible by 2 
and by 3. (The origin of the decimal system is not logical but anatomical; humans have 
been provided with a natural abacus—their ngers and toes.) 

The advantages of the Babylonian place-value system over the Egyptian additive 
computation with unit fractions were so apparent that this method became the principal 
instrument of calculation among astronomers. We see this numerical notation in full use 
in Ptolemy’s outstanding work, the Megale Syntaxis (The Great Collection). The Arabs 
later passed this on to the West under the curious name A/magest (The Greatest). The 
Almagest so overshadowed its predecessors that until the time of Copernicus, it was the 
fundamental textbook on astronomy. In one of the early chapters, Ptolemy announced 
that he would be carrying out all his calculations in the sexagesimal system to avoid “the 
embarrassment of [Egyptian] fractions.” 


Writing in Ancient China 


Our study of early mathematics is limited mostly to the peoples of Mediterranean 
antiquity, chie y the Greeks, and their debt to the Egyptians and the inhabitants of the 
Fertile Crescent. Nevertheless, some general comment is called for about the civilizations 
of the Far East, and especially about its oldest and most central civilization, that of China. 
Although Chinese society was no older than the other river valley civilizations of the 
ancient world, it ourished long before those of Greece and Rome. In the middle of 
the second millennium B.c., the Chinese were already keeping records of astronomical 
events on bone fragments, some of which are extant. Indeed, by 1400 B.c., the Chinese 
had a positional numeration system that used nine signs. 
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The scarcity of reliable sources of information almost completely seals from us the 
history of the ancient Orient. In India, no mathematical text exists that can be ascribed 
with any certainty to the pre-Christian era; and the rst rm date that can be connected 
with a Chinese work, namely, the Nine Chapters on the Mathematical Arts, is 150 B.C. 

Much of the difference in availability of sources of information is to be ascribed 
to differences in climate between the Near East and the Far East. The dry climate and 
soil of Egypt and Babylonia preserved materials that would long since have perished in 
more moist climates, materials that make it possible for us to trace the progress of these 
cultures from the barbarism of the remote past to the full ower of civilization. No other 
countries provide so rich a harvest of information about the origin and transmission of 
mathematics. “The Egyptians who lived in the cultivated part of the country,” wrote 
Herodotus in his History, “by their practice of keeping records of the past, have made 
themselves much the best historians of any nation that I have experienced.” 

If China had had Egypt’s climate, there is no question that many records would 
have survived from antiquity, each with its story to tell of the intellectual life of earlier 
generations. But the ancient Orient was a “bamboo civilization,” and among the manifold 
uses of this plant was making books. The small bamboo slips used were prepared by 
splitting the smooth section between two knots into thin strips, which were then dried 
over a re and scraped off. The narrowness of the bamboo strips made it necessary to 
arrange the written characters in vertical lines running from top to bottom, a practice that 
continues to this day. The opened, dried, and scraped strips of bamboo were laid side 
by side, joined, and kept in proper place by four crosswise cords. Naturally enough the 
joining cords often rotted and broke, with the result that the order of the slips was lost and 
could be reestablished only by a careful reading of the text. (Another material used about 
that time for writing was silk, which presumably came into use because bamboo books 
or wooden tablets were too heavy and cumbersome.) The great majority of these ancient 
books was irretrievably lost to the ravages of time and nature. Those few available today 
are known only as brief fragments. 

Another factor making chronological accounts less trustworthy for China than for 
Egypt and Babylonia is that books tended to accumulate in palace or government li- 
braries, where they disappeared in the great interdynastic upheavals. There is a story that 
in 221 B.c., when China was united under the despotic emperor Shih Huang-ti, he tried 
to destroy all books of learning and nearly succeeded. Fortunately, many books were 
preserved in secret hiding places or in the memory of scholars, who feverishly repro- 
duced them in the following dynasty. But such events make the dating of mathematical 
discoveries far from easy. 

Modern science and technology, as all the world knows, grew up in western Europe, 
with the life of Galileo marking the great turning point. Yet between the rst and fteenth 
centuries, the Chinese who experienced nothing comparable to Europe’s Dark Ages, were 
generally much in advance of the West. Not until the scientic revolution of the later 
stages of the Renaissance did Europe rapidly draw ahead. Before China’s isolation and 
inhibition, she transmitted to Europe a veritable abundance of inventions and technolog- 
ical discoveries, which were often received by the West with no clear idea of where they 
originated. No doubt the three greatest discoveries of the Chinese—ones that changed 
Western civilization, and indeed the civilization of the whole world—were gunpowder, 
the magnetic compass, and paper and printing. The subject of paper is of great interest; 
and we know almost to the day when the discovery was_ rst made. A popular account 
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1.3. Problems 


1. Express each of the given numbers in Babylonian 
cuneiform notation. 
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of the time tells that Tshai Lun, the director of imperial workshops in A.D. 105, went to 
the emperor and said, “Bamboo tablets are so heavy and silk so expensive that I sought 
for a way of mixing together the fragments of bark, bamboo, and _ shnets, and I have 
made a very thin material that is suitable for writing.” It took more than a thousand years 
for paper to make its way from China to Europe, rst appearing in Egypt about 900 and 
then in Spain about 1150. 

All the while mathematics was overwhelmingly concerned with practical matters that 
were important to a bureaucratic government: land measurement and surveying, taxation, 
the making of canals and dikes, granary dimensions, and so on. The misconception that 
the Chinese made considerable progress in theoretical mathematics is due to the Jesuit 
missionaries who arrived in Peking in the early 1600s. Finding that one of the most 
important governmental departments was known as the Ofce of Mathematics, they 
assumed that its function was to promote mathematical studies throughout the empire. 
Actually it consisted of minor ofcials trained in preparing the calendar. Throughout 
Chinese history the main importance of mathematics was in making the calendar, for its 
promulgation was considered a right of the emperor, corresponding to the issue of minted 
coins. In an agricultural economy so dependent on arti cial irrigation, it was necessary 
to be forewarned of the beginning and end of the rainy monsoon season, as well as of 
the melting of the snows and the consequent rise of the rivers. The person who could 
give an accurate calendar to the people could thereby claim great importance. 

Because the establishment of the calendar was a jealously guarded prerogative, it is 
not surprising that the emperor was likely to view any independent investigations with 
alarm. “In China,” wrote the Italian Jesuit Matteo Ricci (died 1610), “it is forbidden under 
pain of death to study mathematics, without the Emperor’s authorization.” Regarded as 
a servant of the more important science astronomy, mathematics acquired a practical 
orientation that precluded the consideration of abstract ideas. Little mathematics was 
undertaken for its own sake in China. 


3. Express the fractions 4,4, 4, 4,4. and - in 


sexagesimal notation. 


4. Convert these numbers from sexagesimal notation to 


our system. 
i ee es (a) 1,23,45. (c) 0;12,3,45. 
(b) 10,000. (e) 12,345, (b) 12:3.45 (@) 1.23:45 
(c) 100,000. (f) 123,456. —— canal 
2. Translate each of these into a number in our system. 5. Multiply the number 12,3;45,6 by 60. Describe a 
(a) simple rule for multiplying any sexagesimal number 
a 


ATTA TIT ATT 


< 


TIT4 4117. by 60; by 602. 


T < 


TT T 6. Chinese bamboo or counting-rod numerals, which 
may go back to 1000 B.C., originated from bamboo 
sticks laid out on at boards. The system is 
essentially positional, based on a 10-scale, with 
blanks where we should put zeros. There are two sets 
of symbols for the digits 1, 2, 3,..., 9, which are 


o t T< < TY TT <<< T T < < < 7, used in alternate positions. The rst set is used for 


TY 


yf < < units, hundreds, ten thousands: 
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Units, 
Hundreds, 
Ten 
thousands 
Tens, 
Thousands, = 
Hundred 

thousands 


Hi 
HT 
[I> 
[IF 
Ik 


| 
ill 


Thus, for example, the number 36,278 would be 


Hl Ll WW 


The circular symbol © for zero was introduced 
relatively late, rst appearing in print in the 1200s. 
Write the Chinese counting-rod numerals 
corresponding to 


(a) 1492. (d) 57,942. 
(b) 1999, (ec) 123,456. 
(c) 1606. (f) 3,040,279. 


Convert these into our numerals. 


Y 
© LT=IIt 


(©) . 
—W=Nl=! 
(@) 


miLu=zoLm 


Multiply I ATL lI by 10 and express the result 
in Chinese rod numerals. Describe a simple rule for 


multiplying any Chinese rod numerals by 10; by 10°. 


Perform the indicated operations. 


° STL +TLm™ 


INH 


(b) 

=!1OT - T= 
i= TT 1h 

The fth century Chinese (brush form) numeral 
system shares some of the best features of both 


(c) 


TUE TP TT TT 
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Egyptian hieroglyphic and Greek alphabetic 
numerals. It is an example of a vertically written 
multiplicative grouping system based on powers of 
10. The digits 1, 2, 3,..., 9 are ciphered in this 
system, thus avoiding the repetition of symbols, and 
special characters exist for 100, 1000, 10,000, and 
100,000. 


Numerals are written from the top downward, 
so that 


B (5 x 10,000) 
= 

A. (2 x 1000) 
f= j (100) 

+ (7 x 10) 
AN GC) 
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11. 


represents 


5- 10,000 + 2- 1000+ 100+ 7-10+4 
= 52,174. 


Notice that if only one of a certain power of 10 is 
intended, then the multiplier 1 is omitted. 

Express each of the given numbers in traditional 
Chinese numerals. 


(a) 236. (d) 1066. 
(b) 1492. (ec) 57,942. 
(c) 1999. (f) 123,456. 


Translate each of these numerals from the Chinese 
system to our numerals. 


(a) J (b) AS O=—=- @ & 


a+ IGN 


12. Multiply the given number by 10, expressing the 


result in Chinese numerals. 


[\ + ))-4 


13. The Mayan Indians of Central America developed a 


positional number system with 20 as the primary 
base, along with an additive grouping technique 
(based on 5) for the numbers in the 20-block. The 
symbols for 1 to 19 were represented by 
combinations of dots and horizontal bars, each dot 
standing for 1 and each bar for 5 (P26 & 7). 

The Mayan year was divided into 18 months 
of 20 days each, with 5 extra holidays added to Il 
the difference between this and the solar year. 
Because the system the Mayan priests developed was 
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designed mainly for calendar computations, they used 
18 - 20 = 360 instead of 207 for the third position; 
successive positions after the third had a 
multiplicative value 20, so that the place values 
turned out to be 


1, 20, 360, 7200, 144,000, .... 


Numerals were written vertically with the larger units 
above, and missing positions were indicated by a 


sign @. 
Thus, 

ee (2 x 144,000) 
a> (0 x 7200) 

ioe 

= (16 x 360) 

ee 
— (7x20) 
@e 
= (1x1) 


represents 


2- 144,000 + 0 - 7200 + 16- 360 
+ 7-204 11 = 290,311. 


Write the Mayan Priest numerals corresponding to 


(a) 1492. (d) 57,942. 
(b) 1999. (e) 123,456. 
(c) 1066. (f) 3,040,279. 


Convert these numerals from the Mayan Priest 
system into ours. 


() see ) a2 © + 
ae ~ = 
ae = <> 

a oe 


15. Perform the indicated operations shown here. 


(a) ee 


@ | 


iN 


@ | II 


Bibliography 


6) = 
QP ac 
(c) 


e 

16. Multiply the given number by <&¥® (20), expressing 
the result in the Mayan system. Describe a simple 
rule for multiplying any Mayan number by 20; by 
20°. 


17. How many different symbols are required to write 
the number 999,999 in (a) Egyptian hieroglyphics; 
(b) Babylonian cuneiform; (c) Ionian Greek numerals; 
(d) Roman numerals; (e) Chinese rod numerals; 
(f) traditional Chinese numerals; and (g) Mayan 
numerals? 
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2.1 


CHAPTER ) 


Mathematics in Early Civilizations 


In most sciences one generation tears down what another has built and what one has established 
another undoes. In Mathematics alone each generation builds a new story to an old structure. 
HERMANN HANKEL 


With the possible exception of astronomy, 


The Rhind Papyrus mathematics is the oldest and most continuously 


pursued of the exact sciences. Its origins lie 
Egyptian Mathematical Papyri shrouded in the mists of antiquity. We are often 

told that in mathematics all roads lead back to 
Greece. But the Greeks themselves had other ideas about where mathematics began. A 
favored one is represented by Aristotle, who in his Metaphysics wrote: “The mathemat- 
ical sciences originated in the neighborhood of Egypt, because there the priestly class 
was allowed leisure.” This is partly true, for the most spectacular advances in mathe- 
matics have occurred contemporaneously with the existence of a leisure class devoted to 
the pursuit of knowledge. A more prosaic view is that mathematics arose from practical 
needs. The Egyptians required ordinary arithmetic in the daily transactions of commerce 
and state government to x taxes, to calculate the interest on loans, to compute wages, 
and to construct a workable calendar. Simple geometric rules were applied to determine 
boundaries of elds and the contents of granaries. As Herodotus called Egypt the gift of 
the Nile, we could call geometry a second gift. For with the annual ooding of the Nile 
Valley, it became necessary for purposes of taxation to determine how much land had 
been gained or lost. This was the view of the Greek commentator Proclus (A.D. 410-485), 
whose Commentary on the First Book of Euclid’s Elements is our invaluable source of 
information on pre-Euclidean geometry: 


According to most accounts geometry was rst discovered among the Egyptians and origi- 
nated in the measuring of their lands. This was necessary for them because the Nile over ows 
and obliterates the boundaries between their properties. 


Although the initial emphasis was on utilitarian mathematics, the subject began eventu- 
ally to be studied for its own sake. Algebra evolved ultimately from the techniques of 
calculation, and theoretical geometry began with land measurement. 

Most historians date the beginning of the recovery of the ancient past in Egypt from 
Napoleon Bonaparte’s ill-fated invasion of 1798. In April of that year, Napoleon set sail 
from Toulon with an army of 38,000 soldiers crammed into 328 ships. He was intent 
on seizing Egypt and thereby threatening the land routes to the rich British possessions 


33 


34 


Chapter 2 Mathematics in Early Civilizations 


in India. Although England’s Admiral Nelson destroyed much of the French eet a 
month after the army debarked near Alexandria, the campaign dragged on another 12 
months before Napoleon abandoned the cause and hurried back to France. Yet what had 
been a French military disaster was a scientic triumph. Napoleon had carried with his 
expeditionary force a commission on the sciences and arts, a carefully chosen body of 
167 scholars—including the mathematicians Gaspard Monge and Jean-Baptiste Fourier— 
charged with making a comprehensive inquiry into every aspect of the life of Egypt 
in ancient and modern times. The grand plan had been to enrich the world’s store of 
knowledge while softening the impact of France’s military adventures by calling attention 
to the superiority of her culture. 

The savants of the commission were captured by the British but generously allowed 
to return to France with their notes and drawings. In due course, they produced a truly 
monumental work with the title Déscription de l’Egypte. This work ran to 9 folio volumes 
of text and 12 volumes of plates, published over 25 years. The text itself was divided into 
four parts concerned respectively with ancient Egyptian civilization, monuments, modern 
Egypt, and natural history. Never before or since has an account of a foreign land been 
made so completely, so accurately, so rapidly, and under such dif cult conditions. 

The Description de l’Egypte, with its sumptuous and magni cently illustrated folios, 
thrust the riches of ancient Egypt on a society accustomed to the antiquities of Greece 
and Rome. The sudden revelation of a ourishing civilization, older than any known 
so far, aroused immense interest in European cultural and scholarly circles. What made 
the fascination even greater was that the historical records of this early society were 
in a script that no one had been able to translate into a modern language. The same 
military campaign of Napoleon provided the literary clue to the Egyptian past, for one 
of his engineers uncovered the Rosetta Stone and realized its possible importance for 
deciphering hieroglyphics. 

Most of our knowledge of early mathematics in Egypt comes from two sizable papyri, 
each named after its former owner—the Rhind Papyrus and the Golenischev. The latter 
is sometimes called the Moscow Papyrus, since it reposes in the Museum of Fine Arts in 
Moscow. The Rhind Papyrus was purchased in Luxor, Egypt, in 1858 by the Scotsman 
A. Henry Rhind and was subsequently willed to the British Museum. When the health 
of this young lawyer broke down, he visited the milder climate of Egypt and became an 
archaeologist, specializing in the excavation of Theban tombs. It was in Thebes, in the 
ruins of a small building near the Ramesseum, that the papyrus was said to have been 
found. 

The Rhind Papyrus was written in hieratic script (a cursive form of hieroglyphics 
better adapted to the use of pen and ink) about 1650 B.c. by a scribe named Ahmes, who 
assured us that it was the likeness of an earlier work dating to the Twelfth Dynasty, 1849— 
1801 B.c. Although the papyrus was originally a single scroll nearly 18 feet long and 13 
inches high, it came to the British Museum in two pieces, with a central portion missing. 
Perhaps the papyrus had been broken apart while being unrolled by someone who lacked 
the skill for handling such delicate documents, or perhaps there were two nders and each 
claimed a portion. In any case, it appeared that a key section of the papyrus was forever 
lost to us, until one of those chance events that sometimes occur in archeology took place. 
About four years after Rhind had made his famous purchase, an American Egyptologist, 
Edwin Smith, was sold what he thought was a medical papyrus. This papyrus proved 
to be a deception, for it was made by pasting fragments of other papyri on a dummy 
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scroll. At Smith’s death (in 1906), his collection of Egyptian antiquaries was presented 
to the New York Historical Society, and in 1922, the pieces in the fraudulent scroll 
were identi ed as belonging to the Rhind Papyrus. The decipherment of the papyrus was 
completed when the missing fragments were brought to the British Museum and put in 
their appropriate places. Rhind also purchased a short leather manuscript, the Egyptian 
Mathematical Leather Scroll, at the same time as his papyrus; but owing to its very brittle 
condition, it remained unexamined for more than 60 years. 


A Key to Deciphering: The Rosetta Stone 


It was possible to begin the translation of the Rhind Papyrus almost immediately 
because of the knowledge gained from the Rosetta Stone. Finding this slab of polished 
black basalt was the most signi cant event of Napoleon’s expedition. It was uncovered by 
of cers of Napoleon’s army near the Rosetta branch of the Nile in 1799, when they were 
digging the foundations of a fort. The Rosetta Stone is made up of three panels, each 
inscribed in a different type of writing: Greek down the bottom third, demotic script 
of Egyptian (a form developed from hieratic) in the middle, and ancient hieroglyphic 
in the broken upper third. The way to read Greek had never been lost; the way to 
read hieroglyphics and demotic had never been found. It was inferred from the Greek 
inscription that the other two panels carried the same message, so that here was a trilingual 
text from which the hieroglyphic alphabet could be deciphered. 

The importance of the Rosetta Stone was realized at once by the French, especially 
by Napoleon, who ordered ink rubbings of it taken and distributed among the scholars 
of Europe. Public interest was so intense that when Napoleon was forced to relinquish 
Egypt in 1801, one of the articles of the treaty of capitulation required the surrender of 
the stone to the British. Like all the rest of the captured artifacts, the Rosetta Stone came 
to rest in the British Museum, where four plaster casts were made for the universities 
of Oxford, Cambridge, Edinburgh, and Dublin, and its decipherment by comparative 
analysis began. The problem turned out to be more dif cult than imagined, requiring 23 
years and the intensive study of many scholars for its solution. 

The nal chapter of the mystery of the Rosetta Stone, like the rst, was written by a 
Frenchman, Jean Francois Champollion (1790-1832). The greatest of all names associated 
with the study of Egypt, Champollion had had from his childhood a premonition of the 
part he would play in the revival of ancient Egyptian culture. Story has it that at the age 
of 11, he met the mathematician Jean-Baptiste Fourier, who showed him some papyri 
and stone tablets bearing hieroglyphics. Although assured that no one could read them, 
the boy made the determined reply, “I will do it when I am older.” From then on, almost 
everything Champollion did was related to Egyptology; at the age of 13 he was reading 
three Eastern languages, and when he was 17, he was appointed to the faculty of the 
University of Grenoble. By 1822 he had compiled a hieroglyphic vocabulary and given 
a complete reading of the upper panel of the Rosetta Stone. 

Through many years hieroglyphics had evolved from a system of pictures of com- 
plete words to one that included both alphabetic signs and phonetic symbols. In the hi- 
eroglyphic inscription of the Rosetta Stone, oval frames called “cartouches” (the French 
word for “cartridge”) were drawn around certain characters. Because these were the 
only signs showing special emphasis, Champollion reasoned that symbols enclosed by 
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THE ROSETTA STONE. 


The Rosetta Stone, bearing the same inscription in hieroglyphics, demotic script, 
and Greek. (Copyright British Museum.) 


the cartouches represented the name of the ruler Ptolemy, mentioned in the Greek text. 
Champollion also secured a copy of inscriptions on an obelisk, and its base pedestal, 
from Philae. The base had a Greek dedication honoring Ptolemy and his wife Cleopatra 
(not the famous but ill-fated Cleopatra). On the obelisk itself, which was carved in hi- 
eroglyphics, are two cartouches close together, so it seemed probable that these outlined 
the Egyptian equivalents of their proper names. Moreover, one of them contained the 
same hieroglyphic characters that Iled the cartouches found on the Rosetta Stone. This 
cross-check was enough to allow Champollion to make a preliminary decipherment. From 
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the royal names he established a correlation between individual hieroglyphics and Greek 
letters. In that instant in which hieroglyphics dropped its shroud of insoluble mystery, 
Champollion, worn by the years of ceaseless effort, was rumored to cry, “I’ve got it!” 
and fell into a dead faint. 

As a_ tting climax to a life’s study, Champollion wrote his Grammaire Egyptienne 
en Encriture Hieroglyphique, published posthumously in 1843. In it, he formulated a 
system of grammar and general decipherment that is the foundation on which all later 
Egyptologists have worked. The Rosetta Stone had provided the key to understanding 
one of the great civilizations of the past. 


The Rhind Papyrus starts with a bold premise. 


pe) Egyptian Arithmetic Its content has to do with “a thorough study of 
eas all things, insight into all that exists, knowledge 
Early Egyptian Multiplication of all obscure secrets.” It soon becomes apparent 


that we are dealing with a practical handbook of 
mathematical exercises, and the only “secrets” are how to multiply and divide. Nonethe- 
less, the 85 problems contained therein give us a pretty clear idea of the character of 
Egyptian mathematics. The Egyptian arithmetic was essentially “additive,” meaning that 
its tendency was to reduce multiplication and division to repeated additions. Multipli- 
cation of two numbers was accomplished by successively doubling one of the numbers 
and then adding the appropriate duplications to form the product. To nd the product of 
19 and 71, for instance, assume the multiplicand to be 71, doubling thus: 


1 71 
2 142 
4 284 
8 568 
16 1136 


Here we stop doubling, for a further step would give a multiplier of 71 that is larger than 
19. Because 19 = 1+2-+ 16, let us put checks alongside these multipliers to indicate 
that they should be added. The problem 19 times 71 would then look like this: 


wl 71 
w 2 142 
4+ 284 
8 568 
- 16 1136 


totals 19 1349 


Adding those numbers in the right-hand column opposite the checks, the Egyptian math- 
ematician would get the required answer, 1349; that is, 


1349 = 71+ 1424+ 1136 =(1+2+4 16)71 = 19-71. 


Had the number 19 been chosen as the multiplicand and 71 as the multiplier, the 
work would have been arranged as follows: 
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w 1 19 
ww 2 38 
w 4 76 
8 152 
16 304 
32 608 
- 64 1216 


totals 71 1349 


Because 71 = 1+2-+ 4+ 64, one has merely to add these multiples of 19 to get, again, 
1349. 

The method of multiplying by doubling and summing is workable because every 
integer (positive) can be expressed as a sum of distinct powers of 2; that is, as a sum 
of terms from the sequence, 1, 2, 4, 8, 16, 32,.... It is not likely that the ancient Egyp- 
tians actually proved this fact, but their con dence therein was probably established by 
numerous examples. The scheme of doubling and halving is sometimes called Russian 
multiplication because of its use among the Russian peasants. The obvious advantage is 
that it makes memorizing tables unnecessary. 

Egyptian division might be described as doing multiplication in reverse—where the 
divisor is repeatedly doubled to give the dividend. To divide 91 by 7, for example, a 
number x is sought such that 7x = 91. This is found by redoubling 7 until a total of 91 
is reached; the procedure is shown herewith. 


1 

2 

4 28 4 
8 = 30 
totals 13 91 


Finding that 7 + 28 + 56 = 91, one adds the powers of 2 corresponding to the checked 
numbers, namely, | + 4+ 8 = 13, which gives the desired quotient. The Egyptian divi- 
sion procedure has the pedagogical advantage of not appearing to be a new operation. 

Division was not always as simple as in the example just given, and fractions would 
often have to be introduced. To divide, say, 35 by 8, the scribe would begin by doubling 
the divisor, 8, to the point at which the next duplication would exceed the dividend, 
35. Then he would start halving the divisor in order to complete the remainder. The 
calculations might appear thus: 


cole ol BIE NIE BPR 
& 


totals 4+ 5+ 
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39 


Doubling 16 gives 32, so that what is missing is 35 — 32 = 3. One rst takes half of 8 to 

get 4, then half of 4 to get 2, and nally half of this to arrive at 1; when the fourth and 

the eighth are added, the needed 3 is obtained. Thus, the required quotient is 4 + ; + t. 
In another example, division of 16 by 3 might be effected as follows: 


1 3 
2 6 
4 12 
2 

5 2 
1 

3 ly 


totals 5+ i 16 


The sum of the entries in the left-hand column corresponding to the checks gives the 
quotient 5 + i. It is extraordinary that to get one-third of a number, the Egyptians rst 
found two-thirds of the number and then took one-half of the result. This is illustrated 
in more than a dozen problems of the Rhind Papyrus. 

When the Egyptian mathematician needed to compute with fractions, he was con- 
fronted with many dif culties arising from his refusal to conceive of a fraction like z, 
His computational practice allowed him only to admit the so-called unit fractions; that 
is, fractions of the form 1/n, where n is a natural number. The Egyptians indicated a 
unit fraction by placing an elongated oval over the hieroglyphic for the integer that was 

1 


és : = ie) 7 ‘ 
to appear in the denominator, so that ; was written as # or Too a8 9. With the exception 


of , for which there was a special symbol & all other fractions had to be decomposed 
into sums of unit fractions, each having a different denominator. Thus, for instance, g 
would be represented as 


6 fet yl 1 
T= 27tat wat we 
Although it is true that g can be written in the form 
6_lyiptpiygist 
oT pe ee Tg 


the Egyptians would have thought it both absurd and contradictory to allow such repre- 
sentations. In their eyes there was one and one part only that could be the seventh of 
anything. The ancient scribe would probably have found the unit fraction equivalent of 
g by the following conventional division of 6 by 7: 


1 7 
L eee vw 
2 2 
1 1 1 
a4 Itgtage 
1 
t 
1 1 
a 2 
1 1 
3 | Og a 
1 1 1 1 
totals stqgtyat eR 6 
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The Unit Fraction Table 


To facilitate such decomposition into unit fractions, many reference tables must have 
existed, the simplest of which were no doubt committed to memory. At the beginning 
of the Rhind Papyrus, there is such a table giving the breakdown for fractions with 
numerator 2 and denominator an odd number between 5 and 101. This table, which 
occupies about one-third of the whole of the 18-foot roll, is the most extensive of the 
arithmetic tables to be found among the ancient Egyptian papyri that have come down to 
us. The scribe rst stated what decomposition of 2/n he had selected; then, by ordinary 
multiplication, he proved that his choice of values was correct. That is, he multiplied the 
selected expression by the odd integer n to produce 2. Nowhere is there any inkling of 
the technique used to arrive at the decomposition. 

Fractions 2/n whose denominators are divisible by 3 all follow the general rule 


2 1 i 1 
3k 2k 6k’ 
Typical of these entries is (the case k = 5), which is given as 


2 


1 il 
is = 16 + 30° 


If we ignore the representations for fractions of the form 2/(3k), then the remainder of 
the 2/n table reads as shown herewith. 


2-141 214) 14 1 
sa GE 33 = 30 + 31g + 795 
2_1,1 214, 1 

7= aT 3g 3 = 30 + 30 
21,1 oe ala th a 
i ae 3 36 + oe t+ BT 
2 Neel el 2ot14y, 14, 14 1 
p=statig 6. = 20 + ma + ae t oO 
2-141 4541 eh te 
D=ptatEe 6 = 39 1 195 

2. 1 1 1 2) =. We Te 
pr-ptet+iga a -atwt sz 

2 i fe, al ee Mee alt 

mB —TDt TF apts t 7 

2_ 1 1 2s al sis, 2M i ee ie a 
site 3 = + m9 + m0 + 365 
2 tg To Lg ie aM es ces 

99 =a ts tia t a 7 = at 308 

Be MS Hee Wg fe all ie Gescecl Sey es He 
31 = 30 + fea + 55 79 = 0 + a7 t 36 + 700 
ee oe pee ee eee! a ee ee ae 
3 =30t & 3 — ot 32+ as + me 
De sl fe fe eee Cen ee 

7 =a t+ Ti Tt 206 a5 = 31 t 355 

De pole fe oe fe De all) Hes ed 3 AS cll Heals 
at = 34 + 346 + 338 39 = 6 + 350 + 33a + HO 
De ys ifs, SM hs el fo ds 
gp=ote tpt 30 1 = 70 + BO 

2D eds, fe pepe! oe are pets Lee 
H=3 tint To sot te 
Dal, tN, een Gee Bere! Gees eee 

a = 28 + 106 7 = 36+ oot 7. 
agers! es en coe age eed ae ts Oe ea ee ge! 
auto Tor = Tor + 202 + 303 + Ge 


Ever since the rst translation of the papyrus appeared, mathematicians have tried 
to explain what the scribe’s method may have been in preparing this table. Of the many 
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possible reductions to unit fractions, why is 


chosen for n = 19 instead of, say, 


No de nite rule has been discovered that will give all the results of the table. 
The very last entry in the table, which is 2 divided by 101, is presented as 


De Me ae Ne ae ope lt 
Tor = Tor + 302 + 303 + oe: 


This is the only possible decomposition of _ into no more than four different unit 
fractions with all the denominators less than 1000; and is a particular case of the general 
formula 


1 1 1 1 


n in Sh Gn 


By the indicated formula, it is possible to produce a whole new 2/n table consisting 
entirely of four-term expressions: 


patsttted 
pala dheged 
padadeaes 
pads geged 


Although the scribe was presumably aware of this, nowhere did he accept these values 
for this table (except in the last case, in): because there were so many other “simpler” 
representations available. To the modern mind it even seems that the scribe followed 
certain principles in assembling his lists. We note that 


Small denominators were preferred, with none greater than 1000. 
The fewer the unit fractions, the better; and there were never more than four. 


3. Denominators that were even were more desirable than odd ones, especially for the 
initial term. 


The smaller denominators came _ rst, and no two were the same. 
A small rst denominator might be increased if the size of the others was thereby 


2 1 1 I 2 I I 1 
reduced (for example, 3; = 59 + yg + ges was preferred to = = 7g + agg + 7). 


Why—or even whether—these precepts were chosen, we cannot determine. 


Example. As an illustration of multiplying with fractions, let us nd the product of 
2+ i and 1 + ; + i. Notice that doubling 1 + 5 + 4 gives 3 + z, which the Egyptian 
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mathematicians would have written 3 + ‘ + x. The work may be arranged as follows: 


1 +444 
wx 2 34454+H 
Lo odeitd 
rt ftits 


totals 2+ 4 3444444 


The mathematicians knew that twice the unit fraction 1/(27) is the unit fraction 1/n, so 
the answer would appear as 3+ 4+2+ 7. 


Example. For amore dif cult division involving fractions, let us look at a calculation 
that occurs in Problem 33 of the Rhind Papyrus. One is required here to divide 37 
by 1+ 2 + 5 + 5. In the standard form for an Egyptian division, the computation 
begins: 


1 1+3+5+4 
2 44+44+44+% 
4 8424544 
8 18+4+3 


16 36444444 


with the value for 2 recorded as ; + x: Now the sum 36 + 2 + ; + x is close to 37. 
By how much are we short? Or as the scribe would say, “What completes 3 + ++ 3 
up to 1?” In modern notation, it is necessary to get a fraction x for which 


Bip D ofe fh _— 1: 
s+ gta tx I; 
or with the problem stated another way, a numerator y is sought that will satisfy 
2 1 1 yo 
stata tee 1, 


where the denominator 84 is simply the least common multiple of the denominators, 3, 
4, and 28. Multiplying both sides of this equation by 84 gives 56+ 21+3+ y= 84, 


and so y = 4. Therefore, the remainder that must be added to 2 + ‘ + a to make 


1 is a. or x. The next step is to determine by what amount we should multiply 


1+ Z + 5 + 4 to get the required x: This means solving for z in the equation 
2l+$+54+4)= 39. 


Multiplying through by 42 leads to 97z =2 or z= 2 


97° 
found to be equal to x + = + =. Thus, the whole calculation would proceed as 


which the Egyptian scribe 


Egyptian Arithmetic 


43 


follows: 
1 14+44+444 
2 44+44+44+% 
4 8+24544 
8 18+34+4 
16 36+2+44+5 
1 1 1 1 
36+ ot 776 OT a 
totals 16+s¢+ a5 +7 37 


The result of dividing 37 by 1+$+5+4 is l6+2+ a54+ 7%. 


Representing Rational Numbers 


There are several modern ways of expanding a fraction with numerator other than 
2 as a sum of unit fractions. Suppose that z is required to be expanded. Because 


9=1+44- 2, one procedure might be to convert _ to 
9 _4 2 
ig = 13 + 4(yy)- 


The fraction _ could be reduced by means of the 2/n table and the results collected to 
give a sum of unit fractions without repetitions: 


S=4e4b toed) 
(cer Mera 1 
=ptitiyt 3% 
2,151 
=p TIT 36 
=Gtot@tit+xZ- 
The nal answer would then be 
GP ty A 1 1 
Ts 2b 34 36 + 35 + Tor 
What makes this example work is that the denominators 8, 52, and 104 are all divisible 
by 4. We might not always be so fortunate. 

Although we shall not do so, it can be proved that every positive rational number 
is expressible as a sum of a nite number of distinct unit fractions. Two systematic 
procedures will accomplish this decomposition; for the lack of better names let us call 
these the splitting method and Fibonacci’s method. The splitting method is based on the 
so-called splitting identity 

1 1 1 


A ae Ae 


which allows us to replace one unit fraction by a sum of two others. For instance, to 
handle 4 we rst write 
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and then split one of the fractions > into 1/20 + 1/19 - 20, so that 


aly 

19 

2_41 I I 
15 = 19 + 30 + 380° 
Again, in the case of 3, this method begins with 


3_1lil1i1 
spre Ss 


and splits each of the last two unit fractions into 1/6 + 1/5 - 6; thus, 
3 1 1 1 1 
sHsztletatGt+%)- 


There are several avenues open to us at this point. Ignoring the obvious simpli cations 
2. 


2) Po i it 4 tj Feo Do ot 
t=3 and 30 = 15 let us instead split z and 5 into the sums 5+ +34 737; 
respectively, to arrive at the decomposition 


Se Ma sp ee ts all the A ae es 
s=stetytatatat wa: 


In general, the method is as follows. Starting with a fraction m/n, rst write 
m 1 1 1 
es el Se eee 
non n n 
m—1 summands 
Now use the splitting identity to replace m — | instances of the unit fraction 1/n by 


1 1 
71 AED’ 


thereby getting 


m 1 1 1 1 1 
walt otaat l(a tm) 

1 1 
to+( +a): 


m — 2 summands 
Continue in this manner. At the next stage, the splitting identity, as applied to 


1 | 
—— and ———, 
n+1 n(n + 1) 


m1 1 nm 1 7 1 x 1 5. 1 
non ntl ntntl) nt+2) (n+1)m42)) nn+1)41 
+ + 
n(n + 1)[n(n + 1) + 1] : 
Although the number of unit fractions (and hence the likelihood of repetition) is increasing 
at each stage, it can be shown that this process eventually terminates. 

The second technique we want to consider is credited to the thirteenth-century Italian 
mathematician Leonardo of Pisa, better known by his patronymic, Fibonacci. In 1202, 
Fibonacci published an algorithm for expressing any rational number between 0 and 1 
as a sum of distinct unit fractions; this was rediscovered and more deeply investigated 
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by J. J. Sylvester in 1880. The idea is this. Suppose that the fraction a/b is given, where 
0 <a/b <1. First nd the integer ; satisfying 


1 2 a 1 

n{ a) Ay = 1 
or what amounts to the same thing, determine nm; in such a way that n; — 1 < b/a <n. 
These inequalities imply that nja — a < b < nya, whence nya — b <a. Subtract 1/n; 
from a/b and express the difference as a fraction, calling it a)/b,: 


a 1 nia—-b a, 
bon bn Dy 
This enables us to write a/b as 
a 1 qQ\ 
bm Br 


The important point is that aj = na — b <a. In other words, the numerator a, of this 
new fraction is smaller than the numerator a of the original fraction. 

If a; = 1, there is nothing more to do. Otherwise, repeat the process with a, /b,; now 
playing the role of a/b to get 


-=—+—4+-, where a < aq). 
b ny nz bo 
At each successive stage, the numerator of the remainder fraction decreases. We must 
eventually come to a fraction a;/b, in which a, = 1; for the strictly decreasing sequence 
1 < ay < ay_| <+++ <a, <a cannot continue inde nitely. Thus, the desired represen- 
tation of a/b is reached, with 
a 1 m 1 a ade 1 a 1 
b ny no ne dy 
a sum of unit fractions. 
Let us examine several examples illustrating Fibonacci’s method. 


Example. Take a/b = 4. To nd 7, note that 9 < ~ < 10, and so “7 < ~ < $3 
hence, n; = 10. Subtraction gives 
2 I 20-19 1 
19 10 19-10 190 
We may therefore represent - as - = + we 
Example. For a more penetrating illustration, we turn to the fraction a/b = - once 


9 
BB 
This means that the rst unit fraction in the decomposition of = is 5. Now 
9 I= D5 
13.2 13-2 26 


again. Dividing 9 into 13, one gets 1 < — < 2, leading to 5 < — < 1; hence, n; = 2. 


which implies that 


1 5 
327+ 36: 
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As expected, the numerator of the remainder fraction is less than the numerator of the 
given fraction; that is, 5 < 9. Now repeat the process with the fraction =. Because 


5< 2% <6 weget! <2 <! and n2 = 6. Carrying out the arithmetic gives 
get Irying g 


5 26 5 
5 1 30-26 4 1 


266 26-6 156 39’ 


in consequence of which 


Sui1lyd 
2 6 T 35° 
Putting the pieces together, we get our expansion for a: 


Oe oD pe cl ah od 
BH~7tEt 3- 


The Rhind Papyrus contains several “com- 


Four Problems from the Rhind Papyrus pletion” problems. These usually start with 


a sum of unit fractions and seek further unit 
The Method of False Position fractions to be added, to obtain the value 1. 

Problem 22, for instance, asks to complete 
2 + + so as to produce the sum 1. In modern notation, the scribe performs the calcula- 
tions by rst selecting a convenient number N and unit fractions 1/n,,..., 1/n, in order 
to satisfy the equation 


It would follow from this that the expanded sum is equal to 1. Taking N to be 30— 
convenient, as it is a common multiple of the given denominators—the scribe observed 
that 


2.4 
~ + —)30=204+1=21, 
(F+5) - 


which is 9 short of the desired 30. But 


lL. 
—~+—)30=64+3=9. 
(5+7) * 


Adding the two equations gives 


ef ea 30 = 30 
3 30 5 10 ~ 


and so the desired completion is 


Much space is taken up in the Rhind Papyrus by practical problems concerning the 
equitable division of loaves among a certain number of men or determining the amount 
of grain needed for making beer. These problems were simple and did not go beyond 
a linear equation in one unknown. Problem 24, for example, reads: “A quantity and its 
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added become 19. What is the quantity?” Today with our algebraic symbolism, we 
hould let x stand for the quantity sought and the equation to be solved would be 


Drape 


x 
a or 7 = 19. 
Ahmes reasoned that although his notation did not admit the fraction £, “As many times 
as 8 must be multiplied to give 19, just as many times must 7 be multiplied to give 
the correct number.” The scribe was using the oldest and most universal procedure for 
treating linear equations, the method of false position, or false assumption. Brie y, this 
method is to assume any convenient value for the desired quantity, and by performing 
the operations of the problem at hand, to calculate a number that can then be compared 
with a given number. The true answer has the same relation to the assumed answer that 
the given number has to the number thus calculated. 

For instance, in solving the equation x + x/7 = 19, one assumes falsely that x = 7 
(the choice is convenient because x/7 is easy to calculate). The left-hand side of the 
equation would then become 7 + Z = 8, instead of the required answer 19. Because 
8 must be multiplied by 2 =2+ ; +3 to give the desired 19, the correct value 
of x is obtained by multiplying the false assumption, namely, 7, by 2+ i + z The 
result is 


x=(2+54+3)7=164+54+3. 
Actually, we could pose any convenient value for the unknown quantity, say x =a. 


If a+a/7 =b5 and bc = 19, then x = ac satis es the equation x + x/7 = 19; for it is 
easily seen that 


ie = ( +=) = be = 19 
ac yaaa 7) o= c= 7 


We have seen that the Egyptians anticipated, at least in an elementary form, a favorite 
method of the Middle Ages, the false position. Once the method was learned from the 
Arabs, it became a prominent feature of European mathematics texts from the Liber Abaci 
(1202) of Fibonacci to the arithmetics of the sixteenth century. As algebraic symbolism 
developed, the rule disappeared from the more advanced works. Here is an example 
from Liber Abaci: a certain man buys eggs at the rate of 7 for 1 denarius and sells them 
at a rate of 5 for 1 denarius, and thus makes a prot of 19 denarii. The question is: 
How much money did he invest? Algebraically, this problem would be expressed by the 
equation 

ts —-x=19. 

5 
The procedure of false position consists here in assuming 5 for the unknown; then 
z -5—5=2. This 2, in the expressive language of Fibonacci, “would be like 19” (it is 


related to 19 as 5 is to the sought number). Because 22) = 19, the correct answer is 
x = 5(2) = 47}. 
Notice that the number posed by Fibonacci for the unknown was not arbitrarily chosen— 


when the coef cient of an unknown is a fraction, the number assumed for the unknown 
is the denominator of the fraction. 
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Thus far we have considered the rule of false position in which a single guess 
was made; but there was a variant that necessitated making two trials and noting the 
error due to each. This cumbersome rule of double false position, as it was sometimes 
called, can be explained as follows. To solve the equation ax + b = 0, let g; and g> be 
two guesses about the value of x, and let f; and /5 be the corresponding failures, that 
is, the values ag; + 5 and ag, + b, which would equal zero if the guesses were right. 
Then 


(1) agitb=f; and (2) ag+b= fr. 
On subtracting, it is clear that 
(3) a(gi — 82) = fi— fr. 


Multiplying equation (1) by go and equation (2) by gi gives 


42122 + bgr = fig? and = agog, + bg, = fog. 


When these last two equations are subtracted, the result is 


(4) b(g2 — 81) = fig2 — fri. 

To nish the argument, divide equation (4) by equation (3) to get 
bd figs posi 
a fi-h 

But because x = —b/a, the value of x is found to be 


ya LBT Bi 
f= ~ 


In summary, we have placed two false values for x in the expression ax +b, and 
from these trials we have been able to get the correct solution to the equation 
ax+b=0. 

To make things more specic, let us look at an actual example, for instance, the 
equation 


x+ = = 19, or equivalently, x+ = —19=0. 


We take two guesses about the value of x, say g} = 7 and gy = 14. Then 
7+2-19=-11=f, and 1444 -19=-3= f. 


It follows that the true value of x is 


fis fg, (M437 133 
i ia: (=I) =(=2) 8 2 8 
Awkward as it seems, there is a certain element of simplicity in this primitive rule, 
and no wonder it was used even in the late 1880s. In his Grounde of Artes, Robert Recorde 
(1510-1558) wrote that he astonished his friends by proposing dif cult questions and 
then, with the rule of falsehood, nding the true result from the chance answers of “such 
children or idiots as happened to be in the place.” 
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A Curious Problem 


Getting back to the Rhind Papyrus, we can consider Problem 28 the earliest example 
of a “think of a number” problem. Let us state this problem and Ahmes’s solution in 
modern terms, adding a few clarifying details. 


Example. Think of a number, and add z of this number to itself. From this sum 
subtract + its value and say what your answer is. Suppose the answer was 10. Then 
take away a of this 10, giving 9. Then this was the number rst thought of. 


Proof. If the original number was 9, then 3 is 6, which added makes 15. Then i of 
15 is 5, which on subtraction leaves 10. That is how you do it. 


Here the scribe was really illustrating the algebraic identity 


(8) $048) llor 3) $9) 


by a simple example, in this case using the number n = 9. Having disclosed his “obscure 
secret,” he added a traditional concluding phrase, “And that is how you do it.” 

Problem 79 is extremely concise and contains a curious set of data—which seems 
to indicate an acquaintance with the sum of a geometric series: 


Houses 7] 

Cats 49 

1 2801 Mice 343 

2 5602 Sheaves 2401 

4 11,204 Hekats (measures of grain) 16,807 
total 19,607 total 19,607 


This catalog of miscellany has suggested some fanciful ideas. Certain authorities regard 
these words as symbolic terminology given to the rst ve powers of 7. For at the right, 
we have the summation of 7, 77, 7°, 74, and 7° by actual addition. At the left, the sum 
of the same series is given as 7- 2801, with the multiplication carried out by the usual 
method of duplication. Because 2801 = (7° — 1)/(7 — 1), the result 


5 
72801 =7(7 
7 


-—1 


) STEP EPSPs 
is exactly what would be obtained by substitution in the modern formula for the sum S,, 
of n terms of a geometric series: 


"_] 
S,=atar+ar?+---4ar"! =a- 


r-1 

(We note in the problem before us that a =r = 7 and n = 5.) Was such a formula, even 
for simpler cases, known to the Egyptians? There is no concrete evidence that it was. 
A more plausible interpretation of what is intended is something of the sort: “In each 
of seven houses there are seven cats; each cat kills seven mice; each mouse would have 
eaten seven sheaves of wheat; and each sheaf of wheat was capable of yielding seven 
hekat measures of grain. How much grain was thereby saved?” Or one may prefer the 
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question, “Houses, cats, mice, sheaves, and hekats of grain—how many of these were 
there in all?” 

Some 3000 years after Ahmes, Fibonacci included in his Liber Abaci the same series 
of powers of seven with one further term: 


Seven old women were on the road to Rome; 
Each woman had seven donkeys; 

Each donkey carried seven sacks; 

Each sack contained seven loaves of bread; 
With each loaf were seven knives; 

Each knife was in seven sheaths. 

What is the total? 


This rendering, coupled with the number seven, reminds us of an Old English chil- 
dren’s rhyme, one version of which appears below: 


As I was going to Saint Ives, 

I met a man with seven wives. 

Each wife had seven sacks; 

Every sack had seven cats; 

Every cat had seven kits; 

Kits, cats, sacks, and wives, 

How many were going to Saint Ives? 


Here also, it is suggested that the sum total of a geometric progression be calculated, 
but there is a joker in the actual wording of the rst and last lines. While the surprise 
twist is in all likelihood an Anglo-Saxon contribution, one can see how the same kind 
of problem perpetuated itself throughout centuries. 

The Rhind Papyrus closes with the following prayer, expressing the principal worries 
of an agricultural community: “Catch the vermin and the mice, extinguish the noxious 
weeds; pray to the God Ra for heat, wind, and high water.” 


Egyptian Mathematics as Applied Arithmetic 


Looking at the extant Egyptian mathematical manuscripts as a whole, we nd_ that 
they are nothing but collections of practical problems of the kind that are associated with 
business and administrative transactions. The teaching of the art of calculation appears to 
be the chief element in the problems. Everything is stated in terms of speci c numbers, 
and nowhere does one nd a trace of what might properly be called a theorem or a 
general rule of procedure. If the criterion for scientic mathematics is the existence of 
the concept of proof, the ancient Egyptians con ned themselves to “applied arithmetic.” 
Perhaps the best explanation of why the Egyptians never got beyond this relatively prim- 
itive level is that they had a natural, but unfortunate, idea of admitting only fractions 
with numerator one; thus even the simplest calculations became slow and laborious. It 
is hard to say whether the symbolism prevented the use of fractions with other numera- 
tors or whether the exclusive use of unit numerators was the reason for the symbolism 
they used to express fractions. The handling of fractions always remained a special art 
in Egyptian mathematics and can best be described as a retarding force on numerical 
procedures. 
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Part of the Rhind Papyrus. (Copyright British Museum.) 


As evidenced by the Akhmin Papyrus (named after the city on the upper Nile where 
it was discovered), it appears that the methods of the scribe Ahmes were still in vogue 
centuries later. This document, written in Greek at some point between A.D. 500 and 800, 
closely resembles the Rhind Papyrus. Its author, like his ancient predecessor Ahmes, gave 
tables of fractions decomposed into unit fractions. Why did Egyptian mathematics remain 
so remarkably the same for more than 2000 years? Perhaps the Egyptians entered their 
discoveries in sacred books, and in later ages, it was considered heresy to modify the 
method or result. Whatever the explanation, the mathematical attainments of Ahmes were 
those of his ancestors and of his descendants. 


3. Use the Egyptian method of multiplication to 
2.3 Problems calculate the following products: 


(a) (114+ 4+ 4)37. 
1. Use the Egyptian method of doubling to nd the 
following products: oe 3 + DO + 3 + ) ; 
(a) 18-25.  (b) 26-33. (c) (24+4)1+ $+ 9). 


i) Beeehs a BOS oe 4. (a) Show that the product of - by 1+ 5 + ; is 


2. Find, in the Egyptian fashion, the following equal to ¢ (Problem 12 of the Rhind Papyrus). 
quotients: (b) Show that the product of ~ + = by 1+ 4 =F t 
; is equal to + (Problem 15 of the Rhind 
(a) 184=+8. Papyrus). 
(b) 19+8. 
(c) 47+9. 5. Problem 30 of the Rhind Papyrus asks the reader to 
(d) 1060 + 12. nd a quantity such that : + J of it will make 10. 


(e) 61+8. Do this as the Egyptians would have done, rst by 
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10. 


11. 


12. 


13. 


con rming that 
2 it 29 


and then determining by what amount 2 s+ = should 
be multiplied to give _ 


(a) Show that 


hence that 2/n can be expressed as a sum of 
unit fractions whenever n is divisible by 5. 
(b) Use part (a) to seh a me unit fraction 


decompositions of + oe 2, and = as given in 
the Rhind Papyrus. 
(a) Show that 
2 11 rn 3 1 
n 2n 2n° 


hence that 2/n can be expressed as a sum of 
unit fractions whenever n is divisible by 3. 
(b) Use part (a) to obi he unit fraction 


= 
decompositions of 2 oe 2, and 


Show that 


2 _it,#4 
nk 


mn mk 
where the number k = (m +n) iE 2 Use this to get the 


unit fraction decompositions of 2 and = a 
given in the Rhind Papyrus. 


Verify that 


7 oe 


2 i 1 1 1 
=== ree — a —_ 1S —_— 
3n 
and use this fact ih obtain the unit fraction 


decomposition of > as given in the Rhind Papyrus. 


Suppose that 7 is ak by 7. Find a formula 
similar to that of Problem 6(a) that will represent 
2/n as a sum of unit fractions. 


Using the 2/n table, write 2, as sums of 


i 49° 
unit fractions without repetitions. 


19 
and 3: 


Represent 3 7 4 eo and 4 39 as sums of distinct unit 
fractions using (a) the splitting identity and 
(b) Fibonacci’s method. 


Show that if divides m + 1, the fraction n/m can 
always be written as a sum of two unit fractions. 
Illustrate this with a specic example. 


Chapter 2 


14. 


15. 


16. 


17. 


18. 
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Expand 2 as a sum of distinct unit fractions using 
the splitting identity. 


Find a unit fraction representation of 2 that involves 


at least six terms. Do the same for z. 


Represent 4 = ~ and 4 = as sums of distinct unit fractions 
using Fibonacct S method. 


A method for writing 2/n, where n is an odd 
number, as a sum of unit fractions, proceeds as 
follows. Given an integer m, put 2/n = 2m/(nm). If 
from among the divisors of nm a set can be chosen 
whose sum equals 2m, take these divisors as the 
numerators of fractions whose denominators are nm. 
The peau is a unit fraction decomposition oF 2/n. 
For 4 75 We might let m = 12, so that = = =. From 
the divisors 1, 2, 3, 4, 6, 12, 19, ... of 228, it is 
possible to nd four sets of integers whose sums are 
each 24; speci cal ly, 


24=14+44+19=2+43419 
=2+4+6412 
=14+24+3+4+6+412. 


Using these, one gets 


bBrmtmtmomtate 

bBomtatmaemtete 

: = a ag ms + aE si aR = Tr + a + = + a 

b= mtototopto = otitis 
+a+% 


Applying this technique, obtain unit fraction 
expansions of 2 and =. [Hint: Take m = 4 and 
m = 12, respectively.] 


Consider the following variation of Problem 17 for 
writing 2/n as a sum of unit fractions. Choose an 
integer N having a set of divisors whose sum is 
2N —n, say, 


N= dM, = d)M), = d3M, and 
2N=n+d,+a+4+d3. 
Then 
je gee 
N N N WN 
n 1 1 1 


= W TA + ‘v2 + Ms 
whence 2/n can be decomposed as 


2 it . 1 1 
Mon M3n- 


= ae 
n N Min 
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In the case of =. we might take N = 18, so that 
18=2-9=6-3=9-2 

and 
2-18=19+2+6+9. 


Using these relations, one gets 


i 1 1 1 
is — ig t+ in + 37 + 38: 
Apply this technique to obtain unit fraction 


expansions for = and =. [Hint: Take N = 12 and 
N = 36.] 


Problems 3-6 of the Rhind Papyrus describe four 
practical problems: the division of 6, 7, 8, and 9 
loaves equally among 10 men. Solve each of these 


problems by false position, expressing the answers in 


unit fractions. 


Problems 25, 26, and 27 of the Rhind Papyrus are as 
follows: 

Problem 25. A quantity and its ; added together 
become 16. What is the quantity? 

Problem 26. A quantity and its + added together 
become 15. What is the quantity? 

Problem 27. A quantity and its 4 added together 
become 21. What is the quantity? 

Solve each of these problems by false position, 
expressing the answers in unit fractions. 


Problem 31 of the Rhind Papyrus states: A quantity 


and its 2, its +, and its 4 added together become 33. 


What is the quantity? Using modern notation, this 
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calls for solving the equation 


peg a? 33 
x+—+4+2-4+2-=33. 
3 27 


Show that the scribe’s answer 
- 1 1 1 I 1 1 1 
x= 144 gt 56+ 95 + oq + 3e8 + os t+ 776 


: ; 42 - 33 28 
is correct. | Hint: x = ——— = 144+ —. 
97 97 
22. Solve Problem 32 of the Rhind Papyrus, which 
states: A quantity, its x, its 7 added together 
becomes 2. What is the quantity? Express the answer 
in the Egyptian fashion. 


23. In Problem 70 of the Rhind Papyrus, one is asked to 
nd_ the quotient when 100 is divided by 
7+ t + + + 4: do this. [Hint: At some point in the 
calculation take 2 of 7+ 5 + ; + i. Also note that 
the relation 8(7 + ; + ; + z) = 63 implies that 
g0+}+14+)=H1 

24. Problem 40 of the Rhind Papyrus concerns an 
arithmetic progression of ve terms. It states: Divide 
100 loaves among 5 men so that the sum of the three 
largest shares is 7 times the sum of the two smallest. 


(a) Solve this problem by modern techniques. 

(b) Using the method of false position, the scribe 
assumed a common difference of 5 + + and the 
smallest share of 1 (hence, the ve shares are 
1,6+4 4,12, 17+ 4, 23). Obtain the correct 
answer from these assumptions. 


The generally accepted account of the origin of ge- 
ometry is that it came into being in ancient Egypt, 
where the yearly inundations of the Nile demanded 
that the size of landed property be resurveyed for tax 
purposes. Indeed, the name “geometry,” a compound 
of two Greek words meaning “earth” and “measure,” 


seems to indicate that the subject arose from the necessity of land surveying. The Greek 
historian Herodotus, who visited the Nile about 460-455 B.c., described how the rst 
systematic geometric observations were made. 


They said also that this king [Sesostris] divided the land among all Egyptians so as to give 
each one a quadrangle of equal size and to draw from each his revenues, by imposing a tax 
to be levied yearly. But every one from whose part the river tore away anything, had to 
go to him and notify what had happened. He then sent the overseers, who had to measure 
out by how much the land had become smaller, in order that the owner might pay on what 
was left, in proportion to the entire tax imposed. In this way, it appears to me, geometry 


originated. 
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Whatever opinion is ultimately adopted regarding the rst steps in geometry, it does seem 
safe to assume that in a country where cultivating even the smallest portion of fertile soil 
was a matter of concern, land measurement became increasingly important. To this end 
must be ascribed some of the remarkable results the Egyptians obtained in mathematics. 

The task of surveying was performed by specialists whom the later Greeks called 
rope-stretchers, or rope-fasteners, because their main tool apparently was a rope with 
knots or marks at equal intervals. In a passage written about 420 B.c., the Greek philoso- 
pher Democritus (460-370 B.C.) testies that in his time the Egyptian surveyors still 
ranked high among the great geometers, possessing a skill almost equal to his own. He 
proudly boasted, “No one can surpass me in the construction of plane gures with proof, 
not even the so-called rope-stretchers among the Egyptians.” 

What occupied the Egyptian geometers of some 4000 years ago? The mathematical 
papyri that have come down to us contain numerous concrete examples, without any 
theoretical motivation, of prescription-like rules for determining areas and volumes of 
the most familiar plane and solid gures. Such rules of calculation must be recognized 
as strictly empirical results, the accretion of ages of trial-and-error experience and ob- 
servation. The Egyptians sought useful facts relating to measurement, without having to 
demonstrate such facts by any process of deductive reasoning. Some of their formulas 
were only approximately correct, but they gave results of suf cient acceptability for the 
practical needs of everyday life. 

In the great dedicatory inscription, of about 100 B.C., in the Temple of Horus at Edfu, 
there are references to numerous four-sided elds that were gifts to the temple. For each 
of these, the areas were obtained by taking the product of the averages of the two pairs 
of opposite sides, that is, by using the formula 


A=i(a+c\(b +4), 


where a, b, c, and d are the lengths of the consecutive sides. The formula is obviously 
incorrect. It gives a fairly accurate answer only when the eld is approximately rectangu- 
lar. What is interesting is that this same erroneous formula for the area of a quadrilateral 
had appeared 3000 years earlier in ancient Babylonia. 

The geometrical problems of the Rhind Papyrus are those numbered 41-60, and 
are largely concerned with the amounts of grain stored in rectangular and cylindrical 
granaries. Perhaps the best achievement of the Egyptians in two-dimensional geometry 
was their method for nding the area of a circle, which appears in Problem 50: 


Example of a round eld of a diameter 9 khet. What is its area? Take away : of the 
diameter, namely 1; the remainder is 8. Multiply 8 times 8; it makes 64. Therefore it 
contains 64 setat of land. 


The scribes’ process for nding the area of a circle can thus be simply stated: Subtract 
from the diameter its $ part and square the remainder. In modern symbols, this amounts 


to the formula 2 2 
Beg e) 2s). 
9 9 


where d denotes the length of the diameter of the circle. If we compare this with the 
actual formula for the area of the circle, namely zd?/4, then 


nd? (8d\? 
4  \o)}? 
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so that we get 
m= 4(8) = 3.1605... 


for the Egyptian value of the ratio of the circle’s circumference to its diameter. This is a 
close approximation to 35, which many students nd good enough for practical purposes. 

In the Old Babylonian period (roughly 1800-1600 B.c.), the circumference of a circle 
was found by taking three times its diameter. Putting this equal to zd, we see that their 
calculation is equivalent to using 3 for the value of z. The Hebrews used the same value 
in the Old Testament, for example, in I Kings 7:23, wherein the dimensions of the bath 
in the temple of Solomon are described. The verse was written about 650 B.C., and may 
have been taken from temple records dating back to 900 B.c. It reads, “And he made a 
molten sea, 10 cubits from one brim to the other: it was round all about ... : and a line 
of 30 cubits did compass it round about.” A cuneiform tablet discovered at Susa by a 
French archaeological expedition in 1936 (the interpretation of which was published in 
1950) seems to indicate that the Babylonian writer adopted 3;7,30 or 3t as the value of 
z. This is at least as good as the approximation found by the Egyptians. 

We have no direct knowledge about how the formula 4 = (8d/9)? for the area of a 
circle was arrived at, but it is possible that Problem 48 of the Rhind Papyrus provides a 
hint. In this problem, the usual statement of what the author proposed to do was replaced 
by a_ gure that, although drawn quite roughly, most strongly suggests a square with four 
triangles at the vertices. In the middle of the gure is the demotic sign for 9. Thus it 
appears that the scribe formed an octagon from a square of side 9 units by 


—_ 
S71 


trisecting the sides and cutting off the four corner isosceles triangles (each triangle hav- 
ing an area of 3 square units). The scribe may have concluded that the octagon was 
approximately equal in area to the circle inscribed in the square, because some portions 
of the inscribed circle lie outside the octagon and some portions lie inside, and these 
appear to be roughly equal. 
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Now the area of the octagon equals the area of the original square less the areas of 
the four isosceles triangles made up of the four cut-off corners; that is, 


A=9 —4(3) = 63. 


This is nearly the value that is obtained by taking d = 9 in the expression (8d/9)*. Thus 
a possible explanation of the area formula A = (8d/9)* is that it arose from considering 
the octagon as a__ rst approximation to the circle inscribed in a square. 

Problem 52 of the Rhind Papyrus calls for nding the area of a trapezoid (described 
as a truncated triangle) with apparently equal slanting sides; the lengths 6 and 4 of the 
parallel sides and the length 20 of an oblique sides are given. 


20 


The calculation is carried out by means of the formula 
A=}(b+ bh. 


Did the author of the papyrus think that the area of a trapezoid was half the sum of the 
length of the parallel sides times the slant height, or was one oblique side intended to 
be perpendicular to the parallel sides? In the latter case, he would have been correct. It 
is not at all unlikely that the diagram, which is little more than a rough sketch, is badly 
drawn and that one of the seemingly equal sides is really meant to be perpendicular to 
the parallel sides. 


The Volume of a Truncated Pyramid 


There are only 25 problems in the Moscow Papyrus, but one of them contains the 
masterpiece of ancient geometry. Problem 14 shows that the Egyptians of about 1850 
B.C. were familiar with the correct formula for the volume of a truncated square pyramid 
(or frustum). In our notation, this is 


_A 


V= 3 (a@° + ab +b’), 


where / is the altitude and a and b are the lengths of the sides of the square base and 
square top, respectively. 
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The gure associated with Problem 14 looks like an isosceles trapezoid, 


2 


a 


but the calculations indicate that the frustum of a square pyramid is intended. The exact 
text in this connection may be given: 


Example of calculating a truncated pyramid. If you are told: a truncated pyramid of 6 for 
the vertical height by 4 on the base by 2 on the top: You are to square this 4; result 16. You 
are to double 4; result 8. You are to square this 2; result 4. You are to add the 16 and the 
8 and the 4; result 28. You are to take i of 6; result 2. You are to take 28 twice; result 56. 
See, it is of 56. You will nd_ it right. 


Although this solution deals with a particular problem and not with a general theorem, 
it is still breathtaking; some historians of mathematics have praised this achievement as 
the greatest of the Egyptian pyramids. 

It is generally accepted that the Egyptians were acquainted with a formula for the 
volume of the complete square pyramid, and that it probably was the correct one, 


h 
V = —a’. 
3 


In analogy with the formula 4 = sbh for the area of a triangle, the Egyptians may have 
guessed that the volume of a pyramid was a constant times ha*. We may suppose even 
that they guessed the constant to be 1/3. But the formula 


h 
V=— 
3 


(a* +ab+b’) 

could not very well be a guess. It could have been obtained only by some sort of geometric 
analysis or by algebra from V = (h/3)a?. It is not, however, an easy task to reconstruct 
a method by which they could have deduced the formula for the truncated pyramid with 
the materials available to them. 


Speculations About the Great Pyramid 


Any survey of the mathematics of the Egyptians ought to include a brief reference to 
the Great Pyramid at Gizeh, erected about 2600 B.c. by Khufu, whom the Greeks called 
Cheops. It provides monumental evidence of an appreciation of geometric form and a 
relatively high development of engineering construction, not to mention a very remarkable 
social and governmental organization. According to Herodotus, 400,000 workmen labored 
annually on the Great Pyramid for 30 years—four separate groups of 100,000, each group 
employed for three months. (Calculations indicate that no more than 36,000 men could 
have worked on the pyramid at one time without hampering one another’s movements.) 
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= 4 
la >a 


Extract from the Mathematical Leather Scroll, containing simple relations 
between fractions such as } + <4 = 4. (Courtesy British Museum.) 


Ten years were spent constructing a road to a limestone quarry some miles distant, and 
over this road were dragged 2,300,000 blocks of stone averaging 24 tons and measuring 
3 feet in each direction. These blocks were tted together so perfectly that a knife blade 
cannot be inserted in the joints. 

What has impressed people down through the years is not the aesthetic quality of 
the Great Pyramid but its size; it was the largest building of ancient times and one of 
the largest ever erected. When it was intact, it rose 481.2 feet (the top 31 feet are now 
missing), its four sides inclined at an angle of about 51°51’ with the ground, and the 
base occupied 13 acres—an area equal to the combined base areas of the cathedrals 
of Florence and Milan, St. Peter’s in Rome, and St. Paul’s Cathedral and Westminster 
Abbey in London. Even more amazing was the accuracy with which it was put together. 
The base was almost a perfect square, no one of the four sides differing from the mean 
length of 755.78 feet by more than 4s inches. By using one of the celestial bodies, the 
Cheops builders were able to orient the sides of the pyramid almost exactly with the four 
cardinal points of the compass, the error being only fractions of 1°. 

The Great Pyramid has down to the present red adventurous minds to the wildest 
speculations. These pyramid mystics (or as they are sometimes uncharitably called, pyra- 
midiots) have ascribed to the ancient builders all sorts of metaphysical intentions and 
esoteric knowledge. Among the extraordinary things claimed is that the pyramid was 
built so that half the perimeter of the base divided by the height should be exactly equal 
to a. While the difference between the two values 


2(755.78) 


= 3.1415926... and 
7 anc’ 481.2 


= 3.14123... 
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is only 0.00036..., their closeness is merely accidental and has no basis in any mathe- 
matical law. 

The Egyptian priests, according to a ction that has crept into the recent literature, 
told Herodotus that the dimensions of the Great Pyramid were so chosen that the area of 
each face would be the same as the area of a square with sides equal to the Pyramid’s 
height. Writing 2b for the length of a side of the base, a for the altitude of a face triangle, 
and A for the height of the pyramid, we nd that Herodotus’s relation is expressed by 
the equation 


1 
h? = 3 (2b -a) =ab. 


REY 


The Pythagorean theorem tells us that because a is the hypotenuse of a right triangle 
with legs b and h, then h? + b* = a’, or h? = a? — b’. Equating the two expressions for 
h?, we get 


a —b? =ab. 


When both sides are divided by a’, this last equation becomes 


b\? _b ° 
1— (2) =-, or equivalently, (2) + (2) = 1. 
a a a a 


Now the value of the positive root of the quadratic equation x? + x = lisx = $(/5 — 1). 
Then we get the ratio 

b 1 

— = ~(V/5 — 1) = 0.6180339..., 

a 2 
the reciprocal of the “golden ratio,” a value that has proved signi cant many times in 
mathematics and its applications. 

How successful were the pyramid builders in achieving the golden ratio (if that, in- 

deed, was their aim)? Checking with the actual measurements taken at the Great Pyramid, 
we see that 


a= Vh2 + b2 = / (481.2)? + (377.89) = 611.85, 


leading to the value 
b 
— = 0.61762.... 
a 


The theory that the Egyptians intended to use the golden ratio as a theoretical basis 
for building the Great Pyramid seems to have been rst set down by a certain John Taylor, 
who in 1859 published The Great Pyramid, Why Was It Built and Who Built It? An amateur 
mathematician, Taylor spent 30 years collecting and comparing measurements reported 
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by successive visitors to the Pyramid. Although he made a number of scale models of 
the Pyramid, he never set eyes on it himself. Because the only passage in Herodotus’s 
History concerning its size reads, “Its base is square, each side is 800 feet long and its 
height is the same,” a leap of faith would be required to justify Taylor’s claim. Moreover, 
the dimensions Herodotus recorded are themselves way off the mark. 

Another theory that is often taken as gospel is that the total area of the pyramid can 
be expressed in a way that leads to the golden ratio; that is, the area of the base is to 
the sum of the areas of the triangular faces as this sum is to the sum of the areas of the 
faces and base. Because the sum of the areas of the four face triangles is 4 - 3(2ba) and 
the area of the base is (2b)’, the claim reduces to the assertion that 


4b? _ 4ab 
4ab- 4ab + 4b?’ 


or in an equivalent form, 


ba 
a ats 
Using the previously calculated value for a, one nds_ that 
611. 
oe 205s :.,. 
a+b 989.74 


whence the quotients b/a and a/a + b are nearly the same. Whether this is a matter of 
accident or design is open to speculation. 

There are some wilder theories. Some people maintained, for instance, that the Egyp- 
tians had erected the pyramids as dikes to keep the sands of the desert from moving and 
covering the cultivated area along the Nile. A popular belief during the Middle Ages was 
that they were granaries the captive Hebrews were forced to build for storing corn dur- 
ing the years of plenty. This legend has been preserved in the mosaics, done about A.D. 
1250, of the Church of Saint Mark in Venice. Part of the pictorial narrative of the story of 
Joseph shows his brother being sent to fetch sheaves of grain from the pyramids. Specu- 
lation began to assume a more scientic appearance in 1864, when one highly regarded 
professor of astronomy (Charles Piazzi Smyth, the astronomer royal of Scotland) worked 
out to his own satisfaction a unit of measurement for the Great Pyramid, which he called 
the pyramid inch, equal to 1.001 of our inches. Using this mystical “pyramid inch” to 
measure the bumps and cracks along the walls of the interior passages and chambers, he 
concluded that the Great Pyramid was designed by God as an instrument of prophecy, 
a so-called Bible in stone. (The British Egyptologist Flinders Petrie wrote that he once 
caught one of the pyramid cultists surreptitiously ling down a stone protuberance in 
order to make its measurements conform to his theories.) If one knew how to read its 
message, there would be found in the pyramid all sorts of signi cant information about 
the history and future of humanity: the Great Flood, the birth of Christ, the beginning 
and end of World War I, and so on. When Smyth dated the start of World War I as 
1913, his believers jubilantly pointed out that he had erred “by only one year.” Smyth 
and his followers posed fanciful, extravagant theories about the “secrets” locked in the 
measurements of the Great Pyramid. Their near miss in foretelling the date of the Great 
War notwithstanding, these many enthusiastic speculations must be dismissed as stuff 
and nonsense. 
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Although we can be certain that the pyramid builders had already a fair knowledge 
of geometry, singularly little mathematics of this period has come down to us. Our 
two chief mathematical papyri, although different in age, may be said to represent the 
state of the subject at the time 2000-1750 B.c. Reviewing everything, we are forced to 
conclude that Egyptian geometry never advanced beyond an intuitive stage, in which the 
measurement of tangible objects was the chief consideration. The geometry of that period 
lacked deductive structure—there were no theoretical results, nor any general rules of 
procedure. It supplied only calculations, and these sometimes approximate, for problems 
that had a practical bearing in construction and surveying. 


2.4 Problems 


1. Solve the following geometrical problems from the 


Rhind Papyrus. 


(a) Problem 41. A cylindrical granary of diameter 9 
cubits and height 10 cubits. What is the amount 
of grain that goes into it? [Hint: Use the 
Egyptian value for 7, namely 4 ae to get the 
scribe’s answer.] 

(b) Problem 51. Example of a triangle of land. 
Suppose it is said to thee, what is the area of a 
triangle of side 10 khet and of base 4 khet? 
[Hint: The accompanying gure in the papyrus 
is apparently intended to be a right-angled 
triangle. ] 

(c) Problem 58. If a pyramid (square) is 934 cubits 
high and the side of its base is 140 cubits long, 
what is its seked? [Hint: Given an isosceles 
triangle of base s and height h, we know that 
its seked equals s/2h, or the cotangent ratio of 
trigonometry. The seked in this problem has 
been associated with the slope of the lateral 
faces of the Second Pyramid at Gizeh.] 


(a) The Babylonians generally determined the area 
of a circle by taking it as equal to ~ the square 
of the circle’s circumference. Show that this is 
equivalent to letting 7 = 3. 

(b) A Babylonian tablet excavated in 1936 asserts 
that when a more accurate determination of 

area is needed, the + should be multiplied 

by 0;57,36, that is, by 3. What value for z 


does this correction factor yield? 


. Archimedes (about 287-212 B.C.) in his book 
Measurement of a Circle stated: The area of a circle 
is to the square on its diameter as 11 to 14. Show 
that this geometric rule leads to 2 for the value of z. 


. The sixth-century Hindu mathematician Aryabhata 


had the following procedure for nding the area of a 
circle: Half the circumference multiplied by half the 
diameter is the area of a circle. How accurate is this 
rule? 


. The Babylonians also knew a formula for the volume 


of a truncated square pyramid, namely, 


a+b\ 1 fa—b\’ 
V=h , 
(HY 5 Z) 
where / is the altitude and a and b are the lengths of 
the sides of the square (upper and lower) bases. 


Show that this reduces to the formula of the Moscow 
Papyrus. 


. A Babylonian tablet has been discovered in which 


the volume of the frustum of a cone is determined by 
using the (erroneous) formula 


V = 3h(r? + R?), 


where h is the height and r and R are the radii of the 
bases. Take h = 6, r = 4, and R = 2 and compare 
the Babylonian result with the result from the correct 
formula 


V = tnh(R?+rR+r’). 


. Heron of Alexandria (circa A.D. 75?) found the 


volume of the frustum of a cone by calculations 
equivalent to using the formula 


1 ‘i 
V =ynh(r + RY, 


where / is the height and r and R are the radii of the 
bases. If 2 is taken for the value of 2, what answer 
would Heron have gotten for h = 6, r = 4, and 
R=2? 


. The text of Problem 10 of the Moscow Papyrus is 


illegible at some points, but a calculation is 
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performed, using the equivalent of the formula 


S=(1- 4)’ @da, 


which seems designed to give the surface area of a 
hemispherical basket of diameter d = 4 5. Show that 


if 4 (8) is set equal to z, this yields the right 
formula for the area of a hemisphere, namely md*/2. 


(a) 


(b) 


Starting with the area formula A = 4x y sind 
for a triangle in terms of two sides and the 
angle between them, use the accompanying 
gure to derive the formula 


A= s(ad sin A +absin B 
+ besinC + cd sin D) 


for the area of a quadrilateral. 


B 


d 
D c c 


Show that if A represents the area of the 
quadrilateral in part (a), then 


2 (a+c)b+d) 
~ 4 

with equality holding if and only if the 
quadrilateral is a rectangle. Thus, the ancient 
rule for the area of a quadrilateral overestimates 
the areas of all quadrilaterals that are not 
rectangles, so that tax assessors could well have 
continued to use this convenient formula long 
after they came to suspect that it never led them 


to underestimate the areas of quadrilateral 
elds. 


A 


’ 
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10. (a) Prove that of all triangles having two given 
sides of lengths a and b, the one whose sides 
form a right angle encompasses the maximum 
possible area. 

(b) Another formula giving the area of an arbitrary 
quadrilateral is 


A? =(S —a)(S — b\(S — c)(S — d) — T. 


Here S$ = s(a+b+c+d) is the 
semiperimeter and 


T = abcd cos*(A + C)/2, 


where A and C are a pair of opposite vertex 
angles of the quadrilateral. Show that the 
maximum possible area corresponding to the 
given values a, b, c, and d occurs when the 
angles A and C (and hence B and D also) are 
supplementary. 


11. While measuring the Great Pyramid, Charles Piazzi 
Smyth (named for his godfather, Giuseppe Piazzi, 
discoverer of the rst known asteroid, Ceres) found a 
niche in the Queen’s Chamber that was 185 pyramid 
inches long. Use this dimension to verify that 
accuracy of the following assertions Smyth made in 
his book Our Inheritance in the Great Pyramid: 


(a) The length of the Grand Niche multiplied by 
10z is equal to the height of the Great Pyramid. 


(b) The square root of 10 times the height of the 
north wall in the Queen’s Chamber (182.4 
pyramid inches) divided by the length of the 
Grand Niche is equal to z; that is, 


_ [10082.4) 
a 


Most of what we know about the mathematics devel- 
oped in Mesopotamia, rst by the Sumerians and then 
later by the Akkadians and other people, is relatively 
new. This subject is called Babylonian mathematics, 
as if a single people had created it. Hitherto the great 


emphasis had been placed on the achievements of the Egyptians. For some time, it was 
known that the large Babylonian collections at the British Museum and the Louvre abroad 
and at Yale, Columbia, and the University of Pennsylvania in the United States contained 
many undeciphered cuneiform tablets of unusual types. The exhaustive studies of Otto 
Neugebauer, which reached fruition in the 1930s, revealed these to be mathematical 
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tables and texts, and thus a key to the “reading” of their contents was found. Chie y 
through the decipherment, translation, and interpretation of this scholar, an entirely new 
light has been thrown on what the Babylonians contributed to the development of ancient 
mathematics. 

In investigating Babylonian mathematics, we are much less fortunate than with 
Egyptian mathematics. Because the Babylonian mode of writing on clay tablets discour- 
aged the compilation of long treatises, there is nothing among the Babylonian records 
comparable with the Rhind Papyrus. Nonetheless, several hundred mathematical tablets 
have been recovered, many in an excellent state of preservation. The great majority of 
these (about two-thirds) are “Old Babylonian,” which is to say that they belong roughly 
to the period 1800-1600 B.c. Through this rich mine of source material we now know 
that except possibly for certain geometric rules, the Babylonians far outstripped the 
Egyptians in mathematics. Although Babylonian mathematics too had strong empirical 
roots that are clearly present in most of the tablets that have been translated, it seems to 
have tended toward a more theoretical expression. (The Babylonians can claim priority 
in several discoveries, most notably the Pythagorean theorem, usually ascribed to later 
mathematical schools.) The key to the advances the Babylonians made appears to have 
been their remarkably facile number system. The excellent sexagesimal notation enabled 
them to calculate with fractions as readily as with integers and led to a highly developed 
algebra. This was impossible for the Egyptians, for whom every operation with fractions 
involved a multitude of unit fractions, thereby making a difcult problem out of each 
division. 

The Babylonians, freed by their remarkable system of numeration from the drudgery 
of calculation, became indefatigable compilers of arithmetic tables, some of them ex- 
traordinary in complexity and extent. Numerous tables give the squares of numbers | 
to 50 and also the cubes, square roots, and cube roots of these numbers. A tablet now 
in the Berlin Museum gives lists of not only n* and n> for n = 1,2,..., 20, 30, 40, 50 
but also the sum n* + n°. It is surmised that this was used in solving cubic equations 
that had been reduced to the form x* + x* = a. Another large group of tables deals with 
the reciprocals of numbers. The standard format of such a table usually involves two 
columns of gures, such as 


4 15 
5 12 
6 10 
8 7;30 
9 6;40 
10 6 
12 5 
15 4 
16 3345 
18 3320 


where the product of each pair of numbers is always 60. That is, each pair consists of a 
number on the left-hand side and its sexagesimal reciprocal on the right-hand side. These 
tables have certain gaps in them; missing are such numbers as 7, 11, 13, and 14, and 
some others. The reason is that only nite sexagesimal fractions were comprehensible 
to the Babylonians, and the reciprocals of these “irregular” numbers are nonterminating 
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sexagesimals. For instance, in the sexagesimal expansion for 4 the block 8,34,17 repeats 
itself in nitely often: 


1 = 0;8, 34, 17, 8, 34,17... 


(The analogous situation occurs in our own system, in which the reciprocal of, say, 
- = 0.090909... is in nite when expanded decimally.) When an irregular number like 
7 does appear in the rst column, the statement is made that 7 does not divide, and an 
approximation is given. A Sumerian tablet of 2500 B.c. calls for dividing the number 


5,20,0,0 by 7; the calculation is presented as 
(5, 20, 0, 0)(0; 8, 34, 17, 8) = 45, 42, 51; 22, 40, 


where 5,20,0,0 has been multiplied by the reciprocal of 7 approximated to the fourth 
place. A later table seems to give upper and lower bounds on the size of 4, namely, 


8, 34, 16,59 < 4 < 8, 34, 18. 


We can picture the scope of some tables of reciprocals from a tablet in the Louvre— 
dating from 350 B.c_—that comprises 252 entries of one-place to seventeen-place divisors, 
and one-place to fourteen-place reciprocals. This table is a list of numbers n and n’ for 
which the product nn’ equals 1 or some other power of 60. As a specic example, one 
line contains the values 


2,59, 21, 40, 48, 54 20, 4, 16, 22, 28, 44, 14, 57, 40, 4, 56, 17, 46, 40 
which may be thought of as representing the product 


(2- 60° + 59-604 +---+48- 60+ 54) 
x(20- 60% + 4.602 +...+46- 60+ 40) = 60". 


It appears that calculations of this magnitude were necessary in the work of the as- 
tronomers of the time. 

As suggested previously, the Babylonians did not carry out division by the clumsy 
duplication method of the Egyptians. Instead, they interpreted a divided by b to mean 
that a is multiplied by the reciprocal of 5; that is, a/b = a(1/b). After having found, 
either in a table or by calculation, the reciprocal of the divisor, they needed only to 
multiply it by the dividend. For this purpose, the Babylonian scribes had at their disposal 
multiplication tables, almost always giving the products of a certain number multiplied 
successively by 1, 2,3,...18, 19, 20 and then by 30, 40, and 50. On one tablet of 1500 
B.C. are tables of 7, 10, 124, 16, 24, each multiplied by the foregoing series of values. 
Thus, the procedure for, say, 7 divided by 2 would be to multiply the reciprocal of 2 by 
7: 


7(0;30) = 0;210 = 3;30, 


which is of course the sexagesimal notation for 34. 


The Babylonian Treatment of Quadratic Equations 


Distinct from the table tablets are tablets that deal with algebraic and geometric 
problems. These generally present a sequence of closely related numerical problems, 
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together with the relevant calculations and answers; the text often terminates with the 
words “Such is the procedure.” Although none of them gives general rules, the consis- 
tency with which the problems were treated suggests that the Babylonians (unlike the 
Egyptians) had some sort of theoretical approach to mathematics. The problems often 
seem to be intellectual exercises, instead of treatises on surveying or bookkeeping, and 
they indicate an abstract interest in numerical relations. 

There are scores of clay tablets that indicated that the Babylonians of 2000 B.C. were 
familiar with our formula for solving the quadratic equation. This is well illustrated by 
an Old Babylonian text that contains the following problem: 


I have added the area and two-thirds of the side of my square and it is 0;35. What is the 
side of my square? 


It is often possible to translate such problems directly into our symbolism by replacing 

words like length (or side) and width by the letters x and y. In modern notation, we 

would express the content of this problem as 
x24 2x = 3 


The details of solution are described by verbal instructions in the text as follows: 


You take 1, the coef ci ent [of x]. Two-thirds of 1 is 0;40. Half of this, 0;20, you multiply 
by 0;20 and it [the result] 0;6,40 you add to 0;35 and [the result] 0;41,40 has 0;50 as its 
square root. The 0;20, which you have multiplied by itself, you subtract from 0;50, and 0;30 
is [the side of] the square. 


Converted to modern algebraic notation, these steps tell us that 


0;40\7 0:40 
= palit 0:35. 
vay (7P) +038-5 


= /0;6,40 + 0;35 — 0;20 


= /0;41,40 — 0; 20 
= 0;50 — 0;20 = 0;30. 


Thus, the Babylonian instructions amount to using a formula equivalent to the familiar 


rule 
=\Q+r-$ 


for solving the quadratic equation x? + ax = b. Although the Babylonian mathematician 
had no “quadratic formula” that would solve all quadratic equations, the instructions 
in these concrete examples are so systematic that we can be pretty certain they were 
intended to illustrate a general procedure. 

Historically, it is perhaps more appropriate to speak of rectangular instead of 
quadratic equations, for it was the problem of rectangles that gave rise to these equations. 
In the ancient world, the error was widespread that the area of a plane gure depended 
entirely on its perimeter; people believed that the same perimeter always conned the 
same area. Army commanders estimated the number of enemy soldiers according to the 
perimeter of their camp, and sailors the size of an island according to the time for its 
circumnavigation. The Greek historian Polybius tells us that in his time unscrupulous 
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Old Babylonian cuneiform text containing 16 problems with solutions. (Copyright 
British Museum.) 


members of communal societies cheated their fellow members by giving them land of 
greater perimeter (but less area) than what they chose for themselves; in this way they 
earned reputations for unsel shness and generosity, while they really made excessive 
pro ts. 

Evidently the problem of how the perimeter of a rectangle related to its area was 
systematically investigated in antiquity. A typical problem in early Babylonian mathe- 
matics was the following. Given the semiperimeter x + y = a and the area xy = b of a 
rectangle, nd the length x and width y. How did they go about the solution? We can only 
speculate, for there is no explicit indication anywhere in the mathematical texts of this 
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period of how one arrived at the result. The Babylonian mathematicians were empiricists 
and observers who worked with tables that presented the facts in an orderly fashion. In all 
likelihood, they must have constructed the tables for the different values the area might 
assume, the perimeter being kept constant. Thus, for a rectangle whose semiperimeter 
x + y =a = 20, the resulting area might have been tabulated for the variations of 


a mn d a 
x=-4+2z an =--Z, 
2 1 =F 
where z is one of the numbers 0 through 9. 
a 4 a b (<) i 
x=- Z =--— =x - _ 
2 oe i 2 
z=0 10 10 100 
a—a| 11 9 99 1? 
Z=2 12 8 96 2? 
a) 13 7 91 2 
z=4 14 6 84 2 
a 15 | 75 5? 
z=6 16 4 64 62 
£=7 17 3 51 P 
z=8 18 2 36 8? 
z=9 19 1 19 92 


The lesson demonstrated by the numbers in the table is that the areas decrease with 
the growth of z, and that the difference (a/2)° — b always equals the square of z; that is, 


At some point, it surely dawned on the Babylonians that they could invert the procedure 
and ascertain z from the value of (a/2)? — b. This would give 


and, as a result, the unknowns 


per ee geal oper 

In the beginning, these conclusions were established empirically through observation 
of concrete facts; there were no logical speculations nor any deductive reasoning from 
proven theorems. The best that can be said for the ancient approach is that it substituted 
patience for brilliance. Later Babylonians would undoubtedly have realized that if the 
sum x + y =a were given, then the larger quantity, say x, would exceed a/2 by a 
certain amount z. It is evident that since the sum x + y is xed, x can gain only what 
y loses. Thus, 
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the sum of which is a. Substituting these values in the equation xy = b leads to 


(+) G-2)=0 
whence 


The implication is that 


P= Gy —b 
and so 


= 


The negative root was neglected, and this was usual until modern times. With the value 
of z known, x and y can now be obtained: 


=H @-% of G> 


This approach can be illustrated with a typical example. A cuneiform tablet in the 
Yale Babylonian collection asks (in speci c numbers) for the solution of the two algebraic 
equations, 


xty=8,  xy=3 


The Babylonian method just described calls for setting x and y equal to B. plus or minus 
a correction z; that is, 


Then the rst equation is satis ed, because 


ety=(Ft2)+(Z-2) 
and the second equation xy = _ becomes 


(B4+2)(2-2)=8 
This reduced to 


169 _ 32 _ 15 
16 4 =o? 
or 
2— 169 _ 15 _ #9 
- 16 2 ~ 16" 
Thus z = f, and one nds immediately that 
= B47 =~B_7_3 
x= Gtq=5, Y= 4 42° 


The same idea can be used if the difference x — y is initially given, instead of x + y. 
Proceeding analogously, the Babylonians would have solved the system 


x—y=4, 


xy =b 
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by putting 
a 
S25 7 and yoz- 


from which the solution then follows: 


Qs 1 (F-$ 


More complicated algebraic problems were reduced, by various devices, to the fun- 
damental systems 


xty=a, xy =b, 


which we shall call normal form. For instance, one tablet contains the numerical equiv- 
alent of the problem. 


xty=%, x+ytxy = 14. 


The values of x and y are given as ; and i, respectively, but the manner in which the 
solution was found is not given. It was probably effected by subtracting the rst equation 
from the second, to get 
xy=x+ytxy-@ty)=14-B=2. 
The problem then amounts to solving the system 
x+y= 8, xV= 6 


and by the procedure discussed earlier, 


Two Characteristic Babylonian Problems 


A standard type of Babylonian problem consisted in keeping the condition xy = b 
xed but varying the second equation to arrive at more elaborate expressions in x and 
y. This is evidenced by another tablet, in which one is required to solve, in our notation, 


xy = 600, («+ yy + 120(x — y) = 3700. 


Apparently the Babylonians were aware of the identity (x + y)? = (x — y)? + 4xy, which 
allowed them to convert (x + y)* to (x — y)* + 2400. When this substitution is made, 
the second equation becomes 


(x — y? + 120(x — y) = 1300, 
a quadratic in x — y. An application of their quadratic formula gives the value of x — y: 
x—y =f (22) + 1300 — 22 = ¥4900 — 60 = 70 — 60 = 10. 
The Babylonian mathematician would then need to solve the system of equations 


x—y= 10, xy = 600, 
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which would give no difculty. In fact, the usual method of setting x =z-+5 and 
y =z-—5S gives rise to the solution x = 30, y = 20. 

The Babylonians knew about quadratic equations of the forms x? + ax = b and 
x? = ax +); and their respective solutions 


$l 


were clearly and expressly taught through numerous examples. The negative square root, 
which would have led to a negative value of the solution x, was always neglected; 
nowhere in Babylonian mathematics were negative solutions to quadratics recognized. 
The type of quadratic x* + 5 = ax seems also to have been well known but transformed 
by all sorts of ingenious devices to the system x + y =a, xy = b. The Babylonians’ 
experience showed that x? + b = ax led to two distinct solutions, namely, x = a/2 + 
JV(a/2)2 — b and x = a/2—./(a/2)? — b. Yet the idea of two values for one and the 
same quantity must have seemed a logical absurdity to the Babylonians, something to be 
circumvented at all costs. 

Tablets at Yale University contain hundreds of similar problems (200 on one tablet 
alone), without solutions, arranged in systematic order. Only a few tablets have been 
preserved, so there must have been thousands of problems in the original series. In one 
case, the simultaneous equations for solution are 


xy = 600, $(x + y)? — 60(x — y) = —100, 


an extraordinary example of a negative number in the right-hand member. The concept of 
a negative number standing by itself—as distinguished from an indicated subtraction— 
was not current even in Europe 2500 years later. 

In a_ nal illustration of the algebraic character of Babylonian mathematics, let us 
consider a problem in which a reed, the usual measuring rod, of unknown size is used 
to measure the length and width of a rectangular eld. The translated tablet reads: 


I have a reed. I know not its dimension. I broke off from it one cubit and walked 60 times 
along the length. I restored to it what I have broken off, then walked 30 times along the 
width. The area is 6,15. What is the original length of the reed? 


The common unit for linear measures of land at the time was the ninda, or 12 cubits; 
thus, 4 of a ninda was broken off from the rod of unknown dimension. If the original 
rod is assumed to have size x, then the length of the eld was 60(x — t) because the 
eld was 60 times as long as the shortened rod. When the cubit was returned to the 
rod, the width of the eld was 30 times the restored length of the complete rod, or 30x. 
Because the area of the eld is 375, it is found that 


30x - 60 (x — 4) = 375, 
which leads to the quadratic equation 
1800x” = 150x + 375. 


On multiplying both sides of this equation by 1800, the author of the tablet would have 
gotten 


(1800x)* = 150(1800x) + 1800 - 375, 
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a quadratic in 1800x. And setting vy = 1800x would give 
y? = 150y + 1800 - 375. 


The instructions given in the cuneiform text are equivalent to substituting in the familiar 


formula 
(rs 


for a root of y? = ay + b. Adapting the rule to the numbers of the present problem, one 
gets 


y = 752 + (1800)(375) + 75 = 825 + 75 = 900, 


from which x = 4 ninda is determined. 


2 


2.5 Problems 


1. Write the fractions 2, 3, and 2 in sexagesimal 


notation by 


(a) using the Babylonian method of nding the 
reciprocal of the denominator and then 
multiplying by the numerator; and 

(b) multiplying numerator and denominator by 
60 and simplifying. 


. A tablet in the Yale Babylonian collection reads: 


I found a stone but did not weigh it; after I 
added ; and added a I weighed it: [result] 1 
mina. What was the original weight of the 
stone? The original weight was 2 mina, 8 


shegels and 22: se. 


Use the equivalences 1 mina = 60 shegels and 1 
shegel = 180 se to verify the indicated solution. 
[Hint: Call the original weight of the stone x, so that 
(x + x/7) + 4 +x/7) = 60 shegels.] 


. Find the solution to this ancient Babylonian problem. 


There are two silver rings; 4 of the rst ring 
and + of the second ring are broken off, so 
that what is broken off weighs | sheqel. The 


rst diminished by its ; weighs as much as the 


second diminished by its 4. What did the 


silver rings originally weigh? 


[Hint: Consider the system of equations 


x y_ 6x _ 10y 


i ie a 
where x and y are the weights of the two rings. ] 


. A typical Babylonian problem of 1700 B.C. calls for 


nding the sides of a rectangle given its 
semiperimeter and area; that is, solve systems of 
equations of the type x + y =a, xy = b. Find the 
solution of the particular system x + y = 10, 

xy = 16. [Hint: The Babylonians might have used 
the identity (x — y? =(x + y —4xy to nd x — y.] 


. Another Babylonian problem is 


To the area of a rectangle, the excess of the 
length over the width is added, giving 120; 
moreover, the sum of the length and width is 
24. Find the dimensions of the rectangle. 


[Hint: The problem can be put in the form of two 
equations xy + x —y = 120, x + y = 24; if the 
substitution y = z — 2 is made, the system becomes 
x+z= 26, xz = 144.] 


. Using the Babylonian procedures, solve each of these 


systems: 


(a) x-y=6,xy= 16. 
(b) x-y=4,xy=21. 
(c) x+y=8, xy = 15. 


. On a Babylonian tablet, this problem is solved: 


x+y =27, xy +(x — y) = 183. 


Solve by rst letting z = y + 2 to get the system 
x+z= 29, xz = 210. 


. On a tablet in the British Museum, this problem is 


solved: 


What the length is, the depth is also [except 
for a coef cient of 12]. A box is hollow. If I 
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1;10, and if the length measures 0;30, what is 
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Find the length of the sides of the two squares. 


: 12. Tabulate the values of n3 +n? for n = 1 ton = 10, 
the width? : : . 
and use this table to solve the cubic equation 
3 Ot spe : : 
In solving this problem, let x, y, and z be the length, ais . + 12x" = 48. [Hint: Multiply the given 
width, and depth of the box, respectively, so that equation by 12.] 
13. Using the procedure indicated in the hint, solve each 
oe 1 ee tay Ty Re iat: of the following Babylonian problems: 

9, A classical example of the quadratic equation in ei 2 36,49= ay = 500; (ay Sabi 
Babylonian mathematics is found on a tablet in the h q ion f h f th 
Brite Museum: sihidacemice: the second equation from the square of the rst 

: , to get a quadratic in x — y. Solving this 
I have added 7 times the side of my square to quadratic leads ae system of the form 
11 times its surface to obtain 6;15. Reckon ae emai oe ‘ é ; 
with 7 and 11, (b) x+y =50, x? + y? +(x —y)? = 1400. [Hint: 
Subtract the square of the rst equation from 
Solve for the scribe’s answer of 0;30 for the side of twice the second equation to get a quadratic in 
the square. [Hint: The injunction to “reckon with 7 x—y.] 
and 11” means simply that 11x? + 7x = 6;15. (c) xy = 600, (x +) + 60% — y) = 3100. [Hint: 
Multiply both sides of this equation by 11, thereby The formula (x + vy) = (x — y) + 4xy leads 
turning it into a quadratic in 11x.] to a quadratic in x — y.] 
= = py)? = od 
10. Heron of Alexandria solved the quadratic equation a y «—y¥ y et ni 
ll. , 2 eee : The formula (x — y)* = (x + y)° — 4xy leads 
wx + =x +x = 212 by multiplying both sides by touquadeitic tna: 
11-14. Carry out Heron’s calculations to obtain x. ‘ >, 2 a : 
(e) xy = 600, x° + y° = 1300. [Hint: Square the 
11. An old Babylonian text reads: rst equation to produce a system in u = x? 
and v=y?,] 
I have added the areas of my two squares; (f) x—y=10, x2 +? = 1300. [Hint: Subtract 


[result] 25,25. [The side of] the second square 


aa ; the square of the rst equation from the second 
is ; the side of the rst plus 5. 


equation. ] 


Another oddity in the history of mathematics was brought 
to light when the Babylonian clay tablet Plimpton 322 
(catalog number 322 in the G. A. Plimpton collection at 
Columbia University) was deciphered by Neugebauer and 
Sachs in 1945. This tablet is written in Old Babylonian 
script, which dates it somewhere between 1900 B.c. and 
1600 B.c. The analysis of this extraordinary group of gures establishes beyond any doubt 
that the so-called Pythagorean theorem was known to Babylonian mathematicians more 
than a thousand years before Pythagoras was born. We recall that Pythagoras’s result, 
which gives the relation between the lengths of the sides of a right triangle, is expressed 
succinctly in the formula x? + y? = z?. 

The text in question, Plimpton 322, is the right-hand part of a larger tablet with 
several columns. As is evident from the break at the left-hand side, this tablet was 
originally larger. The existence of modern glue on the break implies that the other part 
was lost after the tablet was excavated. The tablet is further marred by a deep chip near 
the middle of the right-hand edge and a aked area in the top left corner. The following 
list conveys its contents. 


2.6 — Plimpton 322 


A Tablet Concerning 
Number Triples 


Plimpton 322 


B 

119 169 1 
3367 4825 (11521) 2 
4601 6649 3 
12709 18541 4 
65 97 5 
319 481 6 
2291 3541 7 
799 1249 8 
481 (541) 769 9 
4961 8161 10 
45 75 11 
1679 2929 12 
161 (25921) 289 13 
1771 3229 14 
56 106 (53) 15 


Three columns of numbers are preserved, each with a heading. The last column 
contains nothing but the numbers 1, 2,..., 15, indicating that it enumerates the lines. 
The preceding two columns are more interesting and are headed by words that might 
be translated as “width” and “diagonal.” It is not dif cult to verify that they form the 
leg and hypotenuse of an integral-sided right triangle. In other words, if the numbers in 
the middle column are squared and one subtracts from each of them the square of the 
corresponding number in the rst column, a perfect square results. For instance, the rst 
row contains the equation 


(169)* — (119)° = (120). 


The text involves several errors, and in the list, the original readings on the tablet 
appear in parentheses to the right of the corrected gures. In line 9, the occurrence of 
541 instead of 481 is undoubtedly a scribal error, because in sexagesimal notation, 541 
is written 9,1 and 481 is written 8,1. In line 13, the scribe wrote the square of 161 in 
place of the number itself, and the number in the last line is half the correct value. There 
remains an unexplained error in the second line. 

The question naturally arises about how the Babylonians derived the numbers x, y, 
and z satisfying the equation x7 + y? = z*. The values involved in Plimpton 322 are 
so large that they could not have been obtained simply by guesswork; using trial-and- 
error methods, one would have run across many simpler solutions before these. If the 
Babylonians possessed a clearly discernible method for solving the Pythagorean equation, 
what was it? A clue is found in a fourth, but incomplete, column along the broken left- 
hand edge of the Plimpton tablet. It contains a list of the values z7/x*, which suggests 
that the relation x7 + y* = z? was reduced to 


If @ =z/x and 6B = y/x, this becomes 
a? — p? =1. 


The problem would then be to construct right triangles whose sides have the rational 
lengths 1, a, and 6, where a* — 8? = 1. Now, the critical step is recognizing that this 
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Babylonian tablet, Plimpton 322. (By courtesy of Columbia University.) 


last equation can be expressed as 
(a+ Ba — B)=1. 


All the numbers concerned are rational, so if the product of two numbers is 1, they are 
reciprocals. That is, one number must be m/n and the other n/m, where m and n are 
integers. Setting 


m n 
a+p=— and a-p=-, 
n m 
we nd by addition that 


and by subtraction that 


Consequently, 


(1) a= 


2 
m 
’ B = 


But y = 6x and z = ax; if we now put x = 2mn, so as to get a solution in integers, it 
follows that 


2mn 


x = 2mn, y=m —n’, z=m +n’. 
These are well-known formulas for nding right triangles with sides of integral length and 
were used in Hellenistic times by Diophantus (circa 150), the most original mathematician 
of late antiquity. 


Plimpton 322 


iD 


To arrive at these formulas, apart from the ability to add and subtract fractions, one 
needs as the key result the algebraic formula 


a” — B? = (a + B)(w — ). 


This may have been discovered by consideration of a gure like the one herewith. The 
shaded area aw? — B? can be dissected as shown and then rearranged as shown, that 
is, as a rectangle with sides of length aw + 6 and w — B. Hence, we have a? — B* = 


(a + B)(a — B). 
id 
B 
| se 


——- a o| 


The accompanying table exhibits the values of m and n that give rise to the solutions 
in Plimpton 322. For example, taking m = 12 and n = 5 in formula (1), we arrive at 


x = 120 y=119, z= 169. 


The latter two numbers are entries in the rst line of the tablet. The only exception is in 
line 11. Here the choice m = 2 and n = 1 leads to x = 4, y = 3, and z = S, and each 
of these must be multiplied by 15 to produce the values listed. One interesting point 
that emerges from examining the table is that m and n always factor into products of 
powers of 2, 3, and 5. 


m n x =2mn y=m —n z=m +n? 
2.3 5 120 119 169 
26 33 3456 3367 4825 
3.57 25 4800 4601 6649 
5 23? 13500 12709 18541 
3? 2? 72 65 97 
27.5 3? 360 319 481 
2.33 5? 2700 2291 3541 
25 3-5 960 799 1249 
5 2? .3 600 481 769 
34 22:35 6480 4961 8161 
Exception 60 45 75 
O43 5? 2400 1679 2929 
3-5 23 240 161 289 
2.5? 33 2700 1771 3229 


3? =) 90 56 106 
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The lists of m and that make up the rst two columns of the table are such 
that their reciprocals all have terminating sexagesimal expansions. If 1/N has the nite 
expansion 


N60 602) + 60F 
then it can be written as 1/N = a/60*, whence 60° = aN. The implication is that N 
contains only the prime factors that appear in 60‘ and therefore in 60. But because 60 
has the factorization 60 = 27 - 3-5, the permissible factors of N are 2, 3, and 5; that is 
to say, with suitable exponents a, 6, and y, we must have N = 2% - 3° . 5”. 
It has been suggested that the values of z in the Plimpton tablet were not computed 
directly from z = m? + n* but from the equivalent formula 


z=(m+ny —2mn. 


This proposal furnishes an intriguing explanation of the scribal error in line 2 of the 
tablet (the case in which m = 2° = 64, n = 3° = 27). In using the displayed formula, 
the writer may have made two mistakes. First, he or she may have added the term 2mn, 
when it should have been subtracted; and then, in calculating the term itself, may have 
written 2-60-27, where 2-64-27 was called for. This would produce the incorrect 
value 


z = (64427) +3240 = 8281 + 3240 = 11521, 
instead of 


z = (644 27) — 3456 = 8281 — 3456 = 4825. 


Babylonian Use of the Pythagorean Theorem 


Some of the most impressive treasures of the Babylonian past have been unearthed 
at Susa, capital of ancient Elam, a country bordering on Babylonia and often hostile 
to it. Susa has been more or less continuously excavated and for a longer time than 
any other site in southern Mesopotamia. Its shapeless mounds were identi ed by the 
British archaeologist William Kennett Loftus, who directed his workmen in sinking the 
rst trenches in 1854. But large-scale excavations did not really begin until the French 
archaeological mission took over the diggings in 1884. In 1902, the mission discovered 
in the acropolis of Susa one of the outstanding landmarks in the history of humanity: 
the code of laws of King Hammurabi I (circa 1750 B.c.). The code is carved on a well- 
polished column of black diorite, which was carried back to Susa from Babylon as a 
trophy of war. Judged by present-day standards, the 285 articles are a strange mixture 
of the most enlightened adjudication with the most barbarous punishment. They stressed 
the principle of “equivalent retaliation,” according to which a punishment would be the 
equivalent of the wrong done: “If a man has destroyed the eye of an aristocrat, they 
shall destroy his eye.” Although Hammurabi used to be called the rst lawgiver, recent 
discoveries have shown that there were several earlier collections of Sumerian legal 
decrees. 
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Of more mathematical interest is a group of tablets uncovered by the French at 

Susa in 1936. These provide some of the oldest Babylonian examples of the use of the 
theorem of Pythagoras. One tablet computes the radius 7 of a circle that circumscribes 
an isosceles triangle of sides, 50, 50, and 60. 
The solution is as follows. The Pythagorean theorem is used rst with regard to trian- 
gle ADB to get AD = 40. Because r = AE, we then have ED = 40 —r. A second 
application of the Pythagorean theorem, this time to the triangle E DB, leads to the 
equation 


r? =30°+(40-ry, 


which can be solved to give r = a, ory = 31315, 


Another Old Babylonian tablet contains the problem, 


A beam of length 0;30 [stands in an upright position against a wall]. The upper end has 
slipped down a distance 0;6. How far did the lower end move [from the wall]? 


The answer is correctly found with the aid of the Pythagorean theorem. 


The Cairo Mathematical Papyrus 


Until recently, scholarly opinion has differed about whether the ancient Egyptians 
were aware of even a single instance of the Pythagorean theorem, let alone acquainted 
with the general validity of the proposition. It is well known that as early as 4000 years 
ago the Egyptians had trained surveyors, the harpedonaptae, whose principal measuring 
instrument was a stretched rope. The precise orientation of the foundations of the im- 
mense structures of ancient Egypt with the four cardinal points of the compass led some 
historians to surmise that these “rope-stretchers” were able to construct right angles using 
ropes divided by two knots into sections that were in the proportion 3:4:5. When the two 
ends of the rope were tied and the sections drawn taut around pegs laid out at the three 
knots, the rope would take the shape of a right triangle. However simple this approach 
may appear today, no surviving document from antiquity conrms that it actually took 
place. 

The so-called Cairo Mathematical Papyrus, unearthed in 1938 and rst examined 
in 1962, establishes conclusively that the Egyptians of 300 B.c. not only knew that the 


78 


Chapter 2 Mathematics in Early Civilizations 


Tablet in the Yale Babylonian Collection, showing a square with its 
diagonals. (Yale Babylonian Collection, Yale University.) 


(3, 4, 5) triangle was right-angled, but also that the (5, 12, 13) and (20, 21, 29) triangles 
had this property. Dating from the early Ptolemeic dynasties, this papyrus contains 40 
problems of a mathematical nature, of which 9 deal exclusively with the Pythagorean 
theorem. One, for instance, translates as, “A ladder of 10 cubits has its foot 6 cubits from 
a wall; to what height will it reach?” 

Two problems are particularly interesting, because they demonstrate the advance in 
Egyptian mathematical technique from the time of the Rhind Papyrus. These concern 
rectangles having areas of 60 square cubits and diagonals of 13 and 15 cubits. One is 
required to nd the lengths of their sides. Writing, say, the rst of the problems in modern 
notation, we have the system of equations 


x? 4+ y* = 169, xy = 60. 
The scribe’s method of solution amounts to adding and subtracting 2xy = 120 from the 
equation x? + y? = 169, to get 

(x+yP = 289, (x—y) = 49; 

or equivalently, 

x+y=17, x-y=7. 
From this it is found that 2y = 10, or y = 5, and as a result x = 17-5 = 12. 

The second problem, 


x? + y? = 225, xy = 60, 
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is similar, except that the square roots of 345 and 105 are to be found. There were several 
methods for approximating the square root of a number that was not a perfect square. In 
this case, the scribe used a formula generally attributed to Archimedes (287-212 B.C.), 
which is also found in Babylonian texts, 


b 
Va+be —. 
oe ae 2a 
The approximations arrived at are 


J/345 = J182 +21 + 18+2=18+44+4 
and 
V105 = V10? +5 © 10+ # = 10+ j. 


The Babylonians arrived at the geometric equivalent of Archimedes’ formula through 
a procedure known as “square plus sides.” To obtain /N, the rst step was to form a 
square whose area a’ was near to, but smaller than, NV. The difference b = N — a? was 
viewed as the area of a rectangle with sides of length a and b/a. This rectangle was then 


bisected, and its pieces appended to adjacent edges of the square: 


The con guration produced could be interpreted as a square, provided that the missing 
shaded corner piece is ignored. Because this new square has sides a+ 56/2a it fol- 
lows that a? + b © (a + b/2a)*, which then leads to the Archimedean approximation for 
VN. 

Among the Babylonian tablets seeking square roots, there is one from about 2000 
B.c. that asks for the diagonal of a rectangle with sides 0; 40 and 0; 10. After posing the 
problem, the scribe gives the following instructions: 


You square the base 0;10 to get 0;01,40. Take the reciprocal of the height 0; 40 [result 
1;30] and multiply by 0;01,40. You will see 0;02,30. Take its half. Add this 0;01;15 to 
0; 40. You will see 0;41, 15, the diagonal. 


Expressed in algebraic terms, the calculations carried out are 


1 1 
d= 0; 40)2 0; 10)? = 0;40 + — | —— 0; 10)? = 0:41, 15, 
V (0; 40)2 + (0; +3(sa)t )? = 0;41, 15 


which is recognized as a particular case of the approximation 


b? 
le a at 
a 


2.6 Problems 


In a Babylonian tablet, the following problem is 
found. Given that the circumference of a circle is 60 
units and the length of a perpendicular from the 
center of a chord of the circle to the circumference is 
2 units, nd the length of the chord. In solving this 
problem, take 7 = 3. 


An Old Babylonian tablet calls for nding the area of 
an isosceles trapezoid whose sides are 30 units long 
and whose bases are 14 and 50. Solve this problem. 


14 


30 30 


50 


In another tablet, one side of a right triangle is 50 
units long. Parallel to the other side and 20 units 
from this side, a line is drawn that cuts off a right 
trapezoid of area 5, 20 = 320 units. Find the lengths 
of the bases of the trapezoid. [Hint: If A is the area 
of the original triangle, then 320+ I5y = A = 25x, 
and $(x + y)20 = 320.] 


[oe \ 


x 


In a similar problem, a right triangle whose base is 
30 units is divided into two parts by a line drawn 
parallel to the base. It is given that the resulting right 
trapezoid has an area larger by 7,0 = 420 than the 
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upper triangle, and that the difference between the 
height y of the upper triangle and the height z of the 
trapezoid is 20. If x is the length of the upper base of 
the trapezoid, these statements lead to the relations 
$2(x + 30) = txy + 420, y—z = 20. 


30 


The problem calls for nding the values of the 
unknown quantities x, y, and z. [Hint: By properties 
of similar triangles, y/(v + z) = x/30.] 


Another Old Babylonian problem calls for nding the 
length of the sides of an isosceles trapezoid, given 
that its area is 150, that the difference of its bases is 
5 (that is, b} — by = 5), and that its equal sides are 
10 greater than two-thirds of the sum of its bases 
(that is, spH=SQ.= 2(, + b>) + 10). 


Solve this problem using an incorrect Babylonian 
formula for the area of a trapezoid, namely, 
b +b 
AaU +b 81+ 5a 
2 2 
A Babylonian tablet of 2000 B.C. gives two methods 
for calculating the diagonal d of a rectangle with 
sides of length 40 and 10 units. The rst leads (in 
speci c numbers) to the approximation 


2ab* 


Pi) Ean 
4+ 3600 


where a is the larger side and 5 is the shorter side, 
and the second method to the approximation 


hb? 
dxya+—. 
: 2a 
Check the accuracy of these approximations to 
1700 by squaring the respective answers. 


Bibliography 


7. Because a is smaller than Ja? + b when b > 0, 
whereas a + b/a is larger, the Babylonian 
mathematician often approximated /a? + b by 
taking the average of these two values; that is, 


Use this formula to get rational approximations to 
V2, V5, and V/17. [Hint: In the rst case put a = 4, 
b= 3; in the second case, put a = 2, b = 1.| 


8. An iterative procedure for closer approximations to 
the square root of a number that was not a square 
was obtained by Heron of Alexandria. In his work 
Metrica (discovered as recently as 1896 in 
Constantinople in a manuscript dating from the 
eleventh or twelfth century), he merely states a rule 
that amounts to the following. If A is a nonsquare 
number and a? is the nearest perfect square to it, so 
that A =a*+b,a_ rst approximation to JA is the 
average of the values a and A/a; that is, 


1 A 
= 5 aa et ‘ 


This number can be used to get a more accurate 


approximation, 
1 4. A 
Dl _ i 
aay tas = 


and the process is repeated as often as desired. 


(a) Find approximate square roots, through two 
approximations, of the numbers Heron used to 
explain his method, namely, 720 and 63. 

(b) Show that Heron’s rst approximation was 
equivalent to a formula of Archimedes—which 
in turn was a generalization of the Babylonian 
method—namely, 


b 
V@etbrat—. 
2a 
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3.1 


CHAPTER 5 
The Beginnings of Greek Mathematics 


He is unworthy of the name of man who does not know that the diagonal of a square is incommensurable 
with its side. 


PLATO 


The Greeks made mathematics into one dis- 


The Geometrical Discoveries of Thales cipline, transforming a varied collection of 


empirical rules of calculation into an orderly 
Greece and the Aegean Area and systematic unity. Although they were 

plainly heirs to an accumulation of Eastern 
knowledge, the Greeks fashioned through their own efforts a mathematics more profound, 
more abstract (in the sense of being more remote from the uses of everyday life), and 
more rational than any that preceded it. In ancient Babylonia and Egypt, mathematics 
had been cultivated chie y as a tool, either for immediate practical application or as part 
of the special knowledge be tting a privileged class of scribes. Greek mathematics, on 
the other hand, seems to have been a detached intellectual subject for the connoisseur. 
The Greeks’ habits of abstract thought distinguished them from previous thinkers; their 
concern was not with, say, triangular elds of grain but with “triangles” and the char- 
acteristics that must accompany “triangularity.” This preference for the abstract concept 
can be seen in the attitude of the different cultures toward the number 2; the Babylo- 
nians had computed its approximations to a high accuracy, but the Greeks proved that it 
was irrational. The notion of seeking after knowledge for its own sake was almost com- 
pletely alien to the older Eastern civilizations, so that in the application of reasoning to 
mathematics, the Greeks completely changed the nature of the subject. Plato’s inscription 
over the door of his Academy, “Let no man ignorant of geometry enter here,” was not 
the admonition of an eccentric but rather a tribute to the Greek conviction that through 
the spirit of inquiry and strict logic one could understand a person’s place in an orderly 
universe. 

All history is based on written documents. Although documentation concerning 
Egyptian and Babylonian mathematics is often very precise, the primary sources that can 
give us a clear picture of the early development of Greek mathematics are meager. In 
Greece, there was no papyrus such as was available in Egypt, no clays as in Babylonia. 
Such “books” as were written must have been very few; and with the passing of time and 
ravages of the elements, little original material has survived. Consequently, early Greek 
history is a morass of myths, legends, and dubious anecdotes, preserved by writers who 


83 


84 


Chapter 3. The Beginnings of Greek Mathematics 


lived centuries later than the events under consideration. We depend on fragments and 
copies of copies many times removed from the original document. However scrupulous 
the copyist may have been in Iling in obscure passages in an earlier text, we can never 
be sure how much the copyist had to call on his or her own imagination or indeed, how 
well the copyist understood the original. 

The Greeks were not always conned to the southeastern corner of Europe, their 
location in modern times. Although the Egyptians had kept to themselves, the Greeks were 
great travelers. Their colonization of the coasts and the offshore islands of Asia Minor 
from the eleventh to the ninth century B.C. was a prologue to later large-scale movements 
from mainland Greece. About the middle of the eighth century B.C., a network of Greek 
cities was founded on the coastal reaches of the Mediterranean, with scattered settlements 
as far aeld as the eastern end of the Black Sea. Down to 650 B.c., the main vent for 
Greek expansion was lower Italy and Sicily; the many ourishing colonies there caused 
the whole area to be given the name Greece-in-the-West. Although the earlier migration to 
Asia Minor was probably the result of the Dorian conquest of large portions of the Balkan 
Peninsula, economic distress and political unrest in the homeland were the new incentives 
to spread overseas. An increase in population caused a crisis in land ownership, as well as 
a serious shortage of food. All these migrations not only provided an outlet for dissatis ed 
elements of the population at home but also served to establish foreign markets and to 
lay the material foundations of art, literature, and science. Although Hellenic culture had 
its beginnings in an expanded Greece, in due course peninsular Greece became only a 
part of “Greater Greece.” By 800 B.c., there was, broadly speaking, a unity of language 
and custom throughout the ancient Mediterranean world. 

The wave of colonization that took place outside of the Aegean from the eighth to 
the sixth century B.C. paved the way for an extraordinary breakthrough of reason and 
the attendant cultural advancement. Historians have called this phenomenon the Greek 
miracle. The miracle of Greece was not single but twofold— rst the unrivaled rapidity 
and variety and quality of its achievement; then its success in permeating and imposing 
its values on alien civilizations. For this, the colonies were like conduits through which 
Greek culture owed to the world of the “Barbarians,” and the older Egyptian and 
Babylonian cultures streamed to the Greeks. It is remarkable that all the early Greek 
mathematics came from the outposts in Asia Minor, southern Italy, and Africa, and not 
from mainland Greece. It is as if the scanty Greek populations living next door to the more 
developed societies had their wits sharpened by this contact, as well as having access to 
the knowledge gathered by them. The most decisive of all Greek borrowings was the art of 
writing with the convenient Phoenician alphabet. Each of the symbols of the Phoenician 
alphabet stood for a consonant; there were no signs for vowels. The Phoenician alphabet 
had more consonant symbols than the Greek language required, so the Greeks set out 
by selecting and adapting the consonant symbols they needed. Thereafter, they assigned 
vowel values to the remaining symbols, adding only such new signs as they needed 
(for instance a, which had a consonantal value in the Phoenician alphabet, became the 
symbol for the vowel A in the Greek alphabet). As in other matters, the Greek city-states 
vied with each other in the elaboration of the alphabet, with as many as 10 different 
versions getting under way. Gradually, one of these local alphabets, the Ionian, gained 
the ascendancy; and after its of cial adoption by Athens in 403 B.c., it spread rapidly 
through the rest of Greece. Although the acceptance of alphabetic writing did not initiate 
anything like popular education, the ease with which it could be learned made possible a 
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wider distribution of learning than had prevailed in the older cultures, where reading and 
writing were the property of a priestly class. (Although the Phoenician traders eventually 
spread the new device throughout the Mediterranean world, the intelligentsia of Egypt 
and Babylonia disdained the alphabet—possibly because they had invested lifetimes in 
learning the elaborate ideograms that were the mysterious delight of specialists.) 
Coinage in precious metals was invented in the Greek cities of Asia Minor about 
700 B.C., stimulating trade and giving rise to a money economy based not only on 
agriculture but also on movable goods. In rendering possible the accumulation of wealth, 
this new money economy permitted the formation of a leisure class from which an 
intellectual aristocracy could emerge. Aristotle recognized how important nonpractical 
activity is in the advancement of knowledge when he wrote in his Metaphysics: 


When all the inventions had been discovered, the sciences which are not concerned with the 
pleasures and necessities of life were developed rst in the lands where men began to have 
leisure. This is the reason why mathematics originated in Egypt, for there the priestly class 
was able to enjoy leisure. 


In most ancient civilized societies, an educated elite, usually priests, directed the 
activities of the community. Whether the priests were themselves the government (as 
in early Babylonia) or merely its servants (as in Egypt), prociency in writing and 
mathematics was considered part of their special skills. In the structure of Egyptian 
bureaucracy, the man of learning held a position of great privilege and potential power. 
The Greek historian Polybius remarked that “the Egyptian priests obtained positions 
of leadership and respect because they surpassed their fellows in knowledge.” Eastern 
learning was a mystery shared only by the specialists and not destined for the citizenry. 
Although an able and ambitious man had some opportunity to improve his lot through 
education, these hopes were seldom realized—just as very few of Napoleon’s soldiers 
ever became eld marshals. By contrast, Greek education was far more broadly based 
and designed to produce gentleman amateurs. Perhaps the difference was that the Greeks 
had no powerful priesthood that could monopolize learning as its own preserve; no 
sacred writings or rigid dogmas that required the mind’s subservience. In any event, the 
rst Greek intellectuals came not from the class of governmental managers but from 
people of affairs, for whom business was a profession and learning a pastime. 

Geography shaped the pattern of Greek political life. In Egypt and Babylonia, it was 
easy to subject a large population to a single ruler, but in Greece, where every district was 
separated from the next by mountains or the sea, central control by an absolute monarch 
was impossible. Mountainous barriers were not enough to prevent invasion, but they 
were enough to prevent one state from being merged with another. Patriotic loyalty was 
to the native city—Athens, Corinth, Thebes, or Sparta—and not to Greece as a whole. In 
great emergencies the Greek states acted collectively, seeing that they must unite or be 
destroyed. During the Persian invasions of the later sixth and early fth centuries B.c., 
they pooled their ghting forces to defeat Darius at Marathon (490 B.c.) and Xerxes at 
Salamis (480 B.c.), after a rearguard action by 300 Spartans at Thermopylae. On none 
of these occasions was the union successful or long-lasting, because with each victory 
the city-states would promptly fall out and exhaust themselves in long local wars. The 
lack of political unity made the outcome inevitable. The end came when Philip II of 
Macedonia overpowered the mixed Greek forces at the battle of Charonea in 338 B.C. 
and established himself as the head of all the Greek states except Sparta. Philip died 
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two years later, and the power passed into the hands of his son, Alexander the Great, 
who achieved what no leader before had done. He unied Greece and carried Greek 
civilization to the limits of the known world. In 323 B.c., when Alexander died at age 
32, he ruled over conquests of more than 2 million square miles. But neither Greeks 
nor Greek culture vanished with the change of masters. The years that followed—from 
the time of Alexander the Great into the rst century B.c—formed a brilliant period of 
history known to scholars as the Hellenistic Age. 


The Dawn of Demonstrative Geometry: Thales of Miletos 


The rise of Greek mathematics coincides in time with the general owering of Greek 
civilization in the sixth century B.C. (“Greek civilization” usually indicates a culture be- 
ginning in the Iron Age and_ourishing most brilliantly in the fth and fourth centuries 
B.C.) From the modest beginnings with the Pythagoreans, number theory and geome- 
try developed rapidly, so that early Greek mathematics reached its zenith in the work 
of the great geometers of antiquity—Euclid, Archimedes, and Apollonius. Thereafter, 
the discoveries were less striking, although great names such as Ptolemy, Pappus, and 
Diophantus testify to memorable accomplishments from time to time. These pioneering 
contributors exhausted the possibilities of elementary mathematics to the extent that little 
signi cant progress was made, beyond what we call Greek mathematics, until the six- 
teenth century. What is more striking still is that almost all the really productive work 
was done in the relatively short interval from 350 to 200 B.C., and not in the old Aegean 
world but by Greek immigrants in Alexandria under the Ptolemies. 

The rst individuals with whom specic mathematical discoveries are traditionally 
associated are Thales of Miletus (circa 625-547 B.C.) and Pythagoras of Samos (circa 
580-500 B.c.). Thales was of Phoenician descent, born in Miletus, a city of Ionia, at 
a time when a Greek colony ourished on the coast of Asia Minor. He seems to have 
spent his early years engaged in commercial ventures, and it is said that in his travels 
he learned geometry from the Egyptians and astronomy from the Babylonians. To his 
admiring countrymen of later generations, Thales was known as the rst of the Seven 
Sages of Greece, the only mathematician so honored. In general, these men earned the 
title not so much as scholars as through statesmanship and philosophical and ethical 
wisdom. Thales is supposed to have coined the maxim “Know thyself,” and when asked 
what was the strangest thing he had ever seen, he answered “An aged tyrant.” 

Ancient opinion is unanimous in regarding Thales as unusually shrewd in politics 
and commerce no less than in science, and many interesting anecdotes, some serious and 
some fanciful, are told about his cleverness. On one occasion, according to Aristotle, 
after several years in which the olive trees failed to produce, Thales used his skill in 
astronomy to calculate that favorable weather conditions were due the next season. 
Anticipating an unexpectedly abundant crop he bought up all of the olive presses around 
Miletus. When the season came, having secured control of the presses, he was able to 
make his own terms for renting them out and thus realized a large sum. Others say that 
Thales, having proved the point that it was easy for philosophers to become rich if they 
wished, sold his olive oil at a reasonable price. 

Another favorite story is related by Aesop. It appears that once one of Thales’ mules, 
loaded with salt for trade, accidentally discovered that if it rolled over in a stream, the 
contents of its load would dissolve; on every trip thereafter, the beast deliberately repeated 
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the same stunt. Thales discouraged this habit by the expedient of lling the mule’s 
saddlebags with sponges instead of salt. This, if not true, is certainly well invented and 
more in character than the amusing tale Plato tells. One night, according to Plato, Thales 
was out walking and looking at the stars. He looked so intently at the stars that he fell 
into a ditch, whereupon an old woman attending him exclaimed, “How can you tell what 
is going on in the sky when you can’t see what is lying at your feet?” This anecdote was 
often quoted in antiquity to illustrate the impractical nature of scholars. 

As we have seen, the mathematics of the Egyptians was fundamentally a tool, crudely 
shaped to meet practical needs. The Greek intellect seized on this rich body of raw 
material and re ned from it the common principles, thereby making the knowledge more 
general and more comprehensible and simultaneously discovering much that was new. 
Thales is generally hailed as the rst to introduce using logical proof based on deductive 
reasoning rather than on experiment and intuition to support an argument. Proclus (about 
450), in his Commentary on the First Book of Euclid’s Elements, declared: 


Thales was the rst to go into Egypt and bring back this learning [geometry] into Greece. 
He discovered many propositions himself and he disclosed to his successors the underlying 
principles of many others, in some cases his methods being more general, in others more 
empirical. 


Modern reservations notwithstanding, if the mathematical attainments attributed to Thales 
by such Greek historians as Herodotus and Proclus are accepted, he must be credited 
with the following geometric propositions. 


e An angle inscribed in a semicircle is a right angle. 

e Acircle is bisected by its diameter. 

e The base angles of an isosceles triangle are equal. 

e If two straight lines intersect, the opposite angles are equal. 
e The sides of similar triangles are proportional. 


e Two triangles are congruent if they have one side and two adjacent angles, respec- 
tively, equal. 


Because there is a continuous line from Egyptian to Greek mathematics, all of the listed 
facts may well have been known to the Egyptians. For them, the statements would remain 
unrelated, but for the Greeks they were the beginning of an extraordinary development in 
geometry. Conventional history inclines in such instances to look for some individual to 
whom the “miracle” can be ascribed. Thus, Thales is traditionally designated the father of 
geometry, or the rst mathematician. Although we are not certain which propositions are 
directly attributable to him, it seems clear that Thales contributed something to the rational 
organization of geometry—perhaps the deductive method. For the orderly development 
of theorems by rigorous proof was entirely new and was thereafter a characteristic feature 
of Greek mathematics. 


Measurements Using Geometry 


Several stories purport to illustrate Thales’ interest in Egypt. According to legend, 
his most spectacular accomplishment while there was the indirect measurement of the 
height of the Great Pyramid by means of shadows. There are two versions of the story, 
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one describing a very simple method of measurement and the other a more complex 
method. The earliest version is that Thales observed the length of the shadow of the 
pyramid at that hour of the day when a man’s shadow is the same length as himself. 
Plutarch improved on this when he wrote in the Convivium: 


Although he [the king of Egypt] admired you [Thales] for other things, yet he particularly 
liked the manner by which you measured the height of the pyramid without any trouble 
or instrument; for, by merely placing your staff at the extremity of the shadow which the 
pyramid casts, you formed, by the impact of the sun’s rays, two triangles and so showed that 
the height of the pyramid was to the length of the staff in the same ratio as their respective 
shadows. 


Both versions of the story depend on the same geometric proposition, namely, that the 
sides of equiangular triangles are proportional. Thales, having thus conceived of two 
similar triangles, argued that the height / of the pyramid was to the length h’ of the staff 
as the length s of the pyramid’s shadow was to the length s’ of the shadow cast by the 
staff when it was held vertically: 


h/h' =s/s’. 


Thales knew already that the distance along each side of the base of the Great Pyramid 
was 756 feet and that his staff was 6 feet long. It was necessary only to measure the 
shadow of the pyramid (the distance from the tip of the shadow to the center of the base 
of the pyramid) and the shadow of the staff. It was 342 feet from the tip of the pyramid’s 
shadow to the edge of the base, and the shadow of the staff measured 9 feet. 


720 


Now Thales had all the required dimensions, for three items of the proportion would 
give him the missing fourth item. The height of the Great Pyramid was 


pe 2? pesaeaet 
s! 9 3 

Another practical application of geometry attributed to Thales is determining how far 
a ship at sea is from the shore. How he used his knowledge of geometry for this purpose 
can only be conjectured. According to Proclus, Thales used the congruence theorem, 
which asserts that a triangle is completely determined if one side and two adjacent 
angles are known. The most probable assumption is that Thales, observing the ship from 
the top of a lookout tower (say of height /) used the proportionality of the sides of two 
similar right triangles. All he needed was a simple instrument with two legs forming a 
right angle, so that he could mark off the point E where the line of sight with the ship 
cut the leg parallel to the ground. This would produce similar triangles ACB and DCE. 
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C (Eye of observer) 


E (Point on rod) 


Sea level 


B (Shi 
A (Base of tower) x (Ship) 


If x is the unknown distance of the ship, then by properties of similar triangles, one has 


x I+h 

yf 
or 

y= er) 


The only objection to this approach is that it does not depend directly on the theorem 
concerning two triangles in which corresponding sides and their adjacent angles are equal, 
as Proclus implied. 

Another possible approach is that to nd the distance x from the shore A to the ship 
B, one measures from A along a straight line perpendicular to AB an arbitrary length AC 
and determines its midpoint D. From C, construct a line CE perpendicular to AC (in a 
direction opposite to AB) and let E be the point on it which is in a straight line with B 
and D. Clearly, CE has the same length as 4B, and CE can be measured, so that AB is 
known. This supposition is open to a different objection. It hardly seems credible that to 
ascertain the distance of the ship, the observer should have had to reproduce and measure 
on land an enormous triangle. Such an undertaking would have been so inconvenient as 
to deprive Thales’ discovery of any practical value. 


B (Ship) 


E (Eye of observer) 


It is more likely that triangle ECD was constructed smaller and similar to triangle BAD, 
rather than congruent to it. 

Among his contemporaries, Thales was more famous as an astronomer than as a 
mathematician. A legend that reappears from time to time is that he amazed his fellow 
Greeks by predicting a solar eclipse in 585 B.c. Herodotus records that the event took 
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place during a battle between the Lydians and the Medes, and when day turned into 
night, the ghting ceased. The warring kings were so awed that they concluded a lasting 
peace. Although Thales’ fame as a scientist rests mainly on this achievement, it is almost 
impossible to give credence to the tale. The astronomical records of that time were not 
accurate enough to allow anything like a precise forecast. Thales may well have had some 
knowledge of cycles of lunar eclipses—these were already known to the Babylonians— 
but his alleged prediction of the year of a solar eclipse would only have been a fortunate 
guess. A likely explanation is that he happened to be the “wise man” known to the people 
who saw this striking phenomenon, and so they assumed that Thales must have been 
able to foresee it. 

Although Thales left no written record of any book or document behind him, he 
ranks high among mathematicians for his pioneering contribution to the logical devel- 
opment of geometry. He may even have been Pythagoras’s teacher; some sources tell 
that Thales recognized the genius of the young Pythagoras, to whom he taught all that 
he knew. 


The study of numbers in the abstract begins in 


Pythagorean Mathematics sixth century B.C. Greece with Pythagoras and 


the Pythagoreans. Our knowledge of the life 
Pythagoras and His Followers of Pythagoras is scanty, and little can be said 

with any certainty. Those scraps of information 
that have Itered down to us come from early writers who vied with each other in 
inventing fables concerning his travels, miraculous powers, and teachings. According 
to the best estimates, Pythagoras was born between 580 and 569 B.c. on the Aegean 
island Samos. He appears to have left Samos permanently as early as his eighteenth 
year to study in Phoenicia and Egypt, and he may have extended his journeys as far 
eastward as Babylonia. Some none-too-trustworthy sources say that when Egypt was 
conquered by the Persian king Cambyses in 525 B.c., Pythagoras was carried back to 
Babylonia with the other Egyptian captives. Other authorities indicate, however, that he 
followed Cambyses voluntarily. When Pythagoras reappeared after years of wandering 
(around the age of 50), he sought out a favorable place for a school. Banned from his 
native Samos by the powerful tyrant Polycrates, he turned westward and nally settled 
at Crotona, a prosperous Dorian colony in southern Italy. 

Founding a school was not unusual in the Greek world. The distinctive feature of the 
school of Pythagoras was that its aims were at once political, philosophical, and religious. 
Formed of some 300 young aristocrats, the community had the character of a fraternity 
or a secret society: it was a closely knit order in which all worldly goods were held in 
common. The school tried rigidly to regulate the diet and way of life of its members and 
to invoke a common method of education. Pupils concentrated on four mathemata, or 
subjects of study: arithmetica (arithmetic, in the sense of number theory as opposed to 
calculating), harmonia (music), geometria (geometry), and astrologia (astronomy). This 
fourfold division of knowledge became known in the Middle Ages as the “quadrivium,” 
to which was then added the trivium of logic, grammar, and rhetoric—subjects connected 
with the use of language. These seven liberal arts came to be looked on as the necessary 
and proper course of study for the educated person. 

Pythagoras divided those who attended his lectures into two grades of disciples: 
the acoustici (or listeners) and the mathematici. After three years of listening in mute 
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Pythagoras 
(circa 580-500 B.C.) 


(The Bettmann Archive.) 


obedience to Pythagoras’s voice from behind a curtain, a pupil could be initiated into 
the inner circle, to whose members were con ded the main doctrines of the school. 
Although women were forbidden by law to attend public meetings, they were admitted 
to the master’s lectures. One source indicates that there were at least 28 women in the 
select category of mathematici. When Pythagoras was close to 60 years old, he married 
one of his pupils, Theano. She was a remarkably able mathematician who not only 
inspired him during the latter years of his life but continued to promulgate his system of 
thought after his death. (Some contradictory sources say that Theano was Pythagoras’s 
daughter; yet others, that she was only a highly gifted pupil, never his wife.) 

Pythagoras followed the custom of Eastern teachers by passing along his views 
by word of mouth. He seems not to have committed any of his teachings to writing. 
And furthermore, the members of his community were bound not to disclose to outsiders 
anything taught by the master or discovered by others in the brotherhood as a result of the 
master’s teaching. Legend has it that one talkative disciple was drowned in a shipwreck 
as the gods’ punishment for his public boast that he had added the dodecahedron to the 
set of regular solids Pythagoras had enumerated. The symbol on which the members of 
the Pythagorean community swore their oaths was the “‘tetractys,” or holy fourfoldness, 
which was supposed to stand for the four elements: re, water, air, and earth. The 
tetractys was represented geometrically by an equilateral triangle made up of 10 dots, 
and arithmetically by the number 1+2+3+4= 10. 
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According to the Greek writer and satirist Lucian (120-180), Pythagoras asked some- 
one to count; when he had reached 4, Pythagoras interrupted, “Do you see? What you 
take to be 4 is 10, a perfect triangle and our oath.” 

Like other mystery cults of that time, the Pythagoreans had their strange initiations, 
rites, and prohibitions. They refused, for example, to eat beans, drink wine, pick up 
anything that had fallen, or stir a re with an iron. They insisted, in addition to these 
curious taboos, on a life of virtue, especially of friendship. From Pythagoreanism comes 
the story of Damon and Pythias. (Pythias, condemned to death for plotting against the 
king, was given leave to arrange his affairs after Damon pledged his own life if his friend 
did not return.) The ve-pointed star, or pentagram, was used as a sign whereby members 
of the brotherhood could recognize one another. It is told that a Pythagorean fell ill while 
traveling and failed to survive, despite the nursing of a kind-hearted innkeeper. Before 
dying, he drew the pentagram star on a board and begged his host to hang it outside. 
Some time later another Pythagorean, passing by, noticed the symbol and after hearing 
the innkeeper’s tale, rewarded him handsomely. 


The Pythagoreans fancied that the soul could leave one’s body, either temporarily 
or permanently, and that it could inhabit the body of another person or animal. As a 
result of this doctrine of transmigration of souls, they would eat no meat or sh lest the 
animal slaughtered be the abode of a friend. The Pythagoreans would not kill anything 
except as a gift to the gods, and they would not even wear garments of wool, since 
wool is an animal product. A story is told in which Pythagoras, coming across a small 
dog being thrashed, said, “Stop the beating, for in this dog lives the soul of my friend; 
I recognize him by his voice.” 

Accounts of Pythagoras’s death do not agree. What is clear is that political ideas were 
gradually added to the other doctrines, and for a time, the autocratic Pythagoreans suc- 
ceeded in dominating the local government in Crotona and the other Greek cities in south- 
ern Italy. About 500 B.C., there was a violent popular revolt in which the meetinghouse of 
the Pythagoreans was surrounded and set are. Only a few of those present survived. In 
several accounts, Pythagoras himself is said to have perished in the inferno. Those with a 
sense of drama would have us believe that Pythagoras’s disciples made a bridge over the 
re with their bodies, so that the master might escape the frenzied mob. It is said in these 
versions that he ed to nearby Metapontum but in the ensuing ight, having reached a 
eld of sacred beans, chose to die at the hands of his enemies rather than trample down 
the plants. With the death of Pythagoras, many members of the school emigrated to the 
Greek mainland; some stayed behind for a time but by the middle of the fourth century 
B.C. all had left Italy. Although the political in uence of the Pythagoreans was destroyed, 
they continued to exist for several centuries longer as a philosophical and mathematical 
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order. To the end, the dwindling band of exiles remained a secret society, leaving no 
written record, and with notable self-denial, ascribing all their discoveries to the master. 

What set the Pythagoreans apart from the other sects was the philosophy that “knowl- 
edge is the greatest puri cation,” and to them knowledge meant mathematics. Never be- 
fore or since has mathematics had such an essential part in life and religion as it did with 
the Pythagoreans. At the heart of their scheme of things was the belief that some sort of 
an operative reality existed behind the phenomena of nature, and that through the volition 
of this supreme architect, the universe was created—that beneath the apparent multiplic- 
ity and confusion of the world around us there was a fundamental simplicity and stability 
that reason might discover. They further theorized that everything, physical and spiritual, 
had been assigned its allotted number and form, the general thesis being “Everything is 
number.” (By “number” was meant a positive integer.) All this culminated in the notion 
that without the help of mathematics, a rational understanding of the ruling principles at 
work in the universe would be impossible. Aristotle wrote in the Metaphysics: 


The Pythagoreans...devoted themselves to mathematics; they were the rst to advance 
this study and having been brought up in it they thought its principles were the principles 
of all things. 


About Pythagoras himself, we are told by another chronicler that “he seems to have 
attached supreme importance to the study of arithmetic, which he advanced and took out 
of the domain of commercial utility.” 

Music provided the Pythagoreans with the best instance of their principle that “num- 
ber” was the cause of everything in nature. Tradition credits Pythagoras with the discovery 
that notes sounded by a vibrating string depended on the string’s length, and in particular, 
that a harmonious sound was produced by plucking two equally taut strings, one twice the 
length of the other. In modern terms, the interval between these two notes is an octave. 
Similarly, if one string were half again the length of the other, the shorter one would 
give off a note, called a “ fth,” above that emitted by the longer; whereas if one were a 
third longer than the other a “fourth” would be produced—one note four tones above the 
other. It was concluded that the most beautiful musical harmonies corresponded to the 
simplest ratios of whole numbers, namely, the ratios 2:1, 3:2, and 4:3 (the four numbers 
1, 2, 3, and 4 being enshrined in the famous Pythagorean tetractys, or triangle of dots). 

The Pythagorean views on astronomy could be considered an extension of this 
doctrine of harmonic intervals. Pythagoras held that each of the seven known planets, 
among which he included the sun and the moon, was carried around the earth on a 
crystal sphere of its own. Because it was surely impossible for such gigantic spheres to 
whirl endlessly through space without generating any noise by their motion, each body 
would have to produce a certain tone according to its distance from the center. The 
whole system created a celestial harmony, which Pythagoras alone among all mortals 
could hear. This theory was the basis for the idea of the “music of the spheres,” a 
continually recurring notion in medieval astronomical speculation. 

The Pythagorean doctrine was apparently a curious mixture of cosmic philosophy 
and number mysticism, a sort of supernumerology that assigned to everything material 
or spiritual a de nite integer. Among the writings of the Pythagoreans, we nd _ that 1 
represented reason, because reason could produce only one consistent body of truths; 2 
stood for man and 3 for woman; 4 was the Pythagorean symbol for justice, because it 
was the rst number to be the product of equals; 5 was identi ed with marriage, formed 
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as it was by the union of 2 and 3; 6 was the number of creation; and so forth. All the 
even numbers, after the rst even number, were separable into other numbers; hence 
they were prolic and were considered feminine and earthy—and somewhat less highly 
regarded in general. And because the Pythagoreans were a predominantly male society, 
they classied the odd numbers, after the rst one, as masculine and divine. 

Although these speculations about numbers as models of “things” strike us as far- 
fetched and fanciful today, it must be remembered that the intellectuals of the classical 
Greek period were largely absorbed in philosophy and that these same men, because they 
possessed intellectual interest, were the very ones who were laying the foundations for 
mathematics as a system of thought. To Pythagoras and his followers, mathematics was 
largely a means to an end, an end in which the human spirit was ennobled through a 
mystical contemplation of the good and the beautiful. Only with the foundation of the 
School of Alexandria do we enter a new phase in which mathematics is made into an 
intellectual exercise pursued for its own sake, independent of its utilitarian applications. 

Even though the Pythagoreans_ rst studied numbers less for themselves than for 
the things they represented, they were nonetheless led to recognize all sorts of new 
arithmetical properties. 


Nicomachus’s Introductio Arithmeticae 


The most complete exposition that has come down to us of the arithmetic of 
Pythagoras and his immediate successors is contained in the /ntroductio Arithmeticae of 
Nicomachus of Gerasa (circa 100). Though Nicomachus did not contribute signi cant 
new mathematical results, his Jntroductio Arithmeticae 1s noteworthy as the rst 
systematic work in which arithmetic was treated independent of geometry. The content 
is much the same as that of the number-theoretic books of Euclid’s Elements (Books 
VU, VII, and IX), but the approach is different. Whereas Euclid represented numbers 
by straight lines with letters attached—a system that allowed him to work with numbers 
in general without having to assign them specic values—Nicomachus represented 
numbers by letters with de nite values, thereby having to resort to all sorts of circum- 
locution to distinguish among undetermined numbers. Euclid always offered proofs of 
his propositions, a thing wholly lacking in Nicomachus. At times, Nicomachus simply 
enunciated a general result and gave concrete examples of it; on other occasions he left 
the general proposition to be inferred from the particular examples presented alone. 

Euclid did not share the philosophical proclivities (or more accurately, the 
Pythagorean tendencies) of Nicomachus but held himself to a more strictly scienti c 
level. Nicomachus’s treatise was probably like this because he was not a creative math- 
ematician and because he intended his work to be a popular treatment of arithmetic 
designed to acquaint the beginner with the important discoveries to that date. 

Despite a lack of originality and a mathematical poverty, Nicomachus’s /ntroductio 
became a leading textbook in the Latin West from the time it was written until the 1500s. 
The Arab world also became acquainted with Greek arithmetic through a translation 
of the Jntroductio by Thabit-ibn-Korra in the ninth century. Indeed, the in uence of 
Nicomachus’s treatise can be judged by the number of versions or commentaries that 
appeared in ancient times and also by the number of authors who quoted it. An indication 
of the book’s renown is that the second-century Greek writer Lucian, wishing to pay the 
highest compliment to a calculator, said: “You reckon like Nicomachus of Gerasa.” 
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Sometime about 450 B.c., the Greeks adopted an alphabetic notation for representing 
numbers; the rst nine letters of the Greek alphabet were associated with the rst nine 
integers, the next nine letters represented the rst nine integral multiples of 10, and the 
last nine letters were used for the rst nine integral multiples of 100. (Three older letters, 
not found in the present-day Greek alphabet, were introduced to make the required 27.) 
It is unlikely that the early Pythagoreans had any number symbols, so they must have 
thought of numbers in a strictly visual way, either as pebbles placed in the sand or as dots 
in certain geometric patterns. Thus, numbers were classi ed as triangular, square, pentag- 
onal, and so on, according to the shapes made by the arrangement of the dots. Numbers 
that can be represented in geometric form are nowadays called gurative, or polygonal, 
numbers; and such were considered by Nicomachus in the /ntroductio. Pythagoras him- 
self was acquainted at least with the triangular numbers, and very probably with square 
numbers, and the other polygonal numbers were treated by later members of his school. 

The numbers 1, 3, 6, and 10 are examples of triangular numbers, because each of 
these counts the number of dots that can be arranged evenly in an equilateral triangle. 


on A A 


i=! t,= 10 


Similarly, the numbers 1, 4, 9, and 16 are said to be square numbers, because as dots 
they can be depicted as squares. 


e [| H | 1 | 
5, =4 53=9 54= 16 


sp=l 


One can read off some remarkable number-theoretic laws from such con gurations. For 
instance, the sum of two consecutive triangular numbers always equals the square number 
whose “side” is the same as the “side” of the larger of the two triangles. This can be 
conrmed geometrically by separating the dots with a slash and then counting them, as in 
the accompanying gure. It is just as easy to prove the result by an algebraic argument. 
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However, rst notice how the triangular numbers are formed; each new one is obtained 
from the previous triangular number by adding another row containing one more dot 
than the previous row added. Thus, if ¢, designates the nth triangular number, then 


ty =tm-1 +H 
=tr2+(n—1)+n 


=f+24+3+---+(m—1)+n 
=1+24+3+---+(@-—l1)+n 


Our plan is to t together two triangles, each representing ¢,, (hence, each consists of n 


rows of dots), to produce a rectangular array whose sides are n and n + 1. In the next 
gure, for example, 1 = 5S. 


It is clear that such an array contains n(n + 1) dots and so 


2t, =n(n+ 1), 
or equivalently, 
_ aft) 
no 2 . 


With this formula available, one sees easily that the nth square number s, is the sum of 
two successive triangular numbers; for 


> nn+1) (n—I)n 
= = 4 = t+ hh. 
S n 5 + 5) + ht 


Gathering up the pieces, we get as a bonus an expression for the sum of the rst n 
numbers: 


n(n + 1) 


[$243 40048 = — > 


Likewise, a formula for the sum of the rst m odd numbers can be found. The 
appropriate starting point is the observation that a square made up of n dots on a side 
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can be divided into a smaller square of side n — | and an L-shaped border (a gnomon). 


By repeating this subdivision, as in the next diagram, it becomes evident that the differ- 
ences between successive nested squares produce the sequence of odd numbers; thus 


1434+5+---+(Qn-lh=n’. 


The Pythagoreans may have proved this result by rst considering the n equations 


Po =1 
2-1 =3 
3-2 =5 


Y—3?=7 


n? —(n—1)' =2n—-1. 
Adding these equations, we get 
1? + (2? — 17) + G? — 27) +--+ + [n? —(n — 19°] 
=14+34+5+---+(n—1), 


which reduces to 


n=14345+---+(2n-1). 


The Theory of Figurative Numbers 


The Pythagoreans could not have expected the theory of gurative numbers to attract 
the attention of later scholars of the highest rank. In 1665, the mathematician-philosopher 
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Pascal wrote his Treatise on Figurative Numbers. In it, he asserted that every positive 
integer was the sum of three or fewer triangular numbers. For instance, 


16=6+ 10, 39 =34 15421, 
25=1+3+421, 150 = 6 + 66 + 78. 


This remarkable result was conjectured by Fermat in a letter to Mersenne dated 1636 


and rst proved by Gauss in 1801. 
Another interesting pattern can be observed from the following equations: 


Pe to7 
P+2= 9=f 
P+243?= 36= 
P4234+334+4 = 100=¢. 
So far, the right-hand column gives the sequence of squares of the triangular numbers. 


This pattern leads one to suspect that the sum of the rst cubes equals the square of the 
nth triangular number. For a formal veri cation, let us begin by noting that the algebraic 


identity 
ea ae Te AR =) 3] oP [ee = ye] 
+--+ [k(k-1)+Qk-1L] = 


can be used to produce cubes. Taking successively k = 1,2,3,...,” in this formula 
produces the following set of equations: 


1=13 
34+5=23 
7494+11=33 


13+154+174+19=4 


[n(n — 1) + 1] + [n@ — 1) 4-3] +--+ + [n@ - 1)4 Qn - 1] =n". 
Adding together these last n equations, one nds_ that 
1+34+5474+9+4+---+ [n(n —1)+(2n-1)] 
= 13423433 4..-47, 


where the left-hand side consists of consecutive odd integers. The key to success lies 
in calculating the number of terms that appear on the left. For this, let us write the last 


term as 


non) Qn == wen — 12] EO], 
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so that the expression in question involves the sum of all odd integers from 1 to 
2[n(n + 1)/2] — 1, a total of n(n + 1)/2 terms. From what was proved earlier, we know 
that the sum of the rst n(n + 1)/2 odd integers equals [n(n + 1)/2[; it turns out that 


=f 
i) 


n* 


l 2 
PED 434-40 = [ED = 


This unexpected identity, relating sums of cubes to triangular numbers, goes back to the 
rst century and is usually attributed to Nicomachus himself. 

Finding a formula for the sum of the squares of the rst m numbers takes a bit more 
effort. Let us rst give a geometric argument for the case n = 4, using reasoning that can 
be generalized for any positive integer n. We begin by placing square arrays containing 
1°, 27, 3, and 4? dots adjacent to each other. 


oo id Es 


Next let us add horizontal rows consisting of 1, 3, 6, and 10 dots, respectively, to form a 
rectangle of width 1+ 2-+3-+4 and height 4+ 1. This is pictured here geometrically. 


~U 


Counting the dots in the squares and rows should make it clear that 
(Perera 1(143464 Wj] 042434044 1), 
and consequently that 


4.5.9 
6 


(174+ 2?+ 374 47) = 10-5—-20=30= 
Proceeding along similar lines, one can get a formula for 
P4243? 4---+n7, 


where n is arbitrary. Simply place square arrays for 17, 2, 37,...,n* dots side by side 
and tn rows of dots together, beginning with the shortest row on the bottom, to get a 
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rectangle. 


(n=5) 


The dimensions of the rectangle are 1 +2+3+---+n byn + 1, so that it encompasses 
a total of 


(1+2+3+4+---+a)@+1) 


dots. This gives one side of the desired identity. For the other side, we add the dots in 
consecutive squares and rows, to arrive at the sum 


(12422432 4.-.4n2) 
+ [Ll + 2) (2 3) ee 2 3s em), 
In algebraic form, our identity is 
(122422 4324...4n2) 
i a a a ae a | 
S(1494345.-4 nth, 
If we let S = 12+ 2? + 32+.---+n7, this becomes 


S+{1+04+2)+(0+24+3)+---+(+24+3+4+---+n)] 
=(14+2+34---+n)n+1). 


The foregoing expression can be simplied by appealing to the fact that the sum of the 
rst k integers is k(k + 1)/2; after making the appropriate substitutions, we get 


> 


1-2 2-3 3-4 n(n + 1) n(n + 1p 
S De ee sega el ete | 
+| 2 2 2 - 2 2 


which can be written 


! n(n +19 
S+ 510 +1) +22+)+364+)+--+a@4 =. 
This yields 
l n(n +1) 
St UP HLEP ttm ¢ Mh 2b 344m =e EY, 
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whence 


1 n(n+1)] _ n(n + 1) 
s+5[s+ Tee). ee 


It now becomes a matter of solving for S: 


358 n(n + 1) 7 n(n + 1) 


2 2 4 

_ a(nt In +1) 

= ri , 
which leads at once to 

n(n + 1)(2n + 1) 

6 : 


All in all, we have shown that the sum of the rst m squares has a simple expression in 
terms of n; namely, 


s= 


1)(2 1 
PEPE pe pate MET TD 


A strikingly original proof of the last result, propounded by the thirteenth-century math- 
ematician Fibonacci, comes from the identity 


k(k + 1)(2k + 1) = (k — 1)k(2k — 1) + 6k’. 
By putting k = 1,2,3,...,” in turn into this formula, one gets the set of equations 
Li2esa Ge? 
253-5 = 1+2+346-27 
3-4-7=2-3-546-32 


és — 1)n(QQn — 1) =H —2)(n — 1)Q2n — 3) + 6(n — 17° 
n(n + 1)(2n + 1) = (n — 1)n(2n — 1) 4+ 6n?. 


What is important is that a common term appears on the left-hand and right-hand sides 
of successive equations. When these n equations are added and common terms canceled, 
it is easily shown that 


n(n + 1)(2n + 1) = 6(1? + 27 +37 +--- +n”), 


leading to the desired conclusion. 


Zeno’s Paradox 


Not very far from Crotona was the Eleatic school, a philosophical movement chal- 
lenging the Pythagoreans’ doctrine that all natural phenomena can be expressed in some 
way by whole numbers. This rival school took its name from the Ionian colony at Elea 


102 


Chapter 3. The Beginnings of Greek Mathematics 


on the western coast of southern Italy and had as its most prominent member Zeno (circa 
450 B.c.). We know little of Zeno’s life other than Plato’s assertion that he went to Athens 
when nearly 40 years old, where he met with the youthful Socrates. Apparently Zeno 
was originally a Pythagorean and, like Pythagoras, played an active part in the politics of 
his native city. There is a widespread legend that he was tortured and killed by a tyrant 
of Elea whom he had plotted to depose. 

Zeno is remembered today for four clever paradoxes—preserved by Aristotle in his 
Physics—about the reality of motion. In these, Zeno pointed out the logical absurdities 
arising from the concept of “in nite divisibility” of time and space. The paradox most 
often quoted concerns Achilles and a tortoise: Achilles, the swiftest runner in Greece, 
can never catch a tortoise that has been given a head start. For, by the time Achilles 
reaches the tortoise’s starting point, the animal will have moved to another point; by the 
time Achilles reaches that point the tortoise will have advanced somewhat further. As 
the process continues inde nitely, Achilles—though the faster runner—always advances 
on the slower tortoise yet cannot overtake it. 

Although Zeno’s argument confounded his contemporaries, a satisfactory explana- 
tion incorporates a now-familiar idea, the notion of a “convergent in nite series.” The 
paradox rests partly on the misconception that an in nite number of ever-shorter lengths 
(and, similarly, time durations) must add up to an innite total. But an in nite series 
may have a nite sum. Suppose that Achilles runs 10 times as fast as the determined 
tortoise and gives it an initial start of 100 yards; say, Achilles runs 10 yards per second. 
Consider the distances he has to cover. They are successively 100 yards, 10 yards, 
1 yard, 1/10 yard, and so on. The total number of yards Achilles must travel in order 
to catch his slower competitor is 


ae 
100-104 1 4 th ax. 
ere ig 0G 


which forms a convergent geometric series with sum 1112 yards. In the same elapsed 
time (that is, 113 seconds) the distance covered by the tortoise will be the sum of the 
geometric series 10 + 1+ 4 + an +.---, which is found to be 113 yards. Accordingly, 
when Achilles has traversed 1113 yards he will be dead even with the tortoise, and 
ahead of him thereafter. 

Of course Zeno knew perfectly well that Achilles would win a race with a tortoise, 
but he was drawing attention to opposing theories on the nature of space and time. 
(There is a frequently told anecdote that Diogenes the Cynic refuted Zeno’s argument, 
while the latter was lecturing in Athens, simply by getting up and walking; but the 
story cannot be true because Zeno and Diogenes were not contemporary.) The Eleatic 
mathematical philosophers held that space and time are undivided wholes, or continua, 
that cannot be broken down into small indivisible parts. This was at variance with the 
Pythagorean idea that a line is made up of a series of points—like tiny beads or “numerical 
atoms”—and that time is likewise composed of a series of discrete moments. Zeno was 
partly responsible for the subsequent course of Greek mathematical thought. For at heart, 
his famous paradoxes were related to the application of in nite processes to geometry. 
Because of the inability of the Greek geometers to answer them in a clear manner, they 
banished from mathematics the use of methods that involved the concept of in nity and 
made a “horror of the in nite” part of the Greek mathematical tradition. 
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3.2 Problems 


. Plutarch (about A.D. 100) stated that if a triangular 
number is multiplied by 8, and 1 is added, then the 
result is a square number. Prove that this is fact and 
illustrate it geometrically in the case of fo. 


. Prove that the square of any odd multiple of 3 is the 
difference of two triangular numbers, speci cally 
that 


[3(2n + I) = tong — Bn4t- 


. Prove that if z, is a triangular number, then 94, + 1 is 


also triangular. 


. Write each of the following numbers as the sum of 
three or fewer triangular numbers: 


(a) 56, (b) 69, (c) 185, (d) 287. 


. Forn > 1, establish the formula 


(2n +1) = (4t, + 1° — (4t,). 


. Verify that 1225 and 41,616 are simultaneously 
square and triangular numbers. [Hint: Finding an 
integer n such that 


1 
i= mu = 1225 


is equivalent to solving the quadratic equation 
n> +n — 2450 = 0.] 


. An oblong number counts the number of dots in a 
rectangular array having one more row than it has 
columns; the rst few of these numbers are 


0, =2 0, =6 03 = 12 04 = 20 


and in general, the nth oblong number is given by 
On = n(n + 1). Prove algebraically and geometrically 
that 


(a) 0, =2+4+6+4+---+2n. 

(b) Any oblong number is the sum of two equal 
triangular numbers. 

(c) 0, +n? = toy. 

(d) 0, -—n? =n. 

(e) n? +20, +(n+ 1% =(2n +19. 

(f) 0, =14+24+34+---+n4+(n4+1) 
+ (n—1)+(n—2)4+---4+342. 


8. 


10. 


11. 
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In 1872, Lebesgue proved that (1) every positive 
integer is the sum of a square number (possibly 07) 
and two triangular numbers and (2) every positive 
integer is the sum of two square numbers and a 
triangular number. Conrm these results in the cases 
of the integers 9, 44, 81, and 100. 


Display the consecutive integers 1 through in two 
rows as follows: 


If the sum obtained by adding the ” columns 
vertically is set equal to the sum obtained by adding 
the two rows horizontally, what well-known formula 
results? 


Derive the identity 


[n(a — 1) 4+ 1] + [n@— 1)4+ 3]4+--- 
+ [n(n — 1) 4+ Qn —- 1) = 7°, 


where 7 is any positive integer. 


For any integer n > 1, prove that: 


(a) 14+24+3+---+(n—-l1)4+n 

+(n—1)+---4+34241=n’. 
1 

ba 3 4 

_ on 

~ (a+) 

[Hint: Use the splitting identity 


(b) ee ely 


1/k —1/(k +1) = 1/k(k +1) 


to rewrite the left-hand side. ] 
(c) 1-24+2-343-4+---+n(n+1) 
_ n(n + 1) +2) 
——$- 
[Hint: Use the identity k(k + 1) =k? +k and 
collect the squares. ] 


1 1 1 
Ts ast ete 


1 n 


Qn—lQn+1) 2n+1 
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12. 


13. 


14. 


| Hin Use the identity 


1 
(2k — 12k + 1) 


1 1 1 
~2\2%k-1  2k+1)° 


433453 4.-.4(2n — 13 = n2(2n? — 1). 
[Hint: Separate the left-hand side of the 
identity 


(e) 


13423433 +..-+4(2ny 


_ [Anant yy 
| 


into odd and even terms and solve for the sum 
of odd cubes.] 


Prove that the sum of a_ nite arithmetic series 
equals the product of the number of terms and 
half the sum of the two extreme terms; in 
symbols, this reads 


(a) 


a4 dy $20) 4a 3d) ees 


we 


+ (a+nd)=n| 5 


(b) Use the result of part (a) to conrm the 
identities 
n(3n — 1) 
14+4+7+---+@Gn—2)= > 


and 


14345+4---+(Qn-1)=n’. 


The identity 


(1+24+34+---4+ny 

= 134234334---4+73, n>1 
was known as early as the rst century. Provide a 
derivation of it. 


Prove the following formula for the sum of triangular 
numbers, given by the Hindu mathematician 
Aryabhata (circa 500): 


n(n + 1)(n +2) 


ht+totetee += 6 
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15. 


16. 
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[Hint: Group the terms on the left-hand side in pairs, 
replacing %_; +t, by k?; consider the two cases 
where n is odd and n is even.] 


Archimedes (287-212 B.c.) also derived the formula 


n(n + 1)(2n + 1) 


427437 4+..-47 = - 


for the sum of squares. Fill in any missing details in 
the following sketch of his proof. In the formula 
nw? =[k+(n—/FP 
=h+4+2kn-bH+(n—-ky, 
let k take on the successive values 1, 2, 3,...,” — 1. 


Add the resulting n — 1 equations, together with the 
identity 2n? = 2n’, to arrive at 


(4) (24-1)? = 2112 +2? +32 +--+ +7) 
+ 2[l(n — 1) + 2(n — 2) 
+ 3(n —3)+---+(n— 11]. 
Next, let k = 1,2,3,...,” in the formula 


RP =k+21+2+3+---+K&-D] 
and add the n equations so obtained to get 


(2%) 174274374..-+7n? 


=(1+24+3+---+n) 
+ 2[1(n — 1) + 2(n — 2) 
+ 3(n —3)4+---+(n— Dl]. 


The desired result follows on combining (*) 
and (**). 


The tetrahedral numbers count the number of dots in 
pyramids built up of triangular numbers. If the base 
is the triangle of side n, then the pyramid is formed 
by placing similarly situated triangles upon it, each 
of which has one less in its sides than that which 
precedes it. 
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17. 


33 


In general, the nth tetrahedral number 7, is given by 
the formula 


Th=tt+h+th+---+h. 
where ¢, is the Ath triangular number. Prove that 


_ntatD(n+2)_ ntl 


= —— (5 EH. 
G ra +n) 


[Hint: See Problem 14.] 


Use the following facts to derive the formula 
for the sum of the squares of the rst n 
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Geometric Proofs of the 
Pythagorean Theorem 
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integers: 
= 17), 
P4+2=5=342-1=H+2N7, 
124+274+37=14=642-4=4,42h, 


P42?+3?+4 =30=1042-10 


= t4+ 273, 
1274274+3°4+4+457=55=1542-20 
=ts+2T, 


where ¢, and 7; are the Ath triangular and tetrahedral 
numbers, respectively. 


Although tradition is unanimous in ascribing the 
so-called Pythagorean theorem to the great teacher 
himself, we have seen that the Babylonians knew 
the result for certain specic triangles at least a 
millennium earlier. We recall the theorem as “the 
area of the square built upon the hypotenuse of a 


right triangle is equal to the sum of the areas of the squares upon the remaining sides.” 
Because none of the various Greek writers who attributed the theorem to Pythagoras 
lived within ve centuries of him, there is little convincing evidence to corroborate the 
general belief that the master, or even one of his immediate disciples, gave the rst 
rigorous proof of this characteristic property of right triangles. Moreover, the persistent 
legend that when Pythagoras had discovered the theorem, he sacri ced a hundred oxen 
to the Muses in gratitude for the inspiration appears an unlikely story, because the 
Pythagorean ritual forbade any sacri ce in which blood was shed. What is certain is that 
the school Pythagoras founded did much to increase the interest in problems directly 
connected with the celebrated result that bears his name. 

Still more are we in doubt about what line of demonstration the Greeks originally 
offered for the Pythagorean theorem. If the methods of Book II of Euclid’s Elements 
were used, it was probably a dissection type of proof similar to the following. A large 
square of side a + b is divided into two smaller squares of sides a and b respectively, and 
two equal rectangles with sides a and b; each of these two rectangles can be split into 
two equal right triangles by drawing the diagonal c. The four triangles can be arranged 
within another square of side a + b as shown in the second gure. 


106 


Chapter 3 The Beginnings of Greek Mathematics 


Now the area of the same square can be represented in two ways: as the sum of the areas 
of two squares and two rectangles, 


(a+b) =a? +b? + 2ab; 
and as the sum of the areas of a square and four triangles, 


+p =era(S). 


When the four triangles are deducted from the larger square in each gure, the resulting 
areas are equal; or equivalently, c? = a* + b*. Therefore, the square on c is equal to the 
sum of the squares on a and b. 

Such proofs by addition of areas are so simple that they may have been made earlier 
and independently by other cultures (no record of the Pythagorean theorem appears, how- 
ever, in any of the surviving documents from ancient Egypt). In fact, the contemporary 
Chinese civilization, which had grown up in effective isolation from both the Greek and 
Babylonian civilizations, had a neater and possibly much earlier proof than the one just 
cited. This is found in the oldest extant Chinese text containing formal mathematical the- 
ories, the Arithmetic Classic of the Gnomon and the Circular Paths of Heaven. Assigning 
the date of this work is dif cult. Astronomical evidence suggests that the oldest parts go 
back to 600 B.C., but there is reason to believe that it has undergone considerable change 
since rst written. The rst rm dates that we can connect with it are over a century later 
than the dates for Nine Chapters on the Mathematical Art. A diagram in the Arithmetic 
Classic represents the oldest known proof of the Pythagorean theorem. 


The proof inspired by this gure was much admired for its simple elegance, and it 
later found its way into the Vijaganita (Root Calculations) of the Hindu mathematician 
Bhaskara, born in 1114. Bhaskara draws the right triangle four times in the square of the 
hypotenuse, so that in the middle there remains a square whose side equals the difference 
between the two sides of the right triangle. This last square and the four triangles are then 
rearranged to make up the areas of two squares, the lengths of whose sides correspond 
to the legs of the right triangle. “Behold,” said Bhaskara, without adding a further word: 
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Early Solutions of the Pythagorean Equation 


The geometrical discovery that the sides of a right triangle were connected by a law 
expressible in numbers led naturally to a corresponding arithmetical problem, which we 
shall call the Pythagorean problem. This problem, one of the earliest problems in the 
theory of numbers, calls for nding all right triangles whose sides are of integral length, 
that is, nding all solutions in the positive integers of the Pythagorean equation 


a y? = 2’, 
A triple (x, y, z) of positive integers satisfying this equation is said to be a Pythagorean 
triple. 
Ancient tradition attributes to Pythagoras himself a partial solution of the problem, 
expressed by the numbers 


x=2n+1, y =2n? +2n, z=2n?+2n4+1, 


where n > | is an arbitrary integer. As is perhaps more often the rule than the exception 
in such instances, the attribution of the name may readily be questioned. 

Pythagoras presumably arrived at his solution by a relation that produces a square 
number from the next smaller square number, namely 


(1) (2k—-1)+(k-1P =P. 
The strategy was to suppose that 2k — 1 is a perfect square. (This happens in nitely 


often; for instance, if k = 5, then 2k —1 = 37.) Letting 2k -—1= m? and solving for k, 
we get 


When these values are substituted in (1), it follows that 


2 m2? —1 oe m +1 
m 5 — 5 : 


whence 


(2) x=m, y= 
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satisfy the Pythagorean equation for any odd integer m > | (m must be odd, because 
m* = 2k — 1 is odd). When m = 2n + 1, where n > 1, the numbers in (2) become 


(3) x=2n+1, y = 2n? +2n, z=2n*+2n+1, 


which is Pythagoras’s result. Some of the Pythagorean triples that can be obtained from 
(3) are given in the accompanying table. 


n xX y Zz 
1 3 4 5 
2 5 12 13 
4 q 24 25 
4 9 40 Al 
5 11 60 61 


As one sees, Pythagoras’s solution has the special feature of producing right triangles 
having the characteristic that the hypotenuse exceeds the larger leg by 1. 

Another special solution in which the hypotenuse and a leg differ by 2 is ascribed 
to the Greek philosopher Plato, to wit, 


(4) x=2n,  y=n-1, z=n'+1. 
This formula can be obtained, like the other, with the help of the relation (1); but now, 
we apply it twice: 
(K+ 1? =k? + (2k +1) 
= [(k-— 1° + 2k -—1)] + 2k +1 = (k—- 1)? + 4k. 
Substituting n? for k to make 4k a square, one arrives at the Platonic formula 
(2ny + (nr? —1P = (nr? + 1). 


Note that from equations (4) it is possible to produce the Pythagorean triple (8, 15, 17), 
which cannot be gotten from Pythagoras’s formula (3). 

Neither of the aforementioned rules accounts for all Pythagorean triples, and it was 
not until Euclid wrote his Elements that a complete solution to the Pythagorean problem 
appeared. In Book X of the Elements, there is geometric wording to the effect that 


(5) x = 2mn, y=m—n’, z=m' +n’, 


where m and n are positive integers, with m > n. 

In his Arithmetica, Diophantus (third century) also stated that he could get right 
triangles “with the aid of” two numbers m and n according to the formulas in equation 
(5). Diophantus seems to have arrived at these formulas by the following reasoning. 
Given the equation x* + y? = z*, put y = kx — z, where k is any rational number. Then 


2 —x*=y* = (kx —zy 


= k?x? — 2kxz + 22, 
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which leads to 
—x? = kx? —2kxz, 
or 
—x = k’x — 2kz. 


When this equation is solved for x, we get 


2k 
= Z, 
+1 
The implication is that 
P-1 
y sae = 2 


But k = m/n, with m and n integers (there is no harm in taking m > n), so that 


2mn m? — n? 
eS — y= =a 
m2 + n2 m2 + n2 


If one sets z = m* +n? to obtain a solution in the integers, it is found immediately that 
g y 


x = 2mn, y=m —n’, z=m? +n’. 
Our argument indicates that x, y, and z, as dened by the preceding formulas, satisfy 
the Pythagorean equation. The converse problem of showing that any Pythagorean triple 
is necessarily of this form is much more dif cult. The details rst appeared in the works 
of Arab mathematicians around the tenth century. 


The Crisis of Incommensurable Quantities 


The most important achievement of the Pythagorean school in its in uence on the 
evolution of the number concept was the discovery of the “irrational.” The Pythagoreans 
felt intuitively that any two line segments had a common measure; that is to say, starting 
with two line segments, one should be able to nd some third segment, perhaps very 
small, that could be marked off a whole number of times on each of the given segments. 
From this it would follow that the ratio of the lengths of the original line segments 
could be expressed as the ratio of integers or as a rational number. (Recall that a rational 
number is de ned as the quotient of two integers a/b, where b ¥ 0.) One can imagine the 
shattering effect of the discovery that there exist some ratios that cannot be represented 
in terms of integers. Who it was that rst established this, or whether it was done by 
arithmetical or geometric methods, will probably remain a mystery forever. 

The oldest known proof dealing with incommensurable line segments corresponds 
in its essentials to the modern proof that 2 is irrational. This is the proof of the 
incommensurability of the diagonal of a square with its side, and it is to be found in 
the tenth book of Euclid’s Elements. A reference in one of Aristotle’s works, however, 
makes it clear that the proof was known long before Euclid’s time. As in most classical 
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demonstrations, the method of argument was indirect. Thus, the negation of the desired 
conclusion is assumed, and a contradiction is derived from the assumption. 

The reasoning goes as follows. If the diagonal AC and side AB of the square ABCD 
have a common measure, say 4, then there exist positive integers m and n satisfying 


AC =m6, AB =n6. 
The ratio of these segments is 


AC _m 

AB ne 
To make matters simpler, let us suppose that any common factors of m and n have been 
cancelled. Now 


(ACY _ m? 
(ABP 2” 


Applying the Pythagorean theorem to the triangle ABC, one gets (AC)? = 2(AB)’, so 
that the displayed equation becomes 


m 
2 = nN’ 
or 2n* = m?. The task is to show that this cannot happen. 

Now 2n’, as a multiple of 2, is an even integer; hence m? is even. What about m 
itself? If m were odd, then m* would be odd, because the square of any odd integer must 
be odd. Consequently, m is even, say, m = 2k. Substituting this value in the equation 
m? = 2n? and simplifying, we get 


2 


2k =n’. 


By an argument similar to the one above, it can be concluded that is an even number. 
The net result is that m and n are both even (that is, each has a factor of 2), which 
contradicts our initial assumption that they have no common factor whatsoever. 

The Pythagoreans were not the rst to consider the numerical value of /2. An 
old cuneiform tablet, now in the Yale Babylonian Collection, contains the diagram of a 
square with its diagonals, as shown herewith. 


LAWN 
NZ 
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In sexagesimal notation, the number 1;24,51,10 is equal to 


ee 51 * 10 
60 602 603” 


which gives 1.414213 when translated into the decimal system. You should nd this 
familiar, for it is a very close approximation to /2 = 1.414213562... The meaning of 
the other numbers in the diagram becomes clear when we multiply 1;24,51,10 by 30. The 
result is 42;25,35, the length of the diagonal of a square of side 30. Thus, the Babylonians 
not only seemed to know that the diagonal of a square is /2 times the length of its side, 
but also had the arithmetic techniques to accurately approximate /2. 


Theon’s Side and Diagonal Numbers 


Theon of Smyrna (circa 130) devised a procedure for reaching closer and closer 
approximations of 2 by rational numbers. The computations involve two sequences of 
numbers, the “side numbers” and the “diagonal numbers.” We begin with two numbers, 
one called the rst side and denoted by x;, and the other the rst diagonal and indicated 
by y. The second side and diagonal (x2 and y2) are formed from the rst, the third side 
and diagonal (x3 and y3) from the second, and so on, according to the scheme 


x2 =x, +), yo = 2x, +), 
x3 =X2+ yo, y3 = 2x2 + yr, 


In general, x, and y, are obtained from the previous pair of side and diagonal numbers 
by the formulas 


Xn = Xn—-1 + Yn-1, Yn = 2Xn—1 + Yn-1- 
If we take x; = y,; = | as the initial values, then 
xy=14+1=2, yg =2-14153, 


x3=24+3=5, y3=2-243=7, 
x4=54+7= 12, ya=2-5+7=17. 


The names side numbers and diagonal numbers hint that the quotients y,/x, of the 
associated pairs of these numbers come to approximate the ratio of the diagonal of a 
square to its side: 


B22 Bo Boe 
XxX] X2 2’ Xx3 5 X4 12°°°°" 


This follows from the relation 


(1) y? = 2x? teil: 
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for the relation, if true, implies that 


y 2 1 2 
(*) =e (=) 
Xn Xn 


Because the value of (1/x,)* can be made as small as desired by taking n large enough, 


it appears that the ratio 
1 
me 24 
Xn Br 


tends to stay near some xed number for large n. It can be shown that the xed “limit” 
is V2. You can see how this works by considering the case n = 4. Here, 


y\? _f1TY 
ec 
289 
~ 144 


ee ee ee a . 
~ 144 5 144 — 12) ’ 


whence 


The ratio y4/x4 differs from the true value of /2. by less than 4 of | percent. 
Now condition (1), which can be written y? — 2x? = +1, can be justi ed by using 
the algebraic identity 


(2) (2x + y)? — 2x + yy = 2x? -y’. 


If x = xo, y = Yo are any two numbers satisfying the equation y — 2x? = +1, then we 
assert that x = x9 + vo, y = 2X0 + yo is also a solution. For by virtue of (2), 


y? — 2x? = (2x9 + yo!’ — (x0 + yo)” 


Thus, when one solution of y* — 2x” = +1 is known, it is possible to nd in nitely 
many more solutions by using identity (2). 

In the present situation, by the manner in which side and diagonal numbers are 
formed, this means that if y? — 2x? = +1 happens to hold for a certain value of n, then 
it must also hold for n+ 1, but with opposite sign. Setting x; = vy; = 1, we see that 
y2 — 2x? = +1 holds when n = 1, and hence this equation is valid for every value of n 
thereafter. In consequence, (1) is a correct identity for all n > 1. 

It is natural to raise the question whether the notion of side numbers and diagonal 
numbers can be used to obtain rational approximations to an arbitrary square root. Theon’s 
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original rule of formation was 
Xn =Xn-1 + Yn-1, sn = 2Xn-1 + Yn-1), n= 2. 


For 2 in the second equation, let us substitute a positive integer a (which is not a perfect 
square) to develop the following scheme: 


x2=xX1+ yi, y2 = ax; + V1, 
x3 =xX2+ )2, v3 = ax. + yo, 


X4 = X3+ 93, V4 = AaX3 + V3, 


Xn = Xn-1 + Yn-1) Yn = AXn-1 + Vn-1, 


Notice that 
Vn = (AXn—1 + Yn-1) = Xp + 2AXn—1Yn—1 + Vga 
acd = alt 1+ YP = ard, + 2a de hE 
and so, on subtraction, 
yp — ax? = (a? —a)x?_, +(1-a)y?_, 
= (1 —4)Q2_, — ax?_,). 


The import of this relation is that we have represented y? — ax? by an expression of 
the same form, but with n replaced by n — 1. Repeating this transformation for the next 
expression, we evidently arrive at the chain of equalities 


ye — ax? =(1- a)(y>_, - ax? 4) 
= (1 —a)(y?_, — ax?_,) 


= (1 — a)}(y?_, — ax?_) 


= (1 — a)" — ax7), 


and as a result, 


2 — ayn-1¢4,2 _ 2) 
@ agp eae 


2 
Xx xX; 


From this, it can be concluded that as n increases, the right-hand term tends to zero, 
whence the values y,/x, more and more closely approach the irrational number ./a. 

For an illustration, consider the case of J/3; that is, a = 3. If we take x; = ly =2 
as the initial side and diagonal numbers, then the foregoing formula reduces to 


2 n—1 
: 9 
(2) are nee 
xn 


Xn 
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The successive rational approximations of V3 are 


Mot 9 Bat: WY Ba? 
xy lo» 3? x ey ate IS 


A variation of the above theme is afforded by starting with the algebraic identity 
(3) (7 + 3x7) — 3Qxy)P = G7 — 32°Y. 
If one solution, say x = xo, y = Yo, of the equation 
y —3x7=1 


is known, then equation (3) indicates that a second solution can be found simply by 
letting x = 2xoyo, y = 4 + 3x. Indeed, on substitution, 


y? — 3x? = (90 + 3x0) — 3(2x0y0)" 


= (7 —324P = 1? = 1. 


Thus we have a process for generating solutions of y — 3x” = 1 from a single solution. 
By the rule of formation, 


2 2 
Xn = 2p AVR AAS Vn = Yp-1 + 3Xq-1 


a fresh solution x,, y, can be derived from a previous one x,—1, Vy—1. Because Xn, Vn 
satisfy 


y2 — 3x27 = 1, 


or what amounts to the same thing, 


the successive values (y,/x,) will approach 3 increasingly closely; that is, the sequence 
Yn/Xn provides a “very good” (in some sense) approximation of /3 by rational numbers. 

It is clear that the equation y* — 3x? = 1 has at least one solution in the positive 
integers, namely, x; = 1, y) = 2. We see then that 


x2 = 2x1y,) =2-1-2=4, 
yo =e 3x7 = 243-7 =7 
is also a solution. Thus new solutions are generated out of given ones. The next one is 


X3 = 2x22 = 2-4-7= 56, 
y= yet 3xZa=P4+3-4 = 97, 
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and so on. We have almost nished, for the sequence of rational approximations of the 
irrational number V3 is just 
yi _ 2 ye ys 97 ys 18,817 


x, 1 x 4 x3 56’ x, 10,864°°° 


Let us now view a strictly geometric proof of the incommensurability of the diagonal 
and side of a square. This argument, apparently older than the rst, is in the spirit of the 
arguments found in Euclid’s Elements. The basic idea is to show that we can build onto 
an arbitrary square a sequence of smaller and smaller squares. 

In the square ABCD, draw the arc BE to lay off the side 4B = s; on the diagonal 
AC = d,. Now draw the line EF perpendicular to d,, with F the point at which it 
intersects BC. By one of the congruence theorems, it is easy to prove that the triangles 
BAF and FAE are congruent; consequently, FB = FE, because they are congruent sides. 
Furthermore, CEF is an isosceles right triangle, whence its legs CE and FE are equal. 


A 


D 


A Sy B 


Next, construct a second square CEFG having sides s) = CE = d, — s, and diagonal 
dy = CB — FB = s; — sz. Laying off the sides s) = FE on the diagonal FC = dz, we 
determine CH, which is then used as s3, the side of the third square. In this third square, 
it is seen that s3 = dy — s2 and this diagonal d; = CE — EI = sy — 3. The process can be 
repeated over and over, obtaining successively smaller squares whose sides and diagonals 
satisfy the relations 


Sn = Gn—-1 — Sn-1, dy = Sn-1 — Sn- 


The geometric preliminaries completed, we assume that the diagonal and side of the 
original square are commensurable and show that this leads to an impossible situation. 
If these two lengths are commensurable, then they have a common measure 4, so that 
there exist integers M, and N, for which 


8, = M6, d, = N46. 
But then 
sy =d, —s; =(N; — M))b = M26, 
dy = 8, — 8, = (M, — M2)5 = N26, 
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where Mz < M, and N> < N,. Repetition of the argument yields 
1<---<M,<M)<M, 1<---<N3<N,< WN. 


We now come to the contradiction. Because there are only nitely many positive integers 
less than MM, and N,, these two sequences must terminate after a nite number of steps. 
This contradicts the idea that our construction of squares can be carried out inde nitely. 


Eudoxus of Cnidos 


The discovery of irrational numbers caused great consternation among the 
Pythagoreans, for it challenged the adequacy of their philosophy that number was the 
essence of all things. This logical scandal encouraged them to maintain the pledge 
of strict secrecy. Indeed, their resolve is testied to by the very name given to these 
new entities, “the unutterable.” (The Greeks used the term Jogos, meaning “word” or 
“speech,” for the ratio of two integers. Hence, when incommensurable lengths were 
described as alogos, the term carried a double meaning: “not a ratio” and “not to be 
spoken.”) The knowledge that irrationals existed was a dangerous secret to possess. 
Popular legend has it that the rst Pythagorean to utter the unutterable to an outsider 
was murdered—thrown off a ship to drown. 

It fell to Eudoxus of Cnidos (408-355 B.C.) to resolve the crisis in the foundations 
of mathematics. His great contribution was a revised theory of proportion applicable to 
incommensurable as well as commensurable quantities. Everything was based on an elab- 
orate de nition of the ratio of magnitudes, but magnitudes themselves were left unde ned. 
Hence, the problem of de ning irrational numbers as numbers was avoided entirely. The 
immediate effect of Eudoxus’s approach was to drive mathematics into the hands of the 
geometers. In the absence of a purely arithmetic theory of irrationals, the primacy of 
the number concept was renounced. Geometry was held to be a more general doctrine 
than the science of numbers, and for the next 2000 years, it served as the basis of almost 
all rigorous mathematical reasoning. 

The existence of incommensurable geometric quantities necessitated a thorough re- 
casting of the foundations of mathematics, with an increased attention to logical rigor. It 
was a formidable task and engaged the best efforts of the most notable mathematicians 
of the fourth century B.C.: Theodorus, Theaetetus, Archytas, and Eudoxus. Theodorus 
of Cyrene (born 470 B.c.), the mathematics tutor of the great philosopher Plato, is said 
to have demonstrated geometrically that the sides of squares represented by V3, /5, 
V6, JA7, ae, /10, 4/11, /12, J13, J14, 15, and /17 are incommensurable with a 
unit length. That is, he proved the irrationality of the square roots of nonsquare integers 
from 3 to 17, “at which point,” Plato said, “for some reason he stopped.” Theaetetus of 
Athens (415-369 B.C.), who was a pupil of Theodorus and a member of Plato’s school in 
Athens, extended the result, demonstrating that the square root of any nonsquare integer 
is irrational. Plato himself added to the theory by showing that a rational number could 
be the sum of two irrationals. One of the few Pythagoreans to stay behind in southern 
Italy after the death of Pythagoras, Archytas of Tarentum (428-347 B.C.) is reputed to 
have been the rst to study geometry on a circular cylinder, discovering in the process 
some of the properties of its oblique section, the ellipse. He also devised an ingenious 
solution of the problem “to double a cube” by means of cylindrical sections. 
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3.3 Problems 


1. (a) 


(b) 


Perhaps the most brilliant Greek mathematician before Archimedes was Eudoxus. 
Born about 408 B.c. in Cnidos on the Black Sea, he set out at the age of 23 to learn 
geometry from Archytas in Tarentum and for several months, philosophy from Plato 
in Athens. Eudoxus, too poor to live in Athens, lodged cheaply at the harbor town of 
Piraeus, where he had rst debarked; every day he walked the two miles to Plato’s 
Academy. Later he traveled to Egypt, where he remained for 16 months. Thereafter he 
earned his living as a teacher, founding a school at Cyzicus in northwestern Asia Minor 
that attracted many pupils. When he was about 40 years old, Eudoxus made a second 
visit to Athens accompanied by a considerable following of his own students; there he 
opened another school, which for a time rivaled Plato’s. The reputation of Eudoxus rests 
on three grounds: his general theory of proportion, the addition of numerous results on 
the study of the golden section (the division of a line segment in extreme and mean 
ratio), and the invention of a process known as the method of exhaustion. The proce- 
dure Eudoxus proposed was later re ned by Archimedes into a powerful tool for deter- 
mining curvilinear areas, surfaces, and volumes—an important precursor to the integral 
calculus. 

During this period, Greek mathematics began to be organized deductively on the 
basis of explicit axioms. Its nal axiomatic form was set forth in the 13 books of the 
Elements that Euclid wrote about 300 B.c. In compiling the Elements, Euclid built on 
the experience and achievements of his predecessors in the three centuries just past. 
Theaetetus’s elaborate classi cation of higher types of irrationals is the subject matter of 
Book X of the Elements, although Euclid must be credited with having arranged it into 
a logical whole. The Eudoxian theory of proportion—which is really a theory of real 
numbers—is incorporated into Book V; and Book II is mostly a geometric rendition of 
Pythagorean arithmetic, wherein Euclid represented numbers by line segments instead of 
the pictorial dot method the early Pythagoreans favored. 


( st 1), 500 + ») 


Establish the formula 


for n even, consider the triple 


fae a—b a a+b\? 
a 5 = 5 : 


(n, (n?/4) — 1, (n°/4) + 1).] 


Show that a = 2n, b = 2, gives rise to Plato’s 4. Verify that (3, 4, 5) is the only Pythagorean triple 
formula for Pythagorean triples, whereas involving consecutive positive integers. [Hint: 
a=(2n+ 1)*, b=1, yields Pythagoras’s own Consider the Pythagorean triple (x, x + 1, x + 2) and 
formula. show that x = 3.] 

2. Find all right triangles with sides of integral length 5. (a) Establish that there are in nitely many 


whose areas are equal to their perimeters. [Hint: The 
equations x? + y? =z? andx+y+z= sxy imply 
that (x — 4)(yv — 4) = 8.] 


3. For n > 3 a given integer, nd a Pythagorean triple 
having n as one of its members. [Hint: For n an odd 
integer, consider the triple 


Pythagorean triples (x, y, z) in which x and y 
are consecutive integers. [Hint: If (x, x + 1, z) 


happens to be a Pythagorean triple, so is 
(3x +2z+ 1, 3x +224 2, 4x + 3z + 2).] 


(b) Find ve Pythagorean triples of the form 
(x,x + 1, z). 
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6. 


10. 


Consider the sequence of quotients y,/x, of Theon’s 
diagonal numbers to side numbers. 


(a) Verify that the rst, third, and fth terms in this 
sequence are getting successively larger, 
whereas the second, fourth, and sixth terms are 
decreasing. 

(b) Compute the difference between 2 and the 


square of each term, through the rst six terms; 
(n/XnyY° should be getting nearer 2 at each 
stage, alternating above and below, hence y,/x, 
approximates /2. 


Let two sequences of numbers be formed in 
accordance with the following rule: 


x, = 2, vy =3, 
Xn = Shut + 2yn-1, 


Ya = 4xn—1 + 3Vn-1 for n > 2. 


(a) Write out the rst ve numbers in each of the 
above sequences. 
(b) Show that 
Vn 2x; = Vat — a 
whence 
y? — 2x? = yt — 2x? = 1. 
(c) From part (b), conclude that successive values 


of y,/xX, are nearer and nearer approximations 


of /2. 


Consider the sequence of numbers de ned by the 
following rule: 


x, =2, 
1 2 
Xn = =| Xn-1+— ) forn > 1. 
2 Xn-1 
(a) Write out the rst four terms of this sequence in 
decimal form. 
(b) Assuming that the terms x, approach a number 


L as n increases, show that L = J/2. [Hint: 
The number L satises L = 3(L + 2/L).] 


Prove that /3 and /2 are irrational by assuming that 
each is rational and arguing until a contradiction is 
reached. 


Replace 2 by 3 in Theon’s de nition of side numbers 
and diagonal numbers, so that the rule of formation 
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becomes 
Xn =Xn-1 + Yn-1, 
DS io 3Xn-1 = ¥n-1 n= 2. 

(a) Starting with x; = 1, y; = 2, write out the rst 
six numbers in each of the resulting sequences. 

(b) Conrm for several values of n that when y, /x; 
is in lowest terms, 

y? — 3x2 = 1 or —2. 

(c) Assuming that the relation in part (b) holds for 
all n, show that this implies that the successive 
ratios y,,/x, are approaching V3. 

(d) Write out the rst six values of y,/x, to get an 
approximation /3. 

11. Archimedes (287-212 B.c.) in his book Measurement 


12. 


13. 


of a Circle presented, without a word of justi cation, 
the inequality 


1351 265 
— 3>—. 
man aa 
(a) As an explanation of the probable steps leading 
to the left-hand bound, show rst that 
: 1 / 1 
i) 26-5 =,/20—-1+(5) 
> 262-1 
and then 
. 1351 il 1 
ll) = 26 
780 15 52 
1 
> av 202 —-1= V3 
(b) Obtain the right-hand bound in a similar 


manner by replacing = with x. 


Because V3 is approximately 3, one can put 
V3 = e + 1/x), where x is unknown. 


(a) Square both sides of this expression, neglect 
1/x?, and solve the resulting linear equation for 
x to get a second approximation of /3. 

(b) Repeat this procedure once more to nd a third 
approximation. 

(a) Given a positive integer n that is not a perfect 


square, let a? be the nearest square to n (above 
or below n, as the case may be), so that 
n=a’ +b. Prove that 
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14. 


15. 


16. 


17. 


(b) Use part (a) to approximate 50, 63, and 
V75 by rational numbers. 


Use the inequality of Problem 13 to get Archimedes’ 
bounds on J3. [Hint: Take a = 26 and b = 1.] 


A standard proof of the Pythagorean theorem starts 
with a right triangle ABC, with its right angles at C, 
and then draws a perpendicular CD from C to the 
hypotenuse AB. 


a 18. 


(a) Prove that triangles ACD and CBD are both 
similar to triangle ABC. 

For a triangle ABC with legs of lengths a and b 
and with hypotenuse of length c, use the 
proportionality of corresponding sides of similar 


triangles to establish that a? + b? = c’?. 


(b) 


For another proof of the Pythagorean theorem, 
consider a right triangle ABC (with right angle at C) 
whose legs have length a and b and whose 
hypotenuse has length c. On the extension of side 
BC pick a point D such that BAD is a right angle. 


(a) From the similarity of triangles ABC and DBA, 
show that 4D = ac/b and DC = a’?/b. 

Prove that a? + b* = c? by relating the area of 
triangle ABD to the areas of triangles ABC and 
ACD. 


(b) 


Several years before James Gar el d became president 
of the United States, he devised an original proof of 
the Pythagorean theorem. It appeared in 1876 in the 
New England Journal of Education. Starting with a 
right triangle ABC, Gar eld placed a congruent 
triangle EAD as indicated in the gure. He then drew 
EB so as to form a quadrilateral EBCD. Prove that 


20. 
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a’ +b? = c’ by relating the area of the quadrilateral 
to the area of the three triangles ABC, EAD, and EBA. 


D a E 
| 

A 

a 

Cc B 


The Pythagoreans de ned the harmonic mean of a 
and b, where a < b, to be the number / such that 


h-a _a 

b-h bo 
For instance, the harmonic mean of 6 and 12 is 8, 
because (8 — 6)/(12 — 8) = 6/12. Prove that h is the 


harmonic mean of a and b if and only if / satis es 
either of the relations: 


(a) 1 1 1 1 
a) ---=---. 
a h hb 
(b) h= 2ab 
ath 


. Pappus (circa 320) in his Mathematical Collection 


provided a construction for the harmonic mean of the 
segments OA and OB as follows. On the 
perpendicular to OB at B lay off BC = BD, and let 
the perpendicular to OB at A meet OC at the point 
E. Join ED, and let H be the point at which ED cuts 
OB. Prove that h = OH is the desired harmonic 
mean between a = OA and b = OB. [Hint: From the 
similarity of triangles OAE and OBC, as well as of 
triangles HAE and HBD, infer that a/b = AH/HB.] 


Cc 
E 
O A B 
D 
Establish the “perfect proportion” 
a _ 2ab/(a + b) 


(a+b)/2_ b 
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between the arithmetic and harmonic means of two 
numbers a and b. 


21. The division of a line segment into two unequal parts 
so that the whole segment will have the same ratio to 
its larger part that its larger part has to its smaller 
part is called the golden section. A classical 
ruler-and-compass construction for the golden section 
of a segment AB is as follows. At B erect BC equal 
and perpendicular to 4B. Let M be the midpoint of 
AB, and with MC as a radius, draw a semicircle 
cutting AB extended in D and E£. Then the segment 
BE laid off on AB gives P, the golden section. 


(a) Show that ADBC is similar to ACBE, whence 
DB/BC = BC/BE. 

(b) Subtract 1 from both sides of the equality in 
part (a) and substitute equals to conclude that 
AB/AP = AP/PB. 

(c) Prove that the value of the common ratio in part 
(b) is (5 + 1)/2, which is the “golden ratio.” 
[Hint: Replace PB by AB — AP to see that 
AB? — AB. AP — AP? = 0. Divide this 
equation by AP? to get a quadratic equation in 
the ratio AB/AP.] 

(d) A golden rectangle is a rectangle whose sides 
are in the ratio (V5 + 1)/2. (The golden 
rectangle has dimensions pleasing to the eye 
and was used for the measurements of the 
facade of the Parthenon and other Greek 


3.4 Three Construction Problems 
of Antiquity 


Hippocrates and the Quadrature 


of the Circle 
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temples.) Verify that both the rectangles AEFG 
and BEFC are golden rectangles. 


22. Theodorus of Cyrene (circa 400 B.c.) who was the 
mathematics teacher of Plato, showed how to 
construct a line segment of length ./n for any 
positive integer n. Prove the following, 


(a) Given an odd integer n, then ./n is represented 
by the leg of a right triangle whose hypotenuse 
is (x + 1)/2 and whose other leg is (n — 1)/2. 

(b) Given an even integer n, then ./n is represented 
by half of the leg of a right triangle whose 
hypotenuse is 7 + 1 and whose other leg is 
n—-l., 


23. It has been suggested that Theodorus also obtained 


Jn (2 <n < 17) by constructing a spiral-like gure 
consisting of a sequence of right triangles having a 
common vertex, so that in each triangle the leg 
opposite the common vertex had length 1. Show that 
the hypotenuse of the nth triangle in this sequence 
has length /n + I. (The reason Theodorus stopped 
at 17 is that at the next step, wherein /18 would 
be constructed, the gure cuts across the initial axis 
for the rst time.) 


The mathematician who dominated the 
second half of the fth century B.C. was 
Hippocrates of Chios (460-380 B.c.), who is 
to be distinguished from his more celebrated 
contemporary Hippocrates of Cos, the father 
of Greek medicine. Like Thales, Hippocrates 
began his life as a merchant and ended as a 
teacher; but being less shrewd than Thales, 


Hippocrates was robbed of his money. Accounts differ on whether he was swindled by 
customhouse collectors at Byzantium or whether his ships were plundered on the high 
seas by Athenian pirates. At any rate, with his property lost, Hippocrates went to Athens 
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to prosecute the offenders in the law courts. Obliged to stay for many years (perhaps from 
450 to 430 B.c.), he attended the lectures of several philosophers. There is good reason to 
believe that the Pythagoreans were settled in Athens at that time, so he may have come 
under their in uence even though he had no Pythagorean teacher in the formal sense. Ulti- 
mately, Hippocrates attained such a pro ciency in geometry that he became one of the rst 
to support himself openly by accepting fees for teaching mathematics. If, as some say, the 
Pythagoreans taught him what he knew of arithmetic and geometry, then by the standards 
of the time he betrayed their trust by selling the secrets of mathematics to anyone who 
would pay the price. (A more charitable interpretation is that the Pythagoreans, moved by 
Hippocrates’ misfortune, allowed him to earn money by teaching their geometry.) Aris- 
totle spoke unkindly of Hippocrates: “It is well known that persons stupid in one respect 
are by no means so in all others; thus Hippocrates, though a competent geometer, seems 
in other regards to be stupid and lacking in sense.” The Greeks, indeed, were likely to 
view any man a fool who through his own simplicity was cheated out of his possessions. 

By the middle of the fth century, so many geometric theorems had been established 
that it became increasingly necessary to tighten the proofs and put all this material in 
good logical order. Proclus told how Hippocrates composed a work on the elements 
of geometry, anticipating the better-known Elements of Euclid by more than a century. 
No trace of this rst textbook on geometry remains, however (in fact, no mathematical 
treatise of the fth century has survived). Although Hippocrates’ book may have started 
a signicant tradition, it would have had the shortcomings of a pioneering work, and 
been rendered obsolete by Euclid’s Elements. Hippocrates did originate the now famil- 
iar pattern of presenting geometry as a chain of propositions, a form in which other 
propositions can be derived on the basis of earlier ones. Among other innovations, he 
introduced the use of letters of the alphabet to designate points and lines in geometric 
gures. 

When Hippocrates arrived in Athens, three special problems—the quadrature of the 
circle, the duplication of the cube, and the trisection of a general angle—were already 
engaging the attention of geometers. These problems have remained landmarks in the 
history of mathematics, a source of stimulation and fascination for amateurs and scholars 
alike through the ages. 

The achievement on which Hippocrates’ fame chie y rests has to do with the rst 
of these problems, the quadrature of the circle. This problem, sometimes called the 
“squaring of the circle,” can be stated simply: Is it possible to construct a square whose 
area shall be equal to the area of a given circle? The problem is much deeper than it rst 
appears, because the important factor is how the square is to be constructed. Tradition 
has it that Plato (429-348 B.c.) insisted that the task be performed with straightedge and 
compass only. In this method the assumption is that each instrument will be used for a 
single, specic operation: 


1. With the straightedge, a line can be drawn through two given points. 


2. With the compass, a circle with a given center and radius can be drawn. 


It is not permissible to use these two instruments in any other way; in particular, neither 
device is to be used for transferring distances, so that the straightedge cannot be graduated 
or marked in any way, and the compass must be regarded as collapsing as soon as either 
point is lifted off the paper. A point or a line is said to be constructible by straightedge 
and compass if it can be produced from given geometric quantities with these two tools, 
using them in the prescribed way only a nite number of times. 
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In the strict Greek sense of construction, the quadrature problem remained unsolved 
in spite of vigorous efforts by the Greek and other, later geometers. The futility of their 
attempts was demonstrated in the nineteenth century, when mathematicians were at last 
able to prove that it is impossible to square the circle by straightedge and compass alone. 
As it turns out, the test of constructibility under these instrumental limitations uses the 
ideas of algebra, not geometry, and involves concepts unknown in antiquity or the Middle 
Ages. Squaring the circle is equivalent to constructing a line segment whose length is ./7 
times the radius of the circle. Thus, the impossibility of constructing such a line segment 
by means laid down by the Greeks would be proved if it could be shown that ./z is not 
a constructible length. The argument hinges on the transcendental nature of the number 
z; that is, w is not the root of any polynomial equation with rational coef cients. (The 
transcendence of z was established by Lindemann in 1882 in a long and intricate proof.) 


Plato 
(429-348 B.C.) 


(The Bettmann Archive.) 


Even early investigators must have suspected that the allowable means were inad- 
equate for solving this quadrature problem; for when they failed to nd a construction 
involving merely circles and straight lines, they introduced special higher curves assumed 
to be already drawn. Here they were successful. Hippias of Elis (circa 425 B.C.), a near 
contemporary of Hippocrates, invented a new curve called the quadratrix, for the express 
purpose of squaring the circle. His solution was perfectly legitimate, but did not satisfy 
the restriction Plato had laid down. Hearing that Hippias had devised a sliding apparatus 
by which his curve could be drawn, Plato rejected the solution on the grounds that it was 
mechanical and not geometrical. Plutarch (Convivial Questions ) describes Plato as saying: 
“For in this way the whole good of geometry is set aside and destroyed, since it is reduced 
to things of the sense and prevented from soaring among eternal images of thought.” 

Hippocrates’ attempts at squaring the circle led him to discover that there are certain 
plane regions with curved boundaries that are squarable. More speci cally, he showed 
that two lunes (a lune is the moon-shaped gure bounded by two circular arcs of unequal 
radii) could be drawn, whose areas were together equal to the area of a right triangle. 
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This was accomplished as follows. Starting with an isosceles right triangle ABC, he 
constructed semicircles on the three sides as in the diagram. 


Hippocrates apparently knew that the areas of two circles were proportional to the 
squares of the lengths of their diameters. Thus, 


Area semicircle on AB _ AB? 
Area semicircle on AC AC2” 


This ratio must equal 2; for the Pythagorean theorem, as applied to triangle ABC, allows 
AB* = AC* + CB* = 2AC”. Hence, the semicircle on AB has twice the area of the 
semicircle on AC. From this, Hippocrates was led to conclude that the sum of the areas 
of the two small semicircles equaled the area of the larger one. The next step was to 
subtract the areas III and IV common to both. The gure shows that the areas remaining— 
namely, the sum of the areas I and II of the two lunes and the area of triangle ABC—are 
equal. But triangle ABC has area $(AC -BC)= 3AC’, so that 


Area lune I + area lune II = 5AC = 


To put it another way, lune I has an area equivalent to half that of triangle ABC, 


2 
Area lune I = 2 (54c’) a (+) : 
2\2 2 
and the “square of the lune” has been found. Hippocrates thus provided the rst example 
in mathematics of a curvilinear area that admits exact quadrature. 

Having shown that the lune could be squared, Hippocrates next tried to square the 
circle by a similar argument. To this end, he took an isosceles trapezoid ABCD formed 
by the diameter of a circle and three consecutive sides of half of a regular hexagon 
inscribed in the circle. Further semicircles were then described, having as diameters the 
sides AB, BC, and CD of the hexagon, as well as the radius OD of the original circle. 
Hippocrates proved that the area of the trapezoid ABCD equaled the sum of areas of the 
three lunes I, II, and II plus the area of the semicircle on OD. 


ae 
, \) 
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Because the squares of the diameters are to each other as the areas of the respective 
semicircles, 


Area semicircle on OD OD* OD* 1 


Area semicircle on AD AD? (20D) 4" 


But each of the sides 4B, BC, and CD is equal to the radius OD, from which it follows 
that each of the small semicircles has area a quarter that of the large semicircle. Knowing 
this, one concludes that the area of the semicircle on AD is the same as the total area 
of four semicircles—the three semicircles on the equal sides of the half-hexagon and 
the semicircle on the radius OD. If the parts common to both of these areas (to wit, the 
shaded segments lying between the hexagon and the circumference of the semicircle) 
are removed, the remaining areas will be equal. In other words, the lunes I, II, and II 
together with the semicircle on OD will have an area equivalent to that of the trapezoid 
ABCD: 


Area trapezoid ABCD = area lune I + area lune I + area lune III 
+ area semicircle on OD. 


If it were possible to subtract from this sum three squares with areas equal to the areas 
of the three lunes, then we could construct a rectangle equal in area to the semicircle 
on OD; twice that rectangle would then be equivalent to the circle on OD. As any 
rectangle can be converted to a square having the same area, the circle would have been 
squared. 

Hippocrates’ work on lunes has been preserved through the writings of the sixth- 
century commentator Simplicius and is indeed the only sizable fragment of classical 
Greek (pre-Alexandrian) mathematics that has been transmitted to us as originally com- 
posed. According to Simplicius, Hippocrates believed that he had actually succeeded in 
obtaining the quadrature of the circle by the argument as we have described it. He did 
not, needless to say, solve the squaring of the circle. 

The mistake lay in assuming that every lune can be squared; whereas this was shown 
possible only in the special case with which Hippocrates had concerned himself. What he 
proved for the lune on the side of an inscribed isosceles triangle need not be true for the 
lune on the side of an inscribed half-hexagon. Actually it is unlikely that Hippocrates, 
one of the most competent of geometers, would have made such a blunder. He may have 
hoped that in due course these lune quadratures would lead to the squaring of the circle. 
But it must have been a mistake on the part of the commentator to think that Hippocrates 
had claimed to have squared the circle when he had not done so. 


The Duplication of the Cube 


Another famous construction problem that concerned geometers of the time was the 
duplication of the cube; in other words, nding the edge of a cube having a volume 
twice that of a given cube. Just how the duplication problem originated is a matter of 
conjecture. Perhaps it dates back to the early Pythagoreans who had succeeded in doubling 
the square—if upon the diagonal of a given square a new square is constructed, then the 
new square has exactly twice the area of the original square. After this accomplishment, 
it would be only natural to extend the problem to three dimensions. 
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Tradition, however, provides us with a more romantic tale. According to the account 
that has prevailed most widely, the Athenians appealed to the oracle at Delos in 430 B.C. 
to learn what they should do to alleviate a devastating plague that had in icted great 
suffering on their city and caused the death of their leader, Pericles. The oracle replied 
that the existing altar of Apollo should be doubled in size. Because the altar was in 
the form of a cube, the problem was to duplicate the cube. Thoughtless builders merely 
constructed a cube whose edge was twice as long as the edge of the altar. At this, legend 
has it, the indignant god made the pestilence even worse than before. When the error 
was discovered, a deputation of citizens was sent to consult Plato on the matter. Plato 
told them that “the god has given this oracle, not because he wanted an altar of double 
the size, but because he wished in setting this task before them to reproach the Greeks 
for their neglect of mathematics and their contempt of geometry.” Whether the plague 
was actually abated or whether it simply ran its course is not known, but because of the 
oracle’s response, the problem of duplicating the cube is often referred to as the “Delian 
problem.” 

History is confused, and there are at least two legends on the subject. We are also 
told that the poet Euripedes (485—406 B.C.) mentioned the Delian problem in one of his 
tragedies, now lost. In this version, the origin of the problem is traced to King Minos, 
who is represented as wishing to erect a tomb to his son Glaucus. Feeling that the 
dimensions proposed were too undignied for a royal monument, the king exclaimed, 
“You have enclosed too small a space; quickly double it, without spoiling the beautiful 
(cubical) form.” In each of these accounts, the problem seems to have had its genesis in 
an architectural dif culty. 

Here too, the rst real progress in solving the duplication problem was made by 
Hippocrates. He showed that it can be reduced to nding, between a given line and 
another line twice as long, two mean proportionals. (That is, two lines are inserted 
between the given lines so that the four are in geometric proportion.) In our present 
notation, if a and 2a are the two given lines, and x and y are the mean proportionals 
that could be inserted between them, then the lengths a, x, y, and 2a are in geometric 
progression, which is to say 


a x y 

xy 2a 
The rst two ratios imply that x* = ay. From the second pair of ratios, we see that 
y? = 2ax. These equations are combined into 


whence it appears that 
x? = 2a. 


In other words, the cube that has edge x will have double the volume of a given cube 
of edge a. 

Hippocrates did not succeed in nding the mean proportionals by constructions using 
only straightedge and compass, those instruments to which Plato had limited geometry. 
Nevertheless the reduction of a problem in solid geometry to one in plane geometry 
was in itself a signi cant achievement. From this time on, the duplication of the cube 
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was always attacked in the form in which Hippocrates stated it: How may two mean 
proportionals be found between two given straight lines? 


The Trisection of an Angle 


Although Hippocrates advanced two of the three famous construction problems, he made 
no progress with trisecting an angle. The bisection of an angle with only straightedge and 
compass is one of the easiest of geometrical constructions, and early investigators had no 
reason to suspect that dividing an angle into three equal parts under similar restrictions 
might prove impossible. Some angles can obviously be trisected. In the special case of 
the right angle POQ, the construction is found as follows. With O as a center, draw a 
circle of any radius intersecting the sides of the angle in points A and B. Now draw 
a circle with center at B and passing through O. The two circles will intersect in two 
points, one of which will be a point C in the interior of angle POQ. 


Q 


Triangle BOC is equilateral, hence equiangular; therefore COB = 60°. But then 
LCOA = 90° — 60° = 30° = +(90°), 


and line OC is a trisector of the right angle. 

For 2000 years mathematicians sought in vain to trisect an arbitrary angle. In 1837, 
Pierre Wantzel (1814—1848) of the Ecole Polytechnique in Paris supplied the rst rigorous 
proof of the impossibility of trisecting any given angle by straightedge and compass alone. 
In the same paper, published in Liouville’s Journal de Mathématiques, Wantzel also 
demonstrated the futility of duplicating the cube in the manner speci ed. The key to this 
conclusion was the conversion of the two geometric problems to questions in the theory 
of equations. Wantzel obtained simple algebraic criteria that would permit the solution 
of a polynomial equation with rational coef cients to be geometrically constructed by 
means of a straightedge and compass. The classical geometric problems of trisection and 
duplication lead to cubic equations that do not satisfy Wantzel’s conditions, and thus the 
corresponding constructions cannot be carried out. 

If the restrictions imposed by the Greeks are relaxed, there are a variety of ways 
of dividing an angle into three equal parts. The simplest solution of the problem is 
to allow oneself the liberty of marking the straightedge. The following technique of 
rotating a marked straightedge until certain conditions are satised was devised by 
Archimedes. Let POQ be the angle to be trisected. With the vertex O as center, draw 
a circle of any radius r intersecting PO in A and QO in B. Now lay off the distance 
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r on a straightedge. By shifting the straightedge around, you can get a certain position 
in which it passes through the point B, while the endpoints of the r segment lie on the 
circle (at C) and the diameter AOA’ extended (at D). The line through the points B, C, 
and D is now drawn with the aid of the straightedge. 


P 


D A' O A 


With these preliminaries accomplished, we undertake to show that angle ODC is 
one-third of angle AOB. First observe that by its construction, CD = OC =r, so that 
triangle ODC is isosceles; hence COD = ZODC. Because an exterior angle in a triangle 
is equal to the sum of the nonadjacent interior angles, it follows that in triangle COD, 


LOCB = LCOD + LODC = 2Z0DC. 


Also, in the isosceles triangle OCB, we have OCB = ZOBC. Another appeal to the 
exterior angle theorem (this time applied to triangle ODB) leads to the equality 


ZAOB = LODB + LOBC = LODC + LOBC. 
These various observations can be brought together to give 
ZAOB = LODC + LOBC 
= LODC + LOCB 
= ZODC + 2ZODC = 3ZODC, 


which accomplishes our aim. It is worth emphasizing that the usual rules for straightedge 
and compass constructions have been violated, because the straightedge was marked. That 
is, the points C and D were determined by sliding the straightedge to the proper position 
to make CD equal to r. 


3.4 Problems 


1. 


For a variation of Hippocrates’ argument that the area 
of a lune could be reduced to the area of a circle, 
begin with a square ABCD and construct a semicircle 
on its diagonal. With the point D as a center and 

AD as radius, draw a circular arc from A to C, as 

in the gure. Prove that the area of the lune, shaded 
in the gure, is equal to the area of triangle ABC. 
[Hint: Similar circular sections (the region between a 
chord and the arc subtended by the chord) have areas 
proportional to the squares of the lengths of their 
chords. Apply this fact to the similar sections I and II.] 


AX 
VY 


A 
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The following solution to the continued mean propor- 
tionals problem is often attributed to Plato, although it 
could hardly be his in view of his objection to mechan- 
ical constructions. Consider two right triangles ABC 
and BCD, lying on the same side of the common leg 
BC (see the gure). Suppose that the hypotenuses AC 
and BD intersect perpendicularly at the point P, and 
are constructed in such a way that AP = a and DP = 
2a. Prove that x = BP and y = CP are the required 
mean proportionals between a and 2a, that is, that 


a x_y 
xy 2a 
D 
2a 
A 
a 
y~ 
B Cc 


[Hint: When parallel lines are cut by a third line, 
alternate interior angles are equal. Conclude therefore 
that the triangles APB, CPB, and DPC are similar. ] 


Apollonius (circa 225 B.c.) solved the 

problem of inserting two mean proportionals between 
segments of lengths a and 2a. He rst constructed 

a rectangle ABCD, with AB =a and AD = 2a, 
letting E be the point at which the diagonals 
bisected one another. With £ as a center, he 

then drew a circle cutting the extensions of AB and 
AD at points P and Q, respectively, so that P, C, 
and Q all lay on a straight line. (Apollonius is said to 
have invented a mechanical device by which this last 
step could be made.) For such a gure, establish that 


(a) The triangles PAQ, PBC, and CDQ are 
similar, whence 
a y _ a+ y 


x 2a 2a+x’ 


P. 
y 
B ~ 
~ 
FS 
A 2a D x Q 
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(b) Triangles EFQ and EGP are right triangles with 
equal hypotenuses, whence 


(a +x + (2) a0 ($4 "’ 
1 an el a 


or 
(2a+x)x =(at+y)y. 


(c) Segments DO = x and BP = y are the two 
mean proportionals between a and 2a: 


a y 


a 
y 2a’ 


The Greek mathematician Menaechmus (c. 350 
B.C.), the tutor of Alexander the Great, obtained a 
purely theoretical solution to the duplication 
problem based on nding the point of intersection 
of certain “conic sections.” To duplicate a cube 

of edge a, he constructed two parabolas having 

a common vertex and perpendicular axes, so that 
one parabola had a focal chord (latus rectum) 

of length a and the other a chord of length 2a. Prove 
that the abscissa x of the point of intersection of the 
two parabolas satis e s the condition x* = 2a°; the 
sought-for x, the cube’s edge, is thereby obtained. 


The trisection of a given angle can also be 
accomplished by a construction due to Nicomedes 
(circa 240 B.c.). Let ZAOB be a given angle. Through 
the point B, draw two lines, one perpendicular to the 
other side of ZAOB at C and one parallel to it. Now 
mark the length a = 20B on a straightedge and slide 
the straightedge so that it passes through the point 
O, while the endpoints of the a segment lie on BC 
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and BD (at P and Q, respectively, so that PO = a). 


Verify each of the following assertions: 


(a) If M is the midpoint of PQ, then ZMOB = 
ZBMO. [Hint: The midpoint of the hypotenuse 
of a right triangle is equidistant from the 
endpoints of its sides.] 

(b) By the exterior angle theorem, as applied 
to triangle BMQ, we nd that BMP = 
LMBO + LMQB. 

(c) LAOQ = ZBOO. 

(d) LAOB = LAOQ + LOOB = 3ZBQ0. 


Nicomedes solved the problem of duplicating the cube 
by an argument like that of Apollonius. First, construct 
a rectangle ABCD with AB = a and AD = 2a. Let 
M be the midpoint of AD and N the midpoint of AB, 
and let the segments CM and BA be extended to meet 
in G. Take the point F on the perpendicular FN to be 
such that FB = a. Now draw BH parallel to GF and 
draw FP to cut segment AB produced in P, with P 
so chosen that HP = a. (To accomplish this last step, 
Nicomedes invented a special plane curve, and even 
an apparatus that would draw it, called the conchoid.) 
Prolong the line PC until it meets AD extended in Q. 


Establish that 


(a) The triangles PAQ, PBC, and CDQ 
are similar, whence 
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a yo aty 
x 2a 2a+x’ 


(b) The triangles PBH and PGF are similar, whence 


a z+a 


y y+2a’ 
or a/z = y/2a, so that z = x. 


(c) The triangles FNB and FNP have FN as a 
common side, and so 


v-(8) oro" 49) 


or 


x ya 


y x+2a’ 


(d) Segments DO = x and BP = y are the two 
mean proportionals between a and 2a: 


Isaac Newton (1642-1727) suggested the following 
construction for duplicating the cube. Given a 
segment AB, erect a perpendicular BR to AB and draw 
BT so that angle ABT equals 120°. Let D be the 
point on BT such that if AD is drawn meeting 

BR at C, then CD= AB. 


Establish that if DE is drawn perpendicular to BR, 
each of the following will be true: 


(a) Triangles ABC and DEC are similar, whence 
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(b) 
= = tan30° = sa 
J3 BE 
x a 


“ b+y ab+be 


(c) The result of squaring the last equation 
and substituting b? = c? — a? is 
(2a +c) = 2a3(2a +c). 


(d) Since c? = 2a’, the cube of side AC is double 
the cube of side AB. 


To nd a fourth proportional to given line segments 
with lengths a, b, and c, rst construct two 
noncollinear rays emanating from a point O. On these 
rays mark off segments OA and OC of lengths a and 
c, respectively, and connect the points 4 and C so as 
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to form a triangle. On the ray on which the length a 

has been marked, now lay off a segment AB of length 
b. Finally, construct a line through the point B parallel 
to the side AC of the triangle constructed earlier, and 
intersecting the other ray in a point D. If the segment 
CD has length x, show that x satis es the proportion 


The curve usually called the quadratrix was invented by 
Hippias of Elis (born about 460 B.C.) to trisect an an- 
gle. The curve acquired its name from its later use in 
the quadrature of the circle. Like his contemporary Hip- 
pocrates, Hippias was one of the rst to teach for money, 


one of the so-called sophists. The word “sophist,” much like the word “tyrant,” did not 
originally have a derogatory meaning although it soon came to receive one. The term 
rst meant “wise man” and only later did it take on the connotation of one who reasons 
adroitly and speciously, rather than soundly. 

The sophists were itinerant teachers, usually from Asia Minor or the Aegean Islands, 
who had acquired learning and experience through wide travel. Whereas the disciples 
of Pythagoras were forbidden to accept fees for sharing their knowledge, the sophists, 
less hampered by tradition, had no such qualms. Shortly after the middle of the fth 
century B.C. several of these wandering lecturers—some of them reputable scholars, some 
outright imposters—arrived in Athens to vend their wares. There was a ready market for 
their talents among the prosperous Athenians, and success there ensured one’s reputation 


throughout Greece, Sicily, and Italy. 


The sophists took all knowledge as their province, but their central subject was the 
art of disputation. They professed to be able to teach their students to speak with clarity 
and persuasion, with the appearance of logic, on any topic whatever, and to defend 
either side of a question successfully. This laid them open to the charge of training in 
cleverness rather than virtue. Their opponents claimed that the sophists taught youth 
“to prove that black is white and to make the worse appear the better.” In spite of the 
criticisms against them, they were very much in demand. Wealthy people took pride in 
entrusting the education of their sons to the best and most famous sophists. In the end, 
their commercialism and the extravagant claims made for their instruction turned Plato 
and others against them, and gave the term “sophist’” its present meaning. 
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Because most of what we know about Hippias’s life and character comes from two 
dialogues of Plato in which sophists are castigated, it is hard to judge him fairly. In 
the Platonic dialogues named after him, Hippias was pictured as an arrogant, boastful 
buffoon. He was made to say that he had earned more money than any other two contem- 
porary sophists and had gained, in spite of the competition from the illustrious Protagoras 
of Abdera (in Thrace), huge sums on his Sicilian lecture tour. His claims were further 
recounted—that if he had received no lecture fees in Sparta and had not been invited 
to teach its youth, it was only because Spartan law prohibited foreign teaching. Hip- 
pias came from Elis, a small state in the northwest corner of the Peloponnesus, whose 
inhabitants had charge of the games that took place every fourth year on the plains of 
Olympia. 

In Plato’s writing, Hippias boasted that on his previous visit to the Olympic festival 
everything that he wore was of his own making, not merely his garments, but also his 
seal ring, oil ask, and sandals. He was said to have brought with him epics, tragedies, 
and all kinds of prose compositions of his own fashioning, and to have been prepared to 
lecture on music, letters, and the art of memory. The secret of Hippias’s wide knowledge 
seems to have been his exceptional memory. If he once heard a string of fty names, 
for instance, he could repeat them all in correct order. The dialogues Hippias Major and 
Hippias Minor, since they were caricatures, are unreliable as portraits—yet they must 
surely have recorded enough of Hippias’s eccentricities that his contemporaries would 
have recognized him. 


Hippias of Elis 


Although we know of no other mathematics that we can attribute to him, Hippias’s 
reputation rests securely on his invention of the quadratrix. It is the rst example of a 
curve that could not be drawn by the traditionally required straightedge and compass 
but had to be plotted point by point. The quadratrix is described by a double motion as 
follows. 


Let a straight line segment AF rotate clockwise about A with a constant velocity from 
the position AB to the position AD, so that a quadrant BED of a circle is described. At 
the same moment that the radius AF leaves its initial position AB, a line MN leaves 
BC and moves down with a constant velocity toward AD, always remaining parallel to 
AD. Both these motions are so timed that AF and MN will reach their ultimate position 
AD at the same moment. Now, at any given instant in their simultaneous movement, the 
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rotating radius and the moving straight line will intersect at a point (F' is a typical point). 
The locus of these points of intersection is the quadratrix. If FH is the perpendicular to 
AD, then the property of the quadratrix is that 


/BAD _ AB __ arc BED 
LEAD FH atc ED’ 


It is worth noting that the de nition does not actually locate any point of the quad- 
ratrix on AD. If the rotating radius and moving straight line are made to end their motions 
together, then they will both coincide with AD, hence will not intersect one another at 
a unique point. The point of the quadratrix on AD (namely the point G) can be located 
only as a limit. 


To see the ease with which the quadratrix can be used to trisect an angle, suppose 
that the given angle is ZXAY. Place this angle at the center of a circle within which the 
quadratrix is constructed, and let XA cut the curve at F. Draw FH perpendicular to AD 
and trisect FH. Through the point P of trisection, draw MN parallel to AD, meeting the 
quadratrix at O. Now join AQ and extend it to meet the quadrant in the point R. Then 
ZDAR is the required angle. From the de nition of the quadratrix, it is easy to prove that 

LDAR PH (4FH) 1 


DAE FH FH 3 


in consequence of which ZDAR = 4 DAE = +LXAY : 

The use of the quadratrix in nding a square equal in area to a given circle is a more 
sophisticated matter and might not have been obvious to Hippias. Pappus, in his large 
compendium Mathematical Collection, made the statement: 


For the squaring of the circle, there was used by Dinostratus, Nicomedes, and some other 
more recent geometers a certain curve which took its name from this property; for it is 
called by them “square-forming” [quadratrix]. 


Hence, any ascription of the curve to Hippias is lacking. And as for Dinostratus (circa 
350 B.c.), nothing more is known of his work than is disclosed by this passage, which 
should remind us of the scantiness of testimony on Greek mathematics and its practition- 
ers. Although there is no universal opinion, Hippias is usually credited with inventing 
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the quadratrix as a device for trisecting angles, and Dinostratus with rst applying it to 
the quadrature of the circle. 

Dinostratus’s solution of the squaring of the circle, as transmitted to us by Pappus, 
requires one to know the position of G, the point at which the quadratrix meets the line 
AD. If it is assumed that G can be found, Pappus’s proposition as he established it is 


arcBED _ AB 

AB AG 
This is proved by a double reductio ad absurdum argument, and provides one of the 
earliest examples in Greek mathematics of the indirect method of reasoning Euclid used 


so extensively. 
B 


A A GK D 


If the ratio (arc BED)/AB is not equal to AB/AG, then it must equal 4B/AK, where 
either AK > AG or AK < AG. Let us begin by assuming that AK > AG. With A as center 
and AK as radius, draw a quarter circle KF'L, intersecting the quadratrix at F and the 
side AB at L. Join AF and extend it to meet the circumference BED at E; also, from F 
draw FH perpendicular to AD. Since corresponding arcs of a circle are proportional to 
their radii, 


arc BED — AB_ 
arc KFL AK’ 
and if the hypothesis is correct, we must have 


arc BED _ AB 
AB OAK’ 
from which it follows that AB = arc KFL. But by the de ning property of the quadratrix, 
it is known that 


AB __ are BED __ are KFL 
FH arcED ~ arc FK’ 


and it was just proved that AB = arc KFL. Therefore, the last relation tells us that FH = 
arc F'K. But this is absurd, for the perpendicular is shorter than any other curve or 
line from F to AD. Thus the possibility that 4K > AG is ruled out. If AK < AG, a 
contradiction is reached in the same manner; hence, we are left with AK = AG and (arc 
BED)/AB = AB/AG. 
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The quadrature problem just described is the quadrature of a quadrant, and Pappus 
took for granted that from this, one would be able to arrive at a square equal in area 
to a circle. For squaring the circle, we shall use Proposition 14 of Book II of Euclid’s 
Elements: “To construct a square equal to a given rectilinear gure.” Let a circle of 
radius r be given. Using the quadratrix, a line segment of length s can be obtained for 
which 


where C is the circumference of the circle. Once the length s is available, it is possible 
to construct a line segment which is the fourth proportional to 7, r and s (see Problem 8 
of Section 3.4). The resulting segment will be equal in length to g = C/4, the quadrant 
arc of the circle. Because the area A of the circle is half the product of its radius and its 
circumference, we have 


1 C 
A=-=rC = 2r | — | = 2rq. 
BT r ( Fi ) rq 
A rectangle with 2r as one side and g as the other will have area equal to 4; a square equal 
in area to the rectangle is easily constructed by means of a semicircle. This is equivalent 
to taking the side x of the required square to be the mean proportional between the line 
segments 2r and q, 


The Grove of Academia: Plato’s Academy 


Most sophists had no permanent residence. They engaged their lecture halls, collected 
their fees for courses of instruction, and then departed. But by the early fourth century 
B.C., many of them had given up their itinerant practices and established themselves in 
Athens. The city began to gain a reputation for scholarship that attracted students from 
near and far. To use Hippias’s words—at least those given in Plato’s Dialogues—Athens 
had become “the very headquarters of Greek wisdom.” 

The most celebrated of the new schools to open in Athens was the Academy of 
Plato, where Aristotle was a student. As a disciple of Socrates, Plato (429-348 B.c.) 
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A mosaic from Pompeii depicting the Academy of Plato. (The Bettmann Archive.) 


had found it expedient to leave Athens after his master was sentenced to drink poison. 
For a dozen years, he traveled in the Mediterranean world, stopping in Egypt, Sicily, 
and southern Italy. In Italy, Plato became familiar with the tenets of the Pythagoreans, 
which may partly explain his appreciation of the universal value of mathematics. On 
his way back to Greece he was sold as a slave by the ship’s captain but was quickly 
ransomed by his friends. About 387 B.c., Plato returned to his native city to establish 
himself as a philosopher. In a grove in the suburbs of Athens, Plato founded a school that 
became, in a sense, the spiritual ancestor of our Western institutions of higher learning. 
The land originally belonged to the hero Academos, so that it was called the grove 
of Academia; and therefore the new school of philosophy was named the Academy. 
After the fashion of that time, legal recognition was secured by making the Academy a 
religious brotherhood, dedicated to the worship of the Muses. Accordingly, it had chapels 
dedicated to these divinities. The Academy was the intellectual center of Greece for 900 
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years, until permanently closed in 529 A.D. by the Christian Emperor Justinian as a place 
of pagan and perverse learning. 

It is through Plato that mathematics reached the place in higher education that it still 
holds. He was convinced that the study of mathematics furnished the nest training of 
the mind and hence was indispensable for philosophers and for those who would govern 
his ideal state. Because he expected those seeking admission to the Academy to be well 
grounded in geometry, he caused to be displayed over its portals the warning inscription, 
“Let no man ignorant of geometry enter here.” It is reported that one of Plato’s successors 
as a teacher in the Academy turned away an applicant who knew no geometry, saying, 
“Depart, for thou hast not the grip of philosophy.” Whether or not these stories are true, 
there is no question that in contrast to the sophists who looked down on the teaching 
of the abstract concepts of the scientist, Plato gave mathematics a favored place in the 
curriculum of the Academy. The importance of arithmetical training, in his view, is that 
“arithmetic has a very great and elevating effect, compelling the mind to reason about 
abstract number.” In speaking of the virtues of mathematics, he was, of course, espousing 
the cause of pure mathematics; by comparison, he thought its practical utility was of no 
account. Plato carried his dislike of “applied mathematics” to the extreme of protesting 
the use of mechanical instruments in geometry, restricting the subject to those gures 
that could be drawn by straightedge and compass. 

Plato was primarily a philosopher rather than a mathematician. So far as mathematics 
is concerned, it is not known that he made any original contribution to the subject matter; 
but as one who inspired and directed other research workers, he performed as great a 
service as any of his contemporaries did. According to the Greek commentator Proclus: 


Plato ...caused mathematics in general, and geometry in particular, to make great advances, 
by reason of his well-known zeal for the study, for he Iled his writings with mathematical 
discourses, and on every occasion exhibited the remarkable connection between mathematics 
and philosophy. 


Most of the mathematical advances that came during the middle of the fourth century B.C. 
were made by the friends and pupils of Plato. Proclus, after giving us a list of names of 
those who contributed to the subject at that time, went on to say, “All these frequented the 
Academy and conducted their investigations in common.” The hand of Plato is also seen 
in the increased attention given to proof and the methodology of reasoning; accurate 
de nitions were formulated, hypotheses clearly laid down, and logical rigor required. 
This collective legacy paved the way for the remarkable systemization of mathematics 
in Euclid’s Elements. 

About 300 B.c., the Platonic Academy found a rival, the Museum, which Ptolemy I 
set up in Alexandria for teaching and research. The talented mathematicians and scientists 
for the most part left Athens and adjourned to Alexandria. Although the main center of 
mathematics had shifted, the direct descendant of Plato’s Academy retained its preemi- 
nence in philosophy until the Emperor Justinian suppressed the philosophical schools of 
Athens, decreeing that only those of the orthodox faith should engage in teaching. Ed- 
ward Gibbon, in The Decline and Fall of the Roman Empire, saw Justinian’s legislation 
of 529 as the death knell of classical antiquity, the triumph of Christian ignorance over 
pagan learning. 


The Gothic arms were less fatal to the schools of Athens than the establishment of a new 
religion whose minister superseded the exercise of reason, resolved every question by an 
article of faith, and condemned the in del or sceptic to eternal ame....The golden chain, 
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as it was fondly styled, continued ... until the edict of Justinian, which imposed perpetual 


silence on the schools of Athens. 


Beyond 529, the institution of higher learning that Plato had founded ceased to be an 


instrument of Greek education. 


3.5 Problems 


1. Complete Dinostratus’s proof of the quadrature of the 
quarter-circle by showing that the assumption 
AK < AG leads to a contradiction. [Hint: Show that 
in the accompanying gure, arc PK = FK, isa 
contradiction. ] 


B a 
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2. (a) Show that in modern polar coordinates, the 
equation of the quadratrix is 
2a0 


: ’ 
sind 


where @ is the angle made by the radius vector 
with AD, and r is the length of the radius 
vector and a the side of the square ABCD. 

(b) Verify that AG = limy_,9r exists, that in fact, 
AG =2a/z. 


3. Proposition 14 of Book I of Euclid’s Elements 
solves the construction: To describe a square that 
shall be equal (in area) to a given rectilinear gure . 
Prove that if ABCD is the given rectangle, AF is the 
diameter of a semicircle, and BFGH is a square, then 
the square is equal in area to the rectangle. 


6 
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4. Show how Dinostratus, having found a line segment 
whose length was one-fourth the circumference of a 
circle, might have used the following theorem— 
stated by Archimedes in his Measurement of a 


Circle—to help square the circle: The area of any 
circle is equal to the area of the right triangle that 
has an altitude equal to the radius of the circle and a 
base equal to the circumference. 


The tomahawk-shaped instrument shown in the 
accompanying gure can be used to solve the 
trisection problem. (PO = OR = RS, with PTR a 
semicircle on PR as diameter, UR perpendicular to 
PS.) If ZAOB is the angle to be trisected, place the 
tomahawk on the angle so that S lies on OA, the line 
segment UR passes through O, and the semicircle 
with diameter PR is tangent to OB at T. Prove that 
the triangles OTQ, ORQ, and ORS are all congruent, 
whence /ROA is one-third of ZAOB. 


The limagon (from the Latin word for “snail,” /imax) 
was discovered by Etienne Pascal (1588-1640), 
father of the better-known Blaise Pascal. The curve is 
based on the circle C of radius 1 with 


center at (1,0); for it is de ned to be the set of all 
points whose distance from the circle C measured 
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along a line through the origin is constantly equal to 
1, the radius of C. Prove that the equation of the 
limagon in polar coordinates is r = 1 + 2cos@, 
hence in rectangular coordinates is 

(x? + y? — 2x)? = x* + y. [Hint: The polar equation 
of the circle C is r = 2cos@.] 


7. Although the limagon was invented for other 
purposes, it was later shown to afford a method for 
trisecting arbitrary angles. Let ABC be any central 
angle in a circle with center B = (1, 0) and radius 1. 
Draw the limacon for the circle and let BA extended 
cut the limacon in the point D. Let the line from the 
origin O to D meet the circle at E, as shown in the 
gure. Prove that angle BDE is one-third as large as 
angle ABC. [Hint: ABC = ZBOD + ZBDO = 
ZBEO + LBDE = LBDE + ZEBD + £BDE = 
3ZBDE..| 


(IK 
Nee 
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CHAPTER 4 
The Alexandrian School: Euclid 


It is the glory of geometry that from so few principles, fetched from without, it is able to accomplish 


so much. 
ISAAC NEWTON 


Toward the end of the fourth century B.c., the scene 


Euclid and the Elements of mathematical activity shifted from Greece to Egypt. 
The battle of Chaeronea, won by Philip of Macedon in 

A Center of Learning: 338 B.C., saw the extinction of Greek freedom as well 

The Museum as the decay of productive genius on its native soil. Two 


years later, Philip was murdered by a discontented noble 
and was succeeded by his 20-year-old son, Alexander the Great. Alexander conquered a 
great part of the known world within 12 years, from 334 B.C. to his death in 323 B.C., 
at the age of 33. Because his armies were mainly Greek, he spread Greek culture over 
wide sections of the Near East. What followed was a new chapter of history, known as 
the Hellenistic (or Greek-like) Age, which lasted for three centuries, until the Roman 
Empire was established. 

Alexander’s great monument in Egypt was the city that still bears his name, Alexan- 
dria. Having taken and destroyed the Phoenician seaports in a victorious march down 
the Eastern Mediterranean, Alexander was quick to see the potential for a new maritime 
city (a sort of Macedonian Tyre) near the westernmost mouth of the Nile. But he could 
do little more than lay out the site, because he departed for the conquest of Persia soon 
afterward. The usual story is that Alexander, with no chalk at hand to mark off the streets, 
used barley from the commissary instead. This seemed like a good idea until clouds of 
birds arrived from the delta and ate the grain as fast as it was thrown. Disturbed that this 
might be a bad omen, Alexander consulted a soothsayer, who concluded that the gods 
were actually showing that the new city would prosper and give abundant riches. 

At Alexander’s death, one of his leading generals, Ptolemy, became governor of 
Egypt and completed the foundation of Alexandria. The city had the advantage of a superb 
harbor and docking facilities for 1200 ships, so it became with the shortest possible delay 
the trading center of the world, the commercial junction point of Asia, Africa, and Europe. 
Alexandria soon outshone and eclipsed Athens, which was reduced to the status of an 
impoverished provincial town. For nearly a thousand years, it was the center of Hellenistic 
culture, growing in the later years of the Ptolemaic dynasty to an immense city of a million 
people. Following its sacking by the Arabs in A.D. 641, the building of Cairo in 969, and 
the discovery of a shipping route around the Cape of Good Hope, Alexandria withered 
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away, and by the time of the Napoleonic expedition its population had dwindled to a mere 
4000. 

The early Ptolemies devoted themselves to making Alexandria the center of intel- 
lectual life for the whole eastern Mediterranean area. Here they built a great center of 
learning in the so-called Museum (seat of the Muses), a forerunner of the modern uni- 
versity. The leading scholars of the times—scientists, poets, artists, and writers—came to 
Alexandria by special invitation of the Ptolemies, who offered them hospitality as long 
as they wished to stay. At the Museum, they had leisure to pursue their studies, access 
to the nest libraries, and the opportunity of discussing matters with other resident spe- 
cialists. Besides free board and exemption from taxes, the members were granted salary 
stipends, the only demand being that they give regular lectures in return. These fellows 
of the Museum lived at the king’s expense in luxurious conditions, with lecture rooms 
for their discussions, a colonnaded walkway in which to stroll, and a vast dining hall, 
where they took their meals together. The poet Theocritus, enjoying the bounty, hailed 
Ptolemy as “the best paymaster a free man can have.” And another sage, Ctesibius of 
Chalcis, when asked what he gained from philosophy, candidly replied, “Free dinners.” 

Built as a monument to the splendor of the Ptolemies, the Museum was nonetheless a 
milestone in the history of science, not to mention royal patronage. It was intended as an 
institution for research and the pursuit of learning, rather than for education; and for two 
centuries scholars and scientists ocked to Egypt. At its height, this center must have had 
several hundred specialists, whose presence subsequently attracted many pupils eager to 
develop their own talents. Although one poet of the time contemptuously referred to the 
Museum as a birdcage in which scholars fattened themselves while engaging in trivial 
argumentation, science and mathematics ourished with remarkable success. Indeed, it 
is frequently observed that in the history of mathematics there is only one other span 
of about 200 years that can be compared for productivity to the period 300-100 B.c., 
namely the period from Kepler to Gauss (1600-1850). 

Scholars could not get along without books, so the rst need was to collect 
manuscripts; when these were suf ciently abundant, a building was required to hold 
them. Established almost simultaneously with the Museum and adjacent to it was the 
great Alexandrian library, housing the largest collection of Greek works in existence. 
There had of course been libraries before it, but not one possessed the resources that 
belonged to the Ptolemies. Manuscripts were of cially sought throughout the world, and 
their acquisition was vigorously pressed by agents who were commissioned to borrow 
old works for copying if they could not otherwise be obtained; travelers to Alexandria 
were required to surrender any books that were not already in the library. Many stories 
are told of the high-handed methods by which the priceless manuscripts were acquired. 
One legend has it that Ptolemy III borrowed from Athens the rolls kept by the state con- 
taining the authorized texts of the writers Aeschylus, Sophocles, and Euripides. Although 
he had to make a deposit as a guarantee that the precious volumes would be returned, 
Ptolemy kept the original rolls and sent back the copies (needless to say, he forfeited the 
deposit). A staff of trained scribes catalogued the books, edited the texts that were not 
in good condition, and explained those works of the past that were not easily understood 
by a new generation of Greeks. 

The Alexandrian library was not entirely without rivals in the ancient world. The most 
prominent rival was in Pergamon, a city in western Asia Minor. To prevent Pergamon 
from acquiring copies of their literary treasures, the jealous Ptolemies, it is said, prohibited 
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the export of papyrus from Egypt. Early writers were careless with numbers and often 
exaggerated the size of the library. Some accounts speak of the main collection at the 
library as having grown to 300,000 or even 500,000 scrolls in Caesar’s time (48 B.C.), 
with an additional 200,000 placed in the annex called the Serapeum. The collection 
had been built partly by the purchase of private libraries, one of which, according to 
tradition, was Aristotle’s. After the death of Aristotle, his personal papers passed into 
the hands of a collector who, fearing that they would be con scated for the library at 
Pergamon, hid all the manuscripts in a cave. The scrolls were badly damaged by insects 
and moisture, and the Alexandrian copyists made so many errors when restoring the texts 
that they no longer agreed with the versions of Aristotle’s works already housed in the 
library. 


Euclid’s Life and Writings 


Before the Museum passed into oblivion in A.D. 641, it produced many distinguished 
scholars who were to determine the course of mathematics for many centuries: Euclid, 
Archimedes, Eratosthenes, Apollonius, Pappus, Claudius Ptolemy, and Diophantus. Of 
these, Euclid (circa 300 B.C.) is in a special class. Posterity has come to know him as the 
author of the Elements of Geometry, the oldest Greek treatise on mathematics to reach us 
in its entirety. The Elements is a compilation of the most important mathematical facts 
available at that time, organized into 13 parts, or books, as they were called. (Systematic 
expositions of geometry had appeared in Greece as far back as the fth century B.c., but 
none have been preserved, for the obvious reason that all were supplanted by Euclid’s 
Elements.) Although much of the material was drawn from earlier sources, the superbly 
logical arrangement of the theorems and the development of proofs displays the genius 
of the author. Euclid uni ed a collection of isolated discoveries into a single deductive 
system based on a set of initial postulates, de nitions, and axioms. 

Few books have been more important to the thought and education of the Western 
world than Euclid’s Elements. Scarcely any other book save the Bible has been more 
widely circulated or studied; for 20 centuries, the rst six books were the student’s usual 
introduction to geometry. Over a thousand editions of the Elements have appeared since 
the rst printed version in 1482; and before that, manuscript copies dominated much of 
the teaching of mathematics in Europe. Unfortunately, no copy of the work has been 
found that actually dates from Euclid’s own time. Until the 1800s, most of the Latin 
and English editions were based ultimately on a Greek revision prepared by Theon of 
Alexandria (circa 365) some 700 years after the original work had been written. But in 
1808, it was discovered that a Vatican manuscript that Napoleon had appropriated for 
Paris represented a more ancient version than Theon’s; from this, scholars were able to 
reconstruct what appears to be the de nitive text. 

Although the fame of Euclid, both in antiquity and in modern times, rests almost 
exclusively on the Elements, he was the author of at least 10 other works covering a 
wide variety of topics. The Greek text of his Data, a collection of 95 exercises prob- 
ably intended for students who had completed the Elements, is the only other text by 
Euclid on pure geometry to have survived. A treatise, Conic Sections, which formed 
the foundation of the rst four books of Apollonius’s work on the same subject, has 
been irretrievably lost, and so has a three-volume work called Porisms (the term porism 
in Greek mathematics means “a corollary”). The latter is the most grievous loss, for it 
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Euclid 
(circa 300 Bc.) 


een _- : r fc: én (Smithsonian Institution.) 


apparently was a book on advanced geometry, perhaps an ancient counterpart to analytic 
geometry. 

As with the other great mathematicians of ancient Greece, we know remarkably little 
about the personal life of Euclid. That Euclid founded a school and taught in Alexandria 
is certain, but nothing more is known save that, the commentator Proclus has told us, 
he lived during the reign of Ptolemy I. This would indicate that he was active in the 
rst half of the third century B.C. It is probable that he received his own mathematical 
training in Athens from the pupils of Plato. Two anecdotes that throw some light on the 
personality of the man have Itered down to us. Proclus, who wrote a commentary to the 
Elements, related that King Ptolemy once asked him if there was not a shorter way to 
learning geometry than through the Elements, to which he replied that there is “no royal 
road to geometry”—implying thereby that mathematics is no respecter of persons. The 
other story concerns a youth who began to study geometry with Euclid and inquired, after 
going through the rst theorem, “But what shall I get by learning these things?” After 
insisting that knowledge was worth acquiring for its own sake, Euclid called his servant 
and said, “Give this man a coin, since he must make a prot from what he learns.” 
The rebuke was probably adapted from a maxim of the Pythagorean brotherhood that 
translates roughly as, “A diagram and a step (in knowledge), not a diagram and a coin.” 


For more than two thousand years Euclid has been the hon- 


4.2 Euclidean Geometry ored spokesman of Greek geometry, that most splendid cre- 
ation of the Greek mind. Since his time, the study of the 

Euclid’s Foundation Elements, or parts thereof, has been essential to a liberal ed- 

for Geometry ucation. Generation after generation has regarded this work 


as the summit and crown of logic, and its study as the best 
way of developing facility in exact reasoning. Abraham Lincoln at the age of 40, while 
still a struggling lawyer, mastered the rst six books of Euclid, solely as training for 
his mind. Only within the last hundred years has the Elements begun to be supplanted 
by modern textbooks, which differ from it in logical order, proofs of propositions, and 
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applications, but little in actual content. (The rst real pedagogical improvement was 
by Adrien-Marie Legendre, who in his popular Eléments de Géométrie, rearranged and 
simpli ed the propositions of Euclid. His book ran from an initial edition in 1794 to a 
twelfth in 1823.) Nevertheless, Euclid’s work largely remains the supreme model of a 
book in pure mathematics. 

Anyone familiar with the intellectual process realizes that the content of the Elements 
could not be the effort of a single individual. Unfortunately, Euclid’s achievement has 
so dimmed our view of those who preceded him that it is not possible to say how far 
he advanced beyond their preparatory work. Few, if any, of the theorems established 
in the Elements are of his own discovery; Euclid’s greatness lies not so much in the 
contribution of original material as in the consummate skill with which he organized 
a vast body of independent facts into the de nitive treatment of Greek geometry and 
number theory. The particular choice of axioms, the arrangement of the propositions, 
and the rigor of demonstration are personally his own. One result follows another in 
strict logical order, with a minimum of assumptions and very little that is super uous. 
So vast was the prestige of the E/ements in the ancient world that its author was seldom 
referred by name but rather by the title “The Writer of the Elements” or sometimes 
simply “The Geometer.” 

Euclid was aware that to avoid circularity and provide a starting point, certain facts 
about the nature of the subject had to be assumed without proof. These assumed state- 
ments, from which all others are to be deduced as logical consequences, are called 
the “axioms” or “postulates.” In the traditional usage, a postulate was viewed as a “self- 
evident truth”; the current, more skeptical view is that postulates are arbitrary statements, 
formulated abstractly with no appeal to their “truth” but accepted without further justi - 
cation as a foundation for reasoning. They are in a sense the “rules of the game” from 
which all deductions may proceed—the foundation on which the whole body of theorems 
rests. 

Euclid tried to build the whole edice of Greek geometrical knowledge, amassed 
since the time of Thales, on ve postulates of a speci cally geometric nature and ve 
axioms that were meant to hold for all mathematics; the latter he called common no- 
tions. (The rst three postulates are postulates of construction, which assert what we 
are permitted to draw.) He then deduced from these 10 assumptions a logical chain of 
465 propositions, using them like stepping-stones in an orderly procession from one 
proved proposition to another. The marvel is that so much could be obtained from so 
few sagaciously chosen axioms. 

Abruptly and without introductory comment, the rst book of the Elements opens 
with a list of 23 de nitions. These include, for instance, what a point is (“that which has 
no parts”) and what a line is (“being without breadth”). The list of de nitions concludes: 
“Parallel lines are straight lines which, being in the same plane and being produced 
inde nitely in both directions, do not meet one another in either direction.” These would 
not be taken as de nitions in a modern sense of the word but rather as naive descriptions 
of the notions used in the discourse. Although obscure and unhelpful in some respects, 
they nevertheless suf ce to create certain intuitive pictures. Some technical terms that 
are used, such as circumference of a circle, are not de ned at all, whereas other terms, 
like rhombus, are included among the de nitions but nowhere used in the work. It is 
curious that Euclid, having de ned parallel lines, did not give a formal de nition of 
parallelogram. 


146 


Chapter 4 The Alexandrian School: Euclid 


Euclid then set forth the 10 principles of reasoning on which the proofs in the 
Elements were based, introducing them in the following way: 


Postulates 


Let the following be postulated: 


A straight line can be drawn from any point to any other point. 
A nite straight line can be produced continuously in a line. 
A circle may be described with any center and distance. 


All right angles are equal to one another. 


Pie aN oe 


If a straight line falling on two straight lines makes the interior angles on the same 
side less than two right angles, then the two straight lines, if produced inde nitely 
meet on that side on which the angles are less than two right angles. 


Common Notions 


Things that are equal to the same thing are also equal to one another. 
If equals are added to equals, the wholes are equal. 
If equals are subtracted from equals, the remainders are equal. 


Things that coincide with one another are equal to one another. 


Aa a oS So 


The whole is greater than the part. 


Postulate 5, better known as Euclid’s parallel postulate, has become one of the most 
famous and controversial statements in mathematical history. It asserts that if two lines / 
and /’ are cut by a transversal ¢ so that the angles a and b add up to less than two right 
angles, then / and /’ will meet on that side of t on which these angles lie. The remarkable 
feature of this postulate is that it makes a positive statement about the whole extent of a 
straight line, a region for which we have no experience and that is beyond the reach of 
possible observation. 


Those geometers who were disturbed by the parallel postulate did not question that 
its content was a mathematical fact. They questioned only that it was not brief, simple, 
and self-evident, as postulates were supposed to be; its complexity suggested that it 
should be a theorem instead of an assumption. The parallel postulate is actually the 
converse of Euclid’s Proposition 27, Book I, the thinking ran, so it should be provable. 
It was thought impossible for a geometric statement not to be provable if its converse 
was provable. There is even some suggestion that Euclid was not wholly satis ed with 


Euclidean Geometry 


PROPOSITION 1 


147 


his fth postulate; he delayed its application until he could advance no further without 
it, though its earlier use would have simpli ed some proofs. 

Almost from the moment the E/ements appeared and continuing into the nineteenth 
century, mathematicians have tried to derive the parallel postulate from the rst four 
postulates, believing that these other axioms were adequate for a complete development 
of Euclidean geometry. All these attempts to change the status of the famous asser- 
tion from “postulate” to “theorem” ended in failure, for each attempt rested on some 
hidden assumption that was equivalent to the postulate itself. Futile so far as the main 
objective was concerned, these efforts led nevertheless to the discovery of non-Euclidean 
geometries, in which Euclid’s axioms except the parallel postulate all hold and in which 
Euclid’s theorems except those based on the parallel postulate all are true. The mark 
of Euclid’s mathematical genius is that he recognized that the fth postulate demanded 
explicit statement as an assumption, without a formal proof. 

Detailed scrutiny for over 2000 years has revealed numerous aws in Euclid’s treat- 
ment of geometry. Most of his de nitions are open to criticism on one ground or another. 
It is curious that while Euclid recognized the necessity for a set of statements to be as- 
sumed at the outset of the discourse, he failed to realize the necessity of unde ned_ terms. 
A de nition, after all, merely gives the meaning of a word in terms of other, simpler 
words—or words whose meaning is already clear. These words are in their turn de ned 
by even simpler words. Clearly the process of de nition in a logical system cannot be 
continued backward without an end. The only way to avoid the completion of a vicious 
circle is to allow certain terms to remain unde ned. 

Euclid mistakenly tried to dene the entire technical vocabulary that he used. 
Inevitably this led him into some curious and unsatisfactory de nitions. We are told 
not what a point and a line are but rather what they are not: “A point is that which 
has not parts.” “A line is without breadth.” (What, then, is part or breadth?) Ideas of 
“point” and “line” are the most elementary notions in geometry. They can be described 
and explained but cannot satisfactorily be de ned by concepts simpler than themselves. 
There must be a start somewhere in a self-contained system, so they should be accepted 
without rigorous de nition. 

Perhaps the greatest objection that has been raised against the author of the E/ements 
is the woeful inadequacy of his axioms. He formally postulated some things, yet omitted 
any mention of others that are equally necessary for his work. Aside from the obvious 
failure to state that points and lines exist or that the line segment joining two points is 
unique, Euclid made certain tacit assumptions that were used later in the deductions but 
not granted by the postulates and not derivable from them. Quite a few of Euclid’s proofs 
were based on reasoning from diagrams, and he was often misled by visual evidence. 
This is exemplied by the argument used in his very rst proposition (more a problem 
than a theorem). It involved the familiar construction of an equilateral triangle on a given 
line segment as base. 


For a line segment AB, there is an equilateral triangle having the segment as one of its 
sides. 


Proof. Using Postulate 3, describe a circle with center A and radius AB passing 
through point B. Now, with center B and radius AB, describe a circle passing 
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through A. From the point C, in which the two circles cut one another, draw the 
segments CA and CB (Postulate | allows this), thereby forming a triangle ABC. It is 
seen that AC = AB and BC = AB because they are radii of the same circle. It then 
follows from Common Notion | that 4B = BC = AC, and so triangle ABC is 


equilateral. 
Gc 


There is only one problem with all this. On the basis of spatial intuition, one feels 
certain that the two circles will intersect at a point C and will not, somehow or other, 
slip through each other. Yet the purpose of an axiomatic theory is precisely to provide 
a system of reasoning free of the dependence on intuition. The whole proposition fails 
if the circles we are told to construct do not intersect, and there is unhappily nothing in 
Euclid’s postulates that guarantees that they do. To remedy this situation, one must add 
a postulate that will ensure the “continuity” of lines and circles. Later mathematicians 
satisfactorily Iled the gap with the following: 


If a circle or line has one point outside and one point inside another circle, then it has two 
points in common with the circle. 


The mere statement of the postulate involves notions of “inside” and “outside” that do 
not explicitly appear in the Elements. If geometry is to ful ll its reputation for logical 
perfection, considerable attention must be paid to the meaning of such terms and to the 
axioms governing them. 

During the last 25 years of the nineteenth century, many mathematicians attempted 
to give a complete statement of the postulates needed for proving all the long-familiar 
theorems of Euclidean geometry. They tried, that is, to supply such additional postulates 
as would give explicitness and form to the ideas that Euclid left intuitive. By far the most 
in uential treatise on geometry of modern times was the work of the renowned German 
mathematician David Hilbert (1862—1943). Hilbert, who worked in several areas of math- 
ematics during a long career, published in 1899 his main geometrical work, Grundlagen 
der Geometrie (Foundations of Geometry). In it he rested Euclidean geometry on 21 
postulates involving six unde ned terms—with which we should contrast Euclid’s ve 
postulates and no unde ned terms. 


Book I of the Elements 


The 48 propositions of the rst book of the Elements deal mainly with the properties 
of straight lines, triangles, and parallelograms—what today we should call elementary 
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plane geometry. Much of this material is familiar to any student who has had a traditional 
high-school course in plane and solid geometry. Although we shall not examine all these 
results in detail, Proposition 4 is one that deserves a close look. This proposition is 
called the side-angle-side theorem, for it contains the familiar criterion for congruence of 
triangles, namely, two triangles are congruent if two sides and the included angle of one 
are congruent to the corresponding sides and included angle of the other. We have used 
the word congruent where Euclid spoke of equality. When he referred to two angles 
(or for that matter, two line segments) as “equal,” he meant that they could be made to 
coincide. For our purposes, it is safe to think of congruent objects as having the same 
size and shape. 

Euclid tried to give a proof of the side-angle-side theorem by picking up one triangle 
and superimposing it on the other triangle so that the remaining parts of the two triangles 
tted. His argument, which was supposedly valid by Common Notion 4, ran substantially 
as follows: Given AABC and AA’B’'C’, where AB = A'B’, LA = ZA’, and AC= A'C’, 
move AABC so as to place point A on point A’ and side AB on side A’B’. Because 
AB = A’B’, point B must fall on point B’. Because ZA = ZA’, the side AC has the same 
direction as side A’C’, and because of the equal lengths of AC and A’C’, the points C 
and C’ fall on each other. Now, if B and B’ coincide and C and C’ coincide, so must 
the connecting line segments BC and B’C’. The two triangles coincide in all respects, 
so it follows that they are congruent. 


Cc Cc’ 


A B A’ B' 


Although this “principle of superposition” may seem reasonable enough in dealing 
with material triangles made of wire or wood, its legitimacy has been questioned for 
working with conceptual entities whose properties exist only because they have been 
postulated. Indeed, the prominent British logician Bertrand Russell (1872-1970) spoke 
of superposition in no uncertain terms as a “tissue of nonsense.” The chief criticism is 
that in assuming that a triangle can be moved about without any alteration in its internal 
structure, when it is only known that two sides and an included angle remain constant, one 
is really assuming that these determine the rigidity of the triangle. Thus, in postulating 
the possibility of movement without change in form or magnitude, congruence itself is 
actually being postulated. Euclid’s proof is therefore a vicious circle of reasoning. It has 
been conjectured that Euclid felt reluctant to use superposition in proving congruence 
and did so sparingly in the Elements but could not dispense with it entirely, for lack of a 
better method. Present-day mathematicians avoid the dif culty by taking the side-angle- 
side theorem as an axiom from which the other congruence theorems are then derived. 
At any rate, Euclid’s approach to the problem of congruence was logically de cient. 

Perhaps the most famous of the earlier propositions of Book I is Proposition 5, which 
states, “In an isosceles triangle, the angles at the base are congruent to one another.” 
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(Here, by angles at the base is meant the angles opposite the two congruent sides.) This 
proposition sometimes marked the limit of the instruction in Euclid in the universities 
of the Middle Ages. It is historically interesting as having been called “elefuga,” a me- 
dieval term meaning “the ight of the fools,” because at this point the student usually 
abandoned geometry. Another name commonly used for Proposition 5 is pons asinorum, 
a Latin phrase signifying “bridge of fools,” or “bridge of asses,” although opinion is 
not unanimous about the exact implication of the title. The name might have been sug- 
gested by the dif culties that poor geometers have with the proposition; anyone unable 
to proceed beyond it must be a fool. A more generous interpretation is that the diagram 
that accompanies Euclid’s proof resembles a trestlebridge so steep that a horse could not 
climb the ramp, though a sure-footed animal such as an ass could. Perhaps only the sure- 
footed student could proceed beyond this stage in geometry. Here is an abbreviated proof 
of Euclid’s Proposition 5. The contention is that in a triangle ABC, where AB = AC, one 
has ABC = ACB. To validate this, select points F and G on the extensions of sides 
AB and AC such that AF = AG. 


5) 


F G 


Then triangles AFC and AGB will be congruent, by the side-angle-side proposition. 
Indeed, they have a common angle at A, while AC = AB and AF = AG. By the de nition 
of congruent triangles, all the corresponding parts are equal, so that the bases FC = GB, 
LACF = LABG, and ZAFC = AGB. It is worth noticing too that 


FB = AF — AB = AG—AC=GC. 


The implication is that triangles BFC and CGB are themselves congruent (also by the 
side-angle-side proposition), whence as corresponding angles, 4BCF = /CBG. This last 
equality, together with the fact that ZA4BG = ZACF, tells us that 


LABG — LCBG = LACF — LBCF, 


or ZABC = ZACB. 

Fortunately, there is a far simpler proof of this proposition (attributed to Pappus of 
Alexandria, A.D. 300), which requires no auxiliary lines whatever. The pertinent obser- 
vation is that nowhere in the statement of the side-angle-side proposition is it required 
that the two triangles be distinct. The details are as follows. Given the isosceles triangle 
ABC, where AB = AC, think of it in two ways, one way as triangle ABC and the other as 
triangle ACB. Thus, there is a correspondence between AABC and AACB with vertices 
A, B, and C corresponding to vertices A, C, and B, respectively. Under this correspon- 
dence, AB = AC, AC = AB, and BAC = /CAB. Thus, two sides and an included angle 
are congruent to the parts that correspond to them, whence the triangles are congruent. 
This means that all the parts in one triangle are equal to the corresponding parts in the 
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A page from the rst printed edition of Euclid’s Elements. Published in Latin in 
1482. (Courtesy of Burndy Library.) 


other triangle, and in particular, ABC = ZACB, which was to be proved. 


A A 


B Cc C B 


A result about triangles that Euclid found very useful in his development of geometry 
is the exterior angle theorem. This theorem is the embodiment of practically all the 
Euclidean axioms, for nearly all are used in its proof. 
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PROPOSITION 16 


If one of the sides of a triangle is produced, then the exterior angle is greater than 
either opposite interior angle. 


Proof. Let ABC be any triangle and pick D to be any point on the extension of side 
BC through C. Call E the midpoint of AC; extend the line segment BE to a point F 
so that BE = EF. Because AE = EC, BE = EF, and AEB = LFEC (vertical angles 
are equal by Proposition 15), the triangles AEB and FEC are congruent, from the 
side-angle-side proposition. The result is that LBAE = ZFCE. But according to 
Common Notion 5, the whole is greater than any of its parts, so that ZDCA > 


/FCE. Hence the exterior angle DCA is greater than /BAE, which is an opposite 
interior angle of this triangle. 


B 


Likewise, by extending side AC to a point G, it can be shown that ZGCB > ZABC. 
Because ZGCB and ZDCA are vertical angles (hence equal), we immediately have 
DCA greater than ZABC, the other opposite interior angle. 


Aside from the fact that the existence of midpoints must rst be established, 
the main aw in this argument is Euclid’s assumption from his diagram that if the 
segment BE is extended, the point F is always “inside” angle DCA. On the basis of 
the postulates, he assumes—as distinct from the diagram—there is nothing to justify 
this conclusion. If the diagram is drawn instead on the curved surface of a sphere, then 
when BE is extended its own length to F’, the point F ends up on the far side of the 


sphere, and BF may be so long that F' falls “outside” angle DCA. Instead of having 
ZDCA > LFCE, just the reverse would be true. 


The underlying dif culty is that in making his so-called proof, Euclid took it for 
granted that a line is innite. The critical postulate in this regard, Postulate 2, asserts 
merely that a line can be produced continuously—that it is endless or boundless—but 
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does not necessarily imply that a line is in nite. On a sphere, where the role of a line is 
played by a great circle (a circle that has the same center as the sphere itself), a line that 
is produced from a given point will eventually return to that point. Because Euclid was 
not thinking of such a possibility, he apparently had no misgivings in proceeding on the 
basis of Postulate 2. 

The rst 26 propositions of the Elements develop theorems on congruent triangles, 
on isosceles triangles, and on the construction of perpendiculars. One also nds among 
the results the exterior angle theorem and the fact that the sum of two sides of a triangle 
is greater than the third side. The subject matter is based mainly on very ancient sources. 
There is a denite change of character beginning with Proposition 27; here, Euclid 
introduced the theory of parallels, but still without making use of his parallel postulate. 

Euclid de ned_ two lines as parallel if they did not intersect, that is, if no point lay on 
both of them. Euclid could have used the exterior angle theorem, although he did not do 
so, to prove the existence of parallel lines. (Or he could have added an extra postulate to 
the effect that parallel lines actually existed.) To see that this is possible, let / be any line 
and at each of two distinct points A and B on/ erect a perpendicular to / (Proposition 11 
allows this). If these perpendiculars were to meet at a point C, then in triangle ABC 
the exterior angle at B and the opposite interior angle at A, since they are right angles, 
would be equal. Because Proposition 16 is then violated, the two perpendiculars to / 
cannot meet; in other words, they are parallel. 


To make the next proposition precise, we require a de nition. Suppose a line ¢ 
(called a “transversal’”’) intersects lines / and /' at two distinct points A and B. In the 
accompanying gure, angles c, d, e, and f are called interior angles, while a, b, g, h 
are exterior angles. The usual language is to refer to the pair of angles c and e (d and 
f) as “alternate interior angles,” b and fA (a and g) as “alternate exterior angles.” 


The eight angles may also be grouped into four pairs of corresponding angles; angles a 
and e form such a pair of corresponding angles, and so do the pairs b and f, c and g, 
and d and h. 

With this terminology at hand, let us consider another proposition. 
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PROPOSITION27 If two lines are cut by a transversal so as to form a pair of congruent alternate interior 
angles, then the lines are parallel. 


Proof. Referring to the gure, let the transversal t intersect lines / and /’ at points A 
and B, so as to form a pair of alternate interior angles, say, 4b and Zc, which are 
equal. To achieve a contradiction, assume that lines / and /’ are not parallel. Then 
they will meet at a point C that lies, let us say, on the right side of ¢ so as to form a 
triangle ABC. It can be concluded that an exterior angle (in this case, Zb) is 
congruent to an opposite interior angle of triangle ABC (namely, Zc). But we know 
that this is impossible, for an exterior angle of a triangle is always greater than 
either opposite interior angle. In consequence, / and /' are parallel. 


Proposition 27 implies that if two lines are perpendicular to the same line, then the 
two lines are parallel. From this fact, it is an easy matter to establish that through any 
point P that is not on a given line /, there passes a line /’ that is parallel to /. All we need 
do is drop a perpendicular from P to the line / with foot at O (Proposition 12 allows 
this) and at P to erect a line /’ that is perpendicular to PQ (the construction is given in 
Proposition 11). Because / and /' have a common perpendicular, they must be parallel, 
with /’ through P. 

Let us pass over Proposition 28, which is just a variation of Proposition 27, and next 
examine Euclid’s Proposition 29. It states the converses of the preceding two proposi- 
tions. To this point, all the results have been obtained without any reference to the parallel 
postulate. They are, as we say, independent of it and would still be valid if the fth pos- 
tulate were deleted, or replaced by another one compatible with the remaining postulates 
and common notions. To prove Proposition 29, we must use the parallel postulate for the 
rst time. 


PROPOSITION29 A transversal falling on two parallel lines makes the alternate interior angles congruent 
to one another, the corresponding angles congruent, and the sum of the interior angles 
on the same side of the transversal congruent to two right angles. 


Proof. Suppose that the lines and angles are labeled as in the gure. We conclude at 
once that because Za and /b are supplementary angles, /a plus 2b equals two right 
angles (this is the content of Proposition 13). If Za > Zc, then Za+ 4b > Zc + Lb, 
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and 2b + Zc would be less than two right angles. It would follow 


t 


from Postulate 5 that / and /' must meet to the right of ¢. But this contradicts that / 
and /’ are parallel. Thus, it cannot happen that Za > Zc, or to put it in the 
afrmatively, Za < Zc. A like contradiction arises when we assume that the 
inequality Za < Zc holds; therefore Za = Zc. Because Zc and Ze are vertical angles, 
they are equal, whence Za = Ze. Finally, observe that the sum Za + 2b equals two 
right angles and 4a = Zc, so that the sum 2b + Zc of the interior angles 2b and Zc 
equals two right angles. 


ee 


It is worth noticing that Propositions 27 and 29 both provide proofs by contradiction, 
sometimes called reductio ad absurdum proofs. This is an important form of reasoning 
that consists in showing that if the conclusion is not accepted, then absurd or impos- 
sible results must follow. The element that produces the contradiction is different in 
each proposition. In Proposition 27, one ends up contradicting the exterior angle theo- 
rem, whereas in the case of Proposition 29, it is the parallel postulate that provides the 
absurdity. 

Moving further with these ideas, we look at another important result, namely Propo- 
sition 30. 


PROPOSITION 30 Two lines parallel to the same line are parallel to one another. 


Proof. Suppose that each of the lines / and /' is parallel to the line k. We claim that 
/ is also parallel to /’. Let these lines be cut by the transversal ¢, as indicated in the 
gure. Because ¢ has fallen on the parallel lines / and k, the angle a equals the angle 
b by Proposition 29. Likewise, since ¢ has fallen on the parallel lines & and /’, the 
angles 6 and c are equal. But then Za = Zc (this is Common Notion 1). Because 
these are alternate interior angles, it is apparent by Proposition 27 that / and /’ are 
parallel. 
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One implication of Proposition 30 is that through a point P not on a given line /, 
there cannot be more than one line parallel to 7. The argument is as follows. Suppose 
there were two distinct lines through P, each parallel to /; then from Proposition 30, 
they would be parallel to each other. This would, by the meaning of parallel, contradict 
that the lines intersect at P. 

We should stress this last point before temporarily abandoning the subject of parallel 
lines. Euclid did not require the parallel postulate to know that parallel lines exist, or 
what is more important, that it is possible to construct a parallel to a given line through 
an external point. The primary effect of Postulate 5 is to ensure that there exists only 
one line parallel to the given line through a point not on the line. 

Throughout Book I, Euclid went forward in a logical chain of propositions until 
his nal goal was reached. The work on parallel lines culminates with the result that 
the sum of the angles of a triangle is congruent to two right angles. The proof rests on 
Proposition 29 and hence implicitly involves the parallel postulate. It is surprising how 
many notable consequences of Euclidean geometry besides the properties of parallel lines 
stem, directly or indirectly, from this postulate. 


In any triangle, the sum of the three interior angles is equal to two right angles. 


Proof. Given a triangle ABC with angles a, b, and c, extend the side AB to a point 
D and through B draw a line / parallel to side AC. 


Cc I 


But Zc = Ze, since they are alternate interior angles formed by / and AC with BC. 
Similarly, Proposition 29 guarantees that Za = 4d. Now, the sum Zh + Ze + Ld 
equals two right angles (this is the content of Proposition 13), and so the sum of the 
interior angles of AABC must equal two right angles. 


Euclid’s Proof of the Pythagorean Theorem 


Book I closes—in Propositions 47 and 48—with a remarkably clever proof of the 
Pythagorean theorem and its converse. Although few of the propositions and proofs in 
the Elements are Euclid’s own discoveries, this proof of the Pythagorean theorem is 
usually ascribed to Euclid himself. Proclus wrote, “I admire the writer of the Elements 
not only that he gave a very clear proof of this proposition, but that in the sixth book, he 
also explained the more general proposition by means of an irrefutable argument.” On 
the surface, this suggests that the proof at the end of Book I was Euclid’s own; some 
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authorities contend that it was rst advanced by Eudoxus, who antedated Euclid by at 
least a generation, and the version in which the theory of proportion is applied to the 
sides of similar triangles bears the mark of Thales. 

The proof of the Pythagorean theorem found in Proposition 47 involves the contents 
of Book I only. The feeling that the reasoning is arti cial and unnecessarily intricate led 
the German philosopher Arthur Schopenhauer (1788-1860) to dismiss the demonstration 
with the contemptuous remark that it was not an argument but a “mousetrap.” Thus, 
among the many different names applied to Euclid’s proof, it is not uncommonly called 
“the mousetrap proof.” 


The diagram herewith illustrates Euclid’s proof. Given a right triangle ABC, with 
right angle at C, erect squares on each of the sides. Next, draw the perpendicular from 
C to AB and DE, meeting these sides at the points J and K, respectively. The key 
observation is that the rectangle A/KD has twice the area of the triangle CAD: 


(1) AJKD = 2(ACAD). 


This is because each gure has the same base AD and the same altitude A/. In like 
manner, since the lower square AFGC and the triangle FAB have the same base AF’ and 
the same altitude AC, the area of the square is twice the area of the triangle: 


(2) AFGC = 2(AFAB). 


We see that the two triangles CAD and FAB are congruent by the side-angle-side theorem 
(AC = AF, LCAD = CAB + LDAB = LCAB + LCAF = LFAB, and AD = AB), hence 
have the same area; that is, 


(3) ACAD = AFAB. 
Putting relations (1) and (2) together, we conclude at once that 


(4) AJKD = AFGC. 
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By exactly the same reasoning, it can be demonstrated that the rectangle BEKJ and 
square BCH are of equal area: 


(5) BEKJ = BCHI. 


But a glance at the diagram shows that the area of the square on the hypotenuse is the 
sum of the areas of the two rectangles AJKD and BEKJ. Thus, 


(6) ABED = AJKD + BEKJ 
= AFGC + BCHI 


and, with a change of notation, the theorem obtains: 
AB* = AC? + CB’. 


The Pythagorean theorem is immediately followed in the Elements by a proof of its 
converse: If in a triangle ABC the square on one of the sides (say BC) is equal to the 
sum of the squares on the other two sides, the angle contained by these other two sides 
is a right angle. For the proof, Euclid constructed a right triangle congruent to the given 
triangle. Speci cally, the procedure would be to lay off a line segment AD perpendicular 
to AC and equal in length to AB. 


D A B 


By hypothesis, AC? + AB* = BC”, and the Pythagorean theorem (as applied to ACAD) 
implies that AD? + AC” = CD?. Because AD = AB, the implication is that BC? = CD’, 
whence BC = CD. It follows that triangles CAD and CAB are congruent, for their cor- 
responding sides are congruent. Thus ZCAB = ZCAD, a right angle. 

Euclid’s similarity proof of the Pythagorean theorem (Proposition 31 of Book VI) 
had to be delayed, since the plan of the Elements called for the theory of proportion to 
be expounded in Books V and VI. It depends on a property that is characteristic of right 
triangles: A perpendicular from the vertex C of the right angle to the hypotenuse divides 
triangle ABC into two similar right triangles ADC and BDC. Observe that each of the 
new right triangles so formed and the original triangle are equiangular and hence similar. 
As regards triangles ABC and ADC, for instance, we have 4A = /A, since it is common 
to both triangles, and ACB = ZADC, for these are both right angles. The sum of the 
angles in any triangle equals two right angles, so it is equally clear that /B = ZACD. 

Because in Euclidean geometry it is proved that corresponding sides of similar 
triangles are proportional, 
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These proportionality relations imply that 


a’ = cx and b? =cy 


and, by addition, that 
a+b =cxt+ceyae(x+y)=c’. 


You may have gathered that the Elements is not a perfect model of mathematical 
reasoning; critical investigation reveals numerous aws in its logical structure. The truth 
is that so far as Euclid’s aim was to place geometry on an unimpeachable foundation, he 
failed in the attempt. This is not to belittle the work; it was a magni cent achievement, 
a giant step forward marking the real beginning of axiomatic mathematics. Although 
some of its underpinnings have needed shoring up, Euclid’s Elements is still a grand 
work, worthy of study. “This wonderful book,” wrote Sir Thomas Heath, “with all its 
imperfections, which are indeed slight enough when account is taken of the date at which 
it appeared, is and will remain the greatest mathematical textbook of all time.” 


Book II on Geometric Algebra 


Book II of the Elements could be called a treatise on geometric algebra, because it 
is algebraic in substance but geometric in treatment. Algebraic problems are cast entirely 
in geometric language and solved by geometric methods. Lacking any adequate algebraic 
symbolism, Euclid found it necessary to represent numbers by line segments. 


A a B A a B 


Thus, the product ab (as we write it) of two numbers is thought of as the area of a 
rectangle with sides whose lengths are the two numbers a and b. Euclid referred to the 
product as the “rectangle contained by AB = a and BC = b”; in place of a”, he spoke of 
“the square on AB.” Various algebraic identities, even complicated ones, were presented 
by Euclid in purely geometric form. For instance, the identity 


(a+ by =a? +2ab+b’ 
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was pictured in terms of the diagram 


and was quaintly stated in Proposition 4 of Book II as: “Ifa straight line be cut at random 
(into two parts a and b), the square on the whole is equal to the square on the two parts 
and twice the rectangle contained by the parts.” 

By Euclid’s time, Greek geometric algebra had reached a stage of development where 
it could be used to solve simple equations involving unknown quantities. The equations 
were given a geometric interpretation and solved by constructive methods; the answers 
to these constructions were line segments whose lengths corresponded to the unknown 
values. The linear equation ax = bc, for example, was viewed as an equality between 
areas ax and bc. Consequently, the Greeks would solve this equation by rst constructing 
a rectangle ABCD with sides AB = b and BC = c and then laying off AE = a on the 
extension of AB. One produces the line segment ED through D to meet the extension 
of BC in a point F and completes the rectangle EBFH. It is clear that KH = CF is the 
desired quantity x, for the rectangle KDGH (or ax) is equal in area to the rectangle 
ABCD (or bc); this can be seen by removing equal small triangles from the equal large 
triangles EHF and EBF. 


E a A b B 

a 
D 

K Cc 

ax x 

H G F 


When it came to quadratic equations, Euclid reduced them to the geometric equiva- 
lent of one of the forms 


x(x +a) = Db’, x(x —a) =D’, x(a —x)= Db’, 


which were then solved by applying theorems on areas. He was not the rst to expound 
on this technique, for according to the Commentary of Proclus, “These things are ancient 
and the discovery of the Muse of the Pythagoreans.” 

The method of applying areas was fundamental in Euclid’s work, and this was, 
strictly speaking, not so much a case of applying an area as of constructing a gure. In 
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its simplest form, the process consists of constructing a rectangle of unknown height x 
so that its base lies on a given line segment AB, but in such a way that the area of the 
rectangle either exceeds a speci ed value R by the square x? or falls short of R by the 
square x7. 


Let us see how Euclid actually used this method. Proposition 5 of Book II of the 
Elements was designed to teach the solution of the quadratic equation 


x7 +b? =ax, a>2b. 


The procedure was disguised by the peculiar geometric garb in which the Greeks were 
forced to clothe their results. We are told, to a given line segment AB = a apply the 
rectangle AOFG of known area b* in such a way that it shall fall short (from the 
rectangle on the entire segment AB) by a square gure, say x7. In brief, this calls for 
constructing the gure herewith. 


b 2 2 


Suppose that the applied rectangle is erected on y as a base and the “de cient” square 
on x as a base; then the segment AB has length x + y =a, while the applied rectan- 
gle corresponds to xy = b*. (One should recognize this as an Old Babylonian algebra 
problem.) Furthermore, 


x? + b* = area ABLG = ax, 


so that this “application of area” is the geometric equivalent of solving the equation 
x? +b? = ax. 

How does one go about producing the square of area x* speci ed in the quadratic 
equation? The answer is to be found in Proposition 28 of Euclid’s Book VI, a construction 
proposition, which states: Given a straight line AB, construct along this line a rectangle 
equal to a given area b*, assuming that the rectangle falls short of AB by an amount 
lled out by another rectangle (or square). We are instructed to erect at P, the midpoint 
of line AB =a, a perpendicular PE equal in length to 5; then with E& as a center and 
radius a/2, we draw an arc cutting AB at the point Q. Then the line segment QB has 
length equal to the solution of the quadratic equation x? + 5? = ax. For it can be proved 
that (4Q)(OB) = (PE)*, and when QB is set equal to x, this amounts to the statement 
that (a — x)x = b?. 
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For geometric veri cation that (4Q)(OB) = (PE)’, construct a rectangle ABLG hav- 
ing width BL = QB and complete the squares PBDC and QBLF on PB and QB as sides. 
The diagram that Euclid used for this purpose is shown. From various theorems on areas, 
it can be seen that 


AOFG + HFKC = (APHG + POFH) + HFKC 


= PBLH+ FLDK + HFKC 
= (PB). 
—x 

AL P Q x B 

a A). 

G H aie 
a/2 

Cc K D 


Because the rectangle AOFG has area (AO)(OF) = (AO)(QB) and HFKC = (PO), we 
get 


(AQ)(QB) + (PO) = (PB). 


All of this is, of course, formulated in geometric language. As Euclid expressed it in 
Proposition 5 of Book II: If a straight line is cut into equal and unequal parts, the rectangle 
contained by the unequal parts of the whole together with the square on the straight line 
between the points of section is equal to the square on the half. 

All that is needed to complete the argument is an appeal to the Pythagorean theorem. 
This leads directly to 


(AQ)(QB) = (PBY — (POY = (PE), 

or with the appropriate substitutions, (a — x)x = b?. The conclusion: AB = a has been 
divided into two segments AQ and QB, and the length of the segment QB is the number 
x for which x? + b? = ax. 

In the same spirit, Proposition 6 of Book II enables one to solve the quadratic 
equation 

x? + ax =’, 

or written another way, the equation (x + a)x = b*. The method of solution by appli- 
cation of areas would be to say: To a given line segment AB = a, apply the rectangle 
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AQKF of known area b* in such a way that it will exceed (the rectangle on the whole 
segment AB) by a square gure, say x7. This requires constructing a gure as shown. If 


the applied rectangle is erected upon the segment AQ = y as a base, then 


y-x=a, xy = Bb’. 


ax x 


~- 


y 


What Euclid wanted to teach is nothing more than the geometric solution of another 


Babylonian problem. 


To get the rectangle AOKF, which is equal in area to b* and has one side con- 
taining the line AB, we use a construction Euclid described in his sixth book (Proposi- 
tion 29). At the endpoint B of AB = a, erect a perpendicular BE equal in length to b; 


then with 


a/2 xt 


the midpoint P of AB as center and radius PE, draw an arc cutting the extension of AB 
at the point QO. We maintain that the rectangle with sides AO and BQ will be equal to 


the square on BE; that is 
(AQ)(BQ) = (BE). 


The diagram Euclid provided for a demonstration is as shown, where PODC and BOKH 


are squares described on PO and BQ, respectively. 


Regarding areas, it is evident that 
AQKF + GHLC = (APGF + POKG) + GHLC 
= HKLD + POKG + GHLC 
= (PQ)’. 
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Because the rectangle AOKF has an area equal to (AQ)(OK) = (4Q)(BQ) and GHLC is 
a square of side PB, the foregoing equation can be expressed as 


(AQ)(BQ) + (PB) = (PO)’. 


Euclid translated all this into ponderous geometric verbiage in Proposition 6: If a straight 
line is bisected and produced to any point, then the rectangle contained by the whole 
(with the added straight line) together with the square on half the line bisected is equal 
to the square on the straight line made up of the half and the part added. 

At this point, the Pythagorean theorem comes to the rescue again, for the last-written 
equation reduces to 


(4Q)(BQ) = (POY — (PBY = (BEY = b’. 


We have only to put AB = a and BO = x to see that the length of the segment BQ is 
the value required to satisfy the equation 


(x +a)x =B’. 


The special case for which a = b provides us with the opportunity to introduce what 
the celebrated astronomer Johannes Kepler called “one of the two Jewels of Geometry” 
(the second is the theorem of Pythagoras). For the construction used in solving the 
quadratic (x + a)x = a* amounts to dividing a given line segment AB into what is called 
the “golden section.” Translated into mathematical language, the golden section means 
that the segment AB = a is cut at a point C so that the whole segment is in the same 
ratio to the larger part CB = x as CB is to the other part, AC = a — x. Stated otherwise, 
it produces the relation 

a x 


-= ; x>a-—x. 
x a-x 


This, in turn, leads to the quadratic equation x(x + a) = a” already mentioned, the pos- 
itive root of which is 


x = fa(J/5 — 1). 


When a = 1, the value x = 5(/5 — 1) is the reciprocal of the “golden ratio”—that is, 
0.6180339.... 

Let us review Euclid’s construction for the golden section of a line segment AB = a. 
At the endpoint B of AB, erect a perpendicular BE equal in length to a; with the midpoint 
P of AB as center and radius PE, draw an arc cutting the extension of AB at the point 
Q. Take B as center and radius BQ, and draw an arc meeting AB at C. The point C 
divides the segment AB in the ratio sought. 
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Construction of the Regular Pentagon 


Much of the history of classical mathematics could be written around the idea of 
the golden section. It appears again in Book IV of the Elements with regard to the 
inscription (with the two traditional instruments, straightedge and compass) of certain 
regular polygons in a circle. You may recall that a regular polygon is a convex polygon 
with all its sides equal in length and with equal angles at each vertex. When a regular 
polygon of 7 sides is inscribed in a circle, the central angle formed by the radii drawn 
to two consecutive vertices has measure 360°/n. The Greeks were able to solve the 
problem of inscribing in a circle a regular polygon of an assigned number of sides when 
the number was 3, 4, 5, 6, 15, or twice the number of any inscribable polygon. The rst 
case in which they failed concerned a regular polygon of 7 sides. 

The construction of a regular pentagon (polygon of 5 sides), the division of a circle 
into 5 equal parts and the construction of an angle equal to 360°/5 = 72° are equivalent 
problems. The solution is taught in Propositions 10 and 11 of Book IV; Euclid relied on 
forming an isosceles triangle having each of the base angles equal to twice the remaining 
angle. This made the summit angle 36° and each of the angles at the base equal to 72°, 
thereby permitting the construction of both the regular pentagon and regular decagon 
(polygon of 10 sides). 


72° 72° 


In following the Greek method for constructing regular polygons of 5 and 10 sides, 
one would proceed as follows. Pick an arbitrary line segment AB = a for the radius of a 
circle and solve the quadratic equation x(x + a) = a” to get a line segment whose length 
isx = sa(/5 — 1). This is equivalent to cutting AB in golden section by a point C and 
letting x = AC. As we shall presently see, x will be the side of an inscribed decagon, or 
what amounts to the same thing, x can be stepped off as a chord in the circle of radius 
AB =a exactly 10 times. To conrm this, let us construct the isosceles triangle ABD 
having as its sides two radii AB = AD of the circle, and as its base BD a segment of 
length x. Also lay off the segment CD. By virtue of the condition a/x = x/(a — x), we 
have 


AB AC 
AC’ CB 
or, since AB = AD and AC = DB, 


AD _ DB 
DB CB 
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Another point to notice is that ADB = ABD, as base angles of an isosceles triangle; 
these are marked w in the gure. The upshot of all this is that triangles ADB and CBD 
are similar, for they have two pairs of corresponding sides proportional and the included 
angles equal. Then DAB equals ZCDB, because these are corresponding angles in similar 
triangles (they are marked 8). A little calculating with angles tells us that DCB = ZDBC, 
whence CD = DB = x. Indeed, more is true: ZADC equals /BAD, since each is a base 
angle of the isosceles triangle ACD. 

With the routine work out of the way, we are now ready to sum up. Because the 
sum of the angles of triangle DAB must equal two right angles, it can be concluded that 


180° = ZDAB + ZADB + LDBA 


=B+a+a 
= B+2B + 28, 
and as a result, 
180° 
= = 36° 
5 


Segment BD subtends a central angle of 36°, so it is the side of a regular inscribed 
decagon and will go 10 times as a chord within the circle of radius AB. The regular 
pentagon is drawn by selecting every other point as a vertex. 


The regular pentagon had a particular appeal to the early Pythagoreans, because its 
diagonals formed the star pentagram, the sign of recognition of the society. Although it is 
highly likely that Euclid’s method of constructing a pentagon was known to Pythagoras or 
his immediate disciples, no statement about the extent of their mathematical knowledge 
can be other than tentative. What is known is that Proclus, whose works inform us 
concerning the history of Greek geometry, wrote that Eudoxus (circa 370 B.C.) greatly 
added to the number of theorems that Plato originated concerning the “section,” meaning 
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the golden section. This is the rst reference we have of the name for such a division of 
a line segment. 

Having taken the opportunity to digress a little, let us now return to the main theme 
of the section. We saw earlier how the realization that certain geometric magnitudes are 
not expressible by whole numbers shook the foundations of the Pythagorean doctrine, 
which maintained that “everything is number.” It meant that such a simple equation 
as x* = 2 had no solution in their domain of (rational) numbers. The dilemma was 
resolved by putting algebra in a geometric dress; numbers came to be represented by 
line segments and geometric constructions were substituted for algebraic operations, so 
that products, for instance, corresponded to rectangular areas. Once the greater applica- 
bility of geometry was realized, geometric argument became the basis for all rigorous 
mathematics. 

The geometric algebra, a theory of line segments and areas, of Book II of Euclid’s 
Elements was the culmination of the Greek attempt to cope with the irrational through 
geometry. The book consists of propositions that appear on the surface to belong to 
geometry, but have content that is entirely algebraic. In particular, the treatment of 
quadratic problems is reduced to one of the equations 


x(x +a) =D’, x(a —x)=b’, x(x — a) = 5’, 


which are then solved geometrically by means of “application of area,” so that the 
roots appear as line segments. Although the individual solutions by area are awkward, 
involving as they do intricate constructions of plane gures, they follow exactly the 
same pattern as the earlier Babylonian algebraic calculations. The geometric algebra of 
the Elements is nothing more than a transposition of an inherited body of Babylonian 
procedures to geometric form. The chief difference is that, where Babylonian calculations 
only give a solution to quadratic equations if the square root can be found exactly 
(otherwise, a convenient approximation is accepted), Greek geometric algebra always 
gives an answer—a line segment is produced that may very well represent an irrational 
number. 

By embodying all mathematics except the theory of whole numbers in geometry, the 
Greeks swept the dif culties of the irrational under the rug, so to speak. The cumber- 
some techniques of geometric algebra allowed the Greeks to solve quadratic equations, 
but without assuming the existence of irrational numbers. This essentially alien garb, 
with all its clumsy verbiage and overwhelming diagrams, retarded progress in algebra 
for many centuries. For although linear and quadratic equations can be expressed clearly 
in the language of geometric algebra, higher-degree equations are effectively precluded 
from consideration. It is paradoxical that a religious controversy in the minds of the 
Pythagoreans, the worshipers of mathematics, should have had such a profoundly dele- 
terious effect on its growth. 

Greek geometric algebra had to await a translation into a formal symbolic language 
before a satisfactory divorce of algebraic calculation from geometry could take place. 
Historically, the systematic attempt to “symbolize” arithmetic and algebra operations 
is a relatively recent phenomenon, the decisive contribution of sixteenth-century 
mathematics. By the 1500s, negative rational numbers and zero were in regular use in 
practical calculations, but mathematicians still lacked a clear conception of irrational 
numbers. The German algebraist Michael Stifel (1486-1567), for instance, in his 
Arithmetica Integra of 1544, argued: 


168 


Chapter 4 The Alexandrian School: Euclid 


We are moved and compelled to assert that they truly are numbers, compelled that is, by 
the results which follow from their use. On the other hand ... just as an in nite number is 
not a number, so an irrational number is not a true number, but lies hidden in some sort of 
cloud of in nity. 


Doubt about the soundness of irrational numbers was expressed in the stigma numerus 
surdus (“inaudible number’), the phrase coming from the word surdus, “deaf or mute”— 
a Latin translation of an Arabic translation of the Greek alogos (“irrational number’’). 
Mathematicians such as Stifel pragmatically manipulated irrational numbers uncritically, 
without seriously questioning their precise meaning or nature, until the late 1800s. Then 
the question of the logical structure of the real number system was faced squarely. 
In an epoch-making essay entitled Continuity and Irrational Numbers (1872), Richard 
Dedekind nally established the theory of irrational numbers on a logical foundation, 
free from the extraneous in uence of geometry. 


A 
4.2 Problems 
Problems 1—10 contain propositions from Book I of Eu- Cc ~ 
clid’s Elements. In each instance, prove the indicated result. B 
1. Proposition 6. If two angles of a triangle are 4. Proposition 18. If one side of a triangle is greater than a 
congruent with one another, then the sides opposite second side, then the angle opposite the rst is greater 
these angles will also be congruent. [Hint: Let ABC than the angle opposite the second. [Hint: In AABC, for 
be a triangle in which CAB = ZCBA. If AC BC, AC > AB, choose a point D on AC such that AD = AB; 
say, AC > BC, then choose a point D on AC such use the fact that ZADB is an exterior angle of ABCD.] 
that AD = BC] 
A 
Cc D 
D 
B Cc 
A B 5. Proposition 26. Two triangles are congruent if they have 
one side and two adjacent angles of one congruent with a 
2. Proposition 15. If two lines cut one another, then Sid antl mo adjacent aneles ne ie Oiler, aak Tet 


3: 


AABC and ADEF be such that ZB = ZE, ZC = ZF, and 
BC = EF. If AB # DE, say AB > DE, choose a point G 
on AB for which BG = ED.] 


they make vertical angles that are equal. [Hint: 
Appeal to Proposition 13, which says that if a ray is 
drawn from a point on a line, then the sum of the 
pair of supplementary angles formed is equal to two 


A 
right angles. ] G 
B Cc 
P “ B B — Cc 
D 


A D 


Proposition 17. In a triangle, the sum of any two 
angles is less than two right angles. [Hint: In AABC, 
extend segment BC to a point D and use the exterior 
angle theorem. ] 
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Proposition 28. Two lines intersected by a third line 
are parallel if the sum of the two interior angles on 

the same side of the transversal is equal to two right 
angles. [Hint: In the gure for this problem, 

a+ B = 180°. Use Proposition 13.] 


t 
P 
a 
B 
Q 


Proposition 33. If two opposite sides of a 
quadrilateral are equal and parallel, then the other 
two sides are also equal and parallel (hence, the 
quadrilateral is a parallelogram). [Hint: In the 
quadrilateral shown, let AB = DC and assume that 
AB and DC are parallel. Show that AABC is 
congruent with AADC. ] 


A B 
D } Cc 
Proposition 35. Two parallelograms that have the 


same base and lie between the same parallel lines are 
equal in area to one another. 


A D__E F 
B C 


[Hint: In the gure, let ABCD and BCFE be 
parallelograms and let AD and EF lie on a line 
parallel to BC. Show that AABE is congruent with 
ADCF.] 


Proposition 37. Two triangles that have the same 
base and lie between the same parallel lines are equal 
in area to one another. 


E A D F 
= — a ae ee, 
NN 
NN 
XN 7 
B Cc 


[Hint: In the gure, let ABC and DBC be triangles 
such that AD is parallel to BC. Consider the 
parallelograms EBCA and FCBD.] 


Proposition 41. \f a parallelogram and a triangle have 
the same base and lie between the same parallel 


11. 
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lines, then [the area of] the parallelogram is double 
[the area of] the triangle. 


[Hint: In the gure, let ABCD be a parallelogram and 
EBC be a triangle, with AD and E on a line parallel 
to BC. Consider the triangles ABC and EBC.] 


In Mathematical Collection, Pappus (circa 320) gave 
the following generalization of the Pythagorean 
theorem, which applies to all triangles, whether right 
triangles or not. 


H 
“~@ 


Let ABC be any triangle and ABDE and ACFG be 
arbitrary parallelograms described externally on AB 
and AC. Suppose that DE and FG intersect at the 
point H when extended, and draw BL equal and 
parallel to HA. Then (in area) 


BLMC = ABDE + ACFG. 


Prove Pappus’s theorem. [Hint: First extend HA, BL, 
and MC until they meet LM, DE, and FG, 
respectively. Now, apply Proposition 35 to the 
parallelograms ABDE and ABKH, and also to ACFG 
and ACJH.] 


The Greeks constructed a line segment of length ./n, 
where n is a positive integer, as follows. First write n 
as n-1; then make AB =n and BC= 1. Drawa 
semicircle on AC as diameter. Erect BD 
perpendicular to AC at B, meeting the semicircle at 
the point D. By similar triangles, prove that the 
length of BD equals ./n. 
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Although Euclid’s great work is entitled Ele- 


Euclid’s Number Theory ments of Geometry, its subject matter extends 


far beyond what we would now regard as 
Euclidean Divisibility Properties high school geometry. Three of the books 

of the Elements (namely, VU, VIII, and IX), 
containing a total of 102 propositions, are devoted to arithmetic in the Greek sense. That 
is to say, they deal mainly with the nature and properties of what are called the “natural 
numbers” or the “positive integers.” Euclid was building on earlier foundations, because 
much of the substance of these arithmetical books can be traced to the Pythagoreans. 
Again he must be accorded the credit of having imposed a logical order on the whole. 
Many of the results had been long known but not always rigorously proved. Any earlier 
works on the theory of numbers that may have been written are no longer extant, so that 
it is impossible to say which proofs were supplied to Euclid and which were his own 
discoveries. 

Euclid was particularly interested in questions pertaining to divisibility, and he prop- 
erly emphasized the function of the prime numbers. In Book IX, the last of the books 
on number theory, many signi cant theorems can be found. Of these the most celebrated 
is Proposition 20, which reads, “Prime numbers are more than any assigned multitude 
of prime number.” What we have here is the famous assertion that there are in nitely 
many primes. Proposition 14 contains the essence of what today is called the funda- 
mental theorem of arithmetic—any integer greater than | can be written as a product of 
primes in exactly one way. Proposition 35 gives a derivation of the formula for nding 
the sum of numbers in geometric progression; and the following, and last, proposition in 
Book IX establishes a criterion for forming “perfect numbers” (the nomenclature is no 
doubt Pythagorean). 

As Euclid possessed no algebraic symbolism, he was forced to represent arbitrary 
numbers by line segments marked by one letter, or by two letters placed at the ends of 
the segment. His proofs, which were given in a verbal form, as opposed to the modern 
symbolic form, did not make use of geometry. In Books VII, VII, and IX, no geometrical 
gures were used for indeed none were necessary. Although Euclid may have adopted 
the language “plane numbers” and “solid numbers” to refer to products of two and three 
numbers, these were represented throughout the text not by rectangles or volumes but 
by segments. 

Book VII begins with a variety of de nitions that serve all three arithmetical books, 
including those of prime and composite numbers. Where Euclid phrased these in terms 
of line segments, we shall use modern notation and wording. 


Definition 


An integer b is said to be divisible by an integer a # 0, in symbols a | b, if there exists 
some integer c such that b = ac. One writes afb to indicate that b is not divisible by a. 


Thus, 39 is divisible by 13, since 39 = 13 - 3. However, 10 is not divisible by 3; for 
there is no integer c that makes the statement 10 = 3c true. 

There is other language for expressing the divisibility relation a|b. We might say 
that a divides b, a is a divisor of b, that a is a factor of b, or that b is a multiple of a. 
Notice too that in the de nition given there is a restriction on the divisor a; whenever the 
notation a|b is used, the understanding is that a is different from zero. Because Euclid 
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Liber J. 


lem , fub aqualibus veétis tineis contentum , &> ba- 
fin BG bef EF aqualem babebunt 3 eritque tri- 
angulum BAG triangulo EDF aquale , ac reli- 
qui anguliB, C veliquis angulis BE, FE aquales 
(rant, wtergque utyi we, feb quibus aqualia’ latera 
fubtenduntur. 

Si pun&éum D puné&o A applicetur, &-reéta 
DE rete AB {uperponatur, cadet punum E 
in B, quia DE? = AB. Item recta DF cadet 
in AC, quiaang. A *#= D. Quineriam pun- 
dum E punéto C coincidet, quia AC 7 = DF. 
Ergo rectz EF, BC, cim coldem habeant ter- 
minos, > congruent, & proinde axquales funt. 
Quare triangula BAC, BDF; & anguli B, E; 
itémg; anguli C, F- etiam congruunt, & a- 


quantur. Quod erat Demonftrandum. 


Prop. Vi: 


A Tfofcelium triangulorzam ABC 
qui ad bafim funt anuli ABC, 
ACB Sater fe funt aquates? Ec 
produttis aqualsbus veths lines 
B G ABA C qui fab bife funt an- 
gus CBD, BCE inter fe «- 
quales evant. 
D b *Accipe AF = AD, & a3 1, 
E junge CD, ac BF, bu. p i. 
Quoniam in :triangulis © byp- 
ACD, ABF,funt AB‘ = AC,& AE 4= AD, 4 conftr. 
angulusq; A communis,terit ang. ABFox ACD; © * © 
& ang. AFB*«= ADC, & bas. BFe= DC; 
nem FC! = DB. ergé in triangulis B F C, f 3 «x. 
BDC Seritang. FCB, = DBC. Q.E D. Irem 8 4 # 
ideo ang. FBC = DCB. atqui ang. ABF § = } pr. 
ACD. ergo ang. ABC *= ACB. QeE.D. Kh 
Corollayium. 
Hinc, Onne triangulum zquilaserum eft 
gucq; zqudangulum. 
Prop. 


Euclid’s proof that the base angles of an isosceles triangle are equal. From Isaac 
Barrow’s edition of Euclid’s Elements (1665). (From An Introduction to the History of 
Mathematics, 6/E, copyright ©1990 by Saunders College Publishing, a division of Holt, Rinehart and 
Winston, Inc., reprinted by permission of the publisher.) 


always represented numbers by line segments, he did not use the phrases “is a divisor 
of” or “is a multiple of.” Instead, he replaced these by “measures” and “is measured by,” 
respectively. For Euclid, a number 6 was measured by another number a if b = ac for 
some third number c. 

Euclid, in representing numbers by line segments, would never have considered a 
negative number. But in the modern view, the divisors of an integer always occur in 
pairs. If a is a divisor of b, then so is —a; indeed, b = ac implies that b = (—a)(—c). 
To nd all the divisors of a given integer, it sufces to obtain the positive divisors and 
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then adjoin to them their negatives. For this reason we shall usually limit ourselves—as 
Euclid did for his own reasons—to positive divisors. 

It will be helpful to list a number of simple facts involving the concept of divisor. 
For integers a, b, and c, the following hold: 


a|0, lla, ala. 

a|1 if and only if a = +1. 

If a|b and c|d, then ac| bd. 

If a|b and b|c, then a|c. 

a|b and b|a if and only if a = +6. 


Se oN 


If a|b and a|c, then a|(bx + cy) for any integers x and y. 


We shall establish assertion 6, leaving the veri cation of the other parts as an exercise. 
Now the relations a | b and a|c ensure that there exist integers r and s satisfying b = ar 
and c = as. But then 


bx + cy =arx +asy =a(rx + sy), 


whatever the choice of x and y. Because rx + sy is itself an integer, the last-written 
equation says simply that a|(bx + cy), as desired. 

It is convenient to call an expression of the form bx + cy, where x and y are integers, 
a linear combination of b and c. Note that b + c and b — c are both linear combinations of 
b and c (in the rst instance take x = y = 1; in the second let x = 1, y = —1). Hence, as 
a special case of assertion 6, we see that if a|b and a|c, then a|(6+c) and a|(b—c). 

Classifying positive integers greater than 1 as either prime or composite is very 
important in number theory; because of the fundamental theorem of arithmetic, many 
properties of integers can be deduced from properties of primes. Fact assertion | tells us 
that any integer a > | is divisible to +1 and by +a, divisors that are frequently named 
improper divisors. If they exhaust the divisors of a, then a is said to be a prime number. 
Put somewhat differently we have this de nition. 


Definition 


An integer p > 1 is called a prime number, or simply a prime, if its only positive 
divisors are | and p. An integer that is greater than | and not a prime is termed 
composite. 


Among the rst 10 positive integers, 2, 3, 5, and 7 are all primes, whereas 4, 6, 8, 
9, and 10 are composite numbers. Note that the integer 2 is the only even prime, and 
according to our de nition, the number | is distinguished in the sense of being neither 
prime nor composite. To illustrate Euclid’s language, let us record his way of de ning a 
prime: “A prime number is that which is measured by a unit (that is, by 1) alone.” 

It is often of interest to nd out whether two given numbers have any factors in 
common, and if so which ones. 


Definition 


Ifa and b are arbitrary integers, then an integer d is said to be a common divisor of 
a and b if we have both d|a and d |b. 
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Because | divides each integer, | is a common divisor of a and b. Hence, any pair 
of integers possesses at least one positive common divisor. In fact, if either a and b is 
nonzero, then a_ nite number of positive common divisors exist. Among these, there is 
one that is the largest, called the greatest common divisor of a and b, and denoted by the 
symbol gcd (a, 5). 


Example. The positive divisors of 12 are 1, 2, 3, 4, 6, and 12, and the positive 
divisors of 30 are 1, 2, 3, 5, 6, 10, 15, and 30; hence, the positive common divisors 
of 12 and 30 are 1, 2, 3, and 6. Because 6 is the largest of these integers, it follows 
that ged (12, 30) = 6. 


The Algorithm of Euclid 


To obtain the greatest common divisor of two integers, we could always proceed as 
in the last example by listing all their positive divisors and picking out the largest one 
common to each; but this is cumbersome for large numbers. A more ef cient process is 
given early in the seventh book of the Elements. Although there is historical evidence 
that this method predates Euclid by at least a century, it today goes under the name 
“Euclidean algorithm.” 

Euclid’s procedure relies on a result so basic that it is often taken for granted: the 
division theorem. Roughly, the theorem asserts that an integer a can be divided by a 
positive integer b in such a way that the remainder is smaller than b. An exact statement 
of this fact follows. 


For integers a and b, with b > 0, there exist unique integers q andr satisfying 


a=qb+r, O0<r <b. 


So 


The integers g and r are called the quotient and the remainder in the division of a by 
b. We accept the division theorem without proof, noting that b is a divisor of a if and 
only if the remainder r in the division of a by 5 is zero. 

In examining the division theorem, let us take b = 7. Then, for the choices a = 1, 
—2, 28, and —59, one gets the representations 


t= 0-74 
—2=(-1)-7+5 
28=4-7+0 


—59 = (-9)-744. 


The aim is to focus attention not so much on the division theorem as on its use in 
nding greatest common divisors. To this end, let a and 4 be two integers whose greatest 
common divisor is desired; there is no harm in assuming that a > b> 0. The rst step 
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is to apply the division theorem to a and 5, to get 
a=qb+n, 0<r, <b. 


If it happens that r; = 0, then b| a, and since b| 6 also, ged (a, b) = b. If r; 4 0, divide 
b by r; to produce integers gz and r2 satisfying 


b= qr +r, 0<1r <?r}. 
If r2 = 0, then we stop; otherwise, we go on as before, dividing 7; by r2, to obtain 
r= Qro +P, O75 are. 


This division continues until some zero remainder appears, say at the (7 + 1)st stage, 
at which r,_; is divided by r,. A zero remainder must occur sooner or later, since the 
decreasing sequence b > r, > r2 > --- > 0 cannot contain more than 5 integers. 

The result is the following system of equations: 


a=qb+n, 0 <r, <b, 
b=qrnt+r, 0<m <n, 
ry) = @r2 +P, 0<1r3 <r, 


ln—3 = Qn-1ln 2+Tn-1, 0 <ry-1 <Vn-2,; 


Yn—2 = Qn’n-1 +P ns Oy S71, 
Tn—-1 = n+iln +0. 


We argue that r,,, the last nonzero remainder that appears in this algorithm, is equal to 
gcd (a,b). Now ry |rp—1 by the last equation of the above system. From the equation 
immediately preceding, it follows that r, |7r,—-2; for r,-2 is a linear combination of r, 
and r,—1, both of which are divisible by r,. Working backward through these equations, 
we nd that, divides each of the preceding remainders r;. Finally r, |b, and from the 
rst equation a = q,;b +r, we get r, |a. Therefore, 7, is a positive common divisor of 
a and b. 

Next, suppose that d is an arbitrary positive common divisor of a and b. The rst 
of the equations tells us that d|r,. It is clear, in going down the list of the equations, 
that d divides r2,r3,... and ultimately r, also. But d|r,, with d and r, both positive 
integers, implies that d <r,. In consequence, r, is the largest of the positive common 
divisors of a and b; that is, ged (a, b) = ry. 

There is another important point that deserves mention. Namely, gcd (a,b) can 
always be expressed as a linear combination of the integers a and b. To verify this, we 
fall back on the Euclidean algorithm. Starting with the next-to-last equation arising from 
the algorithm, we write 7, as 


ln =Vn—-2 — Qnln-1; 


a linear combination of 7,,_; and r,_2. Now solve the preceding equation in the algorithm 
for r,—; and substitute to 


Ph =Tln-2 — An(¥n—3 _ An—1'n-2) 


= a a nQn—1 Wn—-2 + (—Gn n-3- 
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This eliminates r,_; and represents r,, as a linear combination of 7,2 and r,_3. Continu- 
ing backward through the system of equations, we successively eliminate the remainders 
ln—1s Vn—2,+++>%2,171 until a stage is reached at which r, = gcd (a, b) is expressed as a 
linear combination of a and b. 

To summarize, what we have obtained is the following: 


For integers a and b, of which both are not zero, there exist integers x and y such that 
ged (a, b) = ax + by. 


| 


Example. Let us see how the Euclidean algorithm works in a concrete case by 
calculating, say, gcd (12,378, 3054). The appropriate applications of the division 
algorithm produce the equations 


12,378 = 4- 3054+ 162 
3054 = 18- 162 + 138 
162 = 1- 138+ 24 
138 =5-24+4 18 
24=1-18+46 
18=3-6+0. 


Our previous discussion tells us that the last nonzero remainder appearing above, 
namely the integer 6, is the greatest common divisor of 12,378 and 3054: 


6 = gcd (12,378, 3054). 


To represent 6 as a linear combination of the integers 12,378 and 3054, we start with 
the next-to-last of the displayed equations and successively eliminate the remainders 
18, 24, 138, and 162: 


6 = 24-18 
= 24 — (138 — 5 - 24) 
= 6-24 — 138 
= 6(162 — 138) — 138 
= 6-162—7- 138 
= 6- 162 — 7(3054 — 18 - 162) 
= 132-162 —7- 3054 
= 132(12,378 — 4- 3054) — 7 - 3054 
= 132- 12,378 + (—535)3054. 
Thus, we have 
6 = gcd (12,378, 3054) = 12,378x + 3054y, 


where x = 132 and y = —535. It might be well to record that this is not the only way 
to express the integer 6 as a linear combination of 12,378 and 3054. Among other 
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possibilities, one could add and subtract 3054 - 12,378 to get 
6 = (132 + 3054)12,378 + (—535 — 12,378)3054 
= 3186- 12,378 + (—12,913)3054. 


It may happen that | and —1 are the only common divisors of a given pair of 
integers, whence gcd (a, b) = 1. For example, 


gcd (2, 5) = ged (9, 16) = ged (27, 35) = 1. 


This situation occurs often enough to prompt a de nition: 


Definition 


Two integers a and b are said to be relatively prime, or prime to each other, whenever 
gcd (a, b) = 1. 


We should emphasize that it is possible for a pair of integers to be relatively prime 
without either integer being a prime. On the other hand, if p is a prime number, then 
gcd (a, p) = | if and only if p/a. This is true because the only positive divisors of p 
are | and p itself, so that either gcd (a, p) = 1 or ged (a, p) = p. The latter case holds 
provided that pla. 

The next theorem describes relatively prime integers in terms of linear combinations. 


Let a and b be integers, of which both are not zero. Then a and b are relatively prime if 
and only if there exist integers x and y such that 1 = ax + by. 


Proof. \f a and b are relatively prime, so that ged (a, b) = 1, then our last theorem 
guarantees the existence of integers x and y satisfying 1 = ax + by. Conversely, 
suppose that | = ax + by for some choices of x and y, and that d = 

gcd (a, b). Because dja and d|b, we must have d|(ax + by) or d|1. Because d is a 
positive integer, this last divisibility condition forces d = 1, and the desired 
conclusion follows. 


| 


This result leads to an observation that is useful in certain situations. 


If gcd (a, b) = d, then gcd (a/d, b/d) = 1. 


Proof. Before starting with the proof proper, we should observe that although a/d 
and b/d have the appearance of fractions, they are in fact integers, since d is a 
divisor of both a and b. Because gcd (a, b) = d, it is possible to nd integers x and 
y such that d = ax + by. On dividing both sides of this equation by d, one obtains 
the expression 


| = (a/d)x + (b/d)y. 


Because a/d and b/d are integers, an appeal to the theorem is legitimate. The 
conclusion is that a/d and b/d are relatively prime. 


| 
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In illustration of the corollary, we observe that gcd (12, 30) = 6 and 
gcd (12/6, 30/6) = ged (2,5) = 1, 
as expected. 
It is not true, without imposing an extra condition, that a|c and b|c together yield 


ab|c. For instance, 6|24 and 8|24, but clearly 6 - 8/24. Were 6 and 8 relatively prime, of 
course, the situation would be altered. This brings us to another corollary. 


COROLLARY2 If a\c and b\|c, with gcd (a, b) = 1, then ab\c. 


Proof. Because a|c and b|c, there exist integers r and s for which c = ar = bs. 
Also, the condition gcd (a, b) = 1 allows us to write 1 = ax + by for suitable 
choices of integers x and y. If this last equation is multiplied by c, it appears that 


c=c-l=c(ax + by) =acx + bey. 
If the appropriate substitutions are now made on the right-hand side, then 
c = a(bs)x + b(ar)y = ab(sx + ry) 


or as a divisibility statement, ab|c. 


ee 


Proposition 24 of Book VII of Euclid’s Elements seems mild enough, but it is 
fundamentally important in number theory. In modern notation, it may be stated as 
follows. 


EUCLID’S If a\be, with gcd (a, b) = 1, then ale. 

LEMMA 
Proof. We start again by writing | = ax + by, where x and y are integers. 
Multiplication of this equation by c produces 


c=1-c=(ax + by)c = acx + bey. 


Because alac and a|bc, it follows that a|(acx + bcy), which may be restated as alc. 


CO 


If a and b are not relatively prime, then the conclusion of Euclid’s lemma may fail 
to hold. A specic example: 12|9- 8, but 12/9 and 12/8. 


The Fundamental Theorem of Arithmetic 


The fundamental theorem of arithmetic, otherwise known as the “unique factorization 
theorem,” asserts that any integer greater than | can be represented as a product of primes, 
and that the product is unique apart from the order in which the factors appear. Although 
this theorem is sometimes attributed to Euclid, it apparently was not expressly stated 
before 1801, when Gauss featured it in his Disquisitiones Arithmeticae. The nearest that 
Euclid himself came to this result was Proposition 14 of Book IX: “If a number be the 
least that is measured by prime numbers, it will not be measured by any other prime 
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number except those originally measuring it.” Some authorities argue that Euclid’s failure 
to “discover” the fundamental theorem stems from his inability to form products wherein 
the number of factors is unspeci ed. Others argue that the theorem asserts the existence 
of a certain representation, and that the Greeks could not conceive of the existence of 
anything that was not constructible by elementary geometry. 

Because every number either is a prime or, by the fundamental theorem, can be 
broken down into unique prime factors and no further, the primes serve as the “building 
blocks” from which all other integers can be made. Accordingly, the prime numbers 
have intrigued mathematicians through the ages, and although many remarkable theorems 
relating to their distribution in the sequence of positive integers have been proved, even 
more remarkable is what remains unproved. The open questions can be counted among 
the outstanding unsolved problems of all mathematics. 

To begin on a simple note, we observe that the prime 3 divides the integer 36. We 
may write 36 as the product 


6-6, or 9-4, or 12-3, or 18-2; 


and in each instance, 3 divides at least one of the factors involved in the product. This 
is typical of the general situation, and the precise result can be stated. 


If p is a prime and p|ab, then p|a or p\b. 


Proof. If p\a, then we need go no further, so let us assume that p/a. Since the only 
positive divisors of p (hence, the only candidates for the value of gcd (a, p)) are 1 
and p itself, this implies that gcd (a, p) = 1. Citing Euclid’s lemma, it follows 
immediately that p|b. 


This theorem extends to products with more than two factors. We state the result 
without proof. 


If p is a prime and p\ayaz+-+a,, then pla, for some k, where 1 <k <n. 


| 


Let us next show that any composite number is divisible by a prime (Proposition 31, 
Book VI). For a composite number n, there exists an integer d satisfying the conditions 
d|n and | < d <n. Among all such integers d, choose p to be the smallest. Then p must 
be a prime number. Otherwise, it too would possess a divisor g with | < q < p; but g|p 
and p|n imply that g|n, which contradicts our choice of p as the smallest divisor, not 
equal to 1, of n. Thus, there exists a prime p with p|n. 

With this preparation we arrive at the fundamental theorem of arithmetic. As indi- 
cated earlier, the theorem asserts that every integer larger than 1 can be factored into 
primes in essentially one way; the linguistic ambiguity “essentially” means that the rep- 
resentation 2 - 3 - 2 is not considered different from 2 - 2-3 as a factorization of 12. The 
precise formulation is given as follows. 
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FUNDAMENTAL Every positive integer n > | is either a prime or can be expressed as a product of 
THEOREM OF primes; this representation is unique, apart from the order in which the factors occur. 


ARITHMETIC 
Proof. Either n is a prime or it is composite. In the rst case there is nothing to 
prove. If 7 is composite, then there exists a prime divisor of 7, as we have shown. 
Thus, 7 may be written as n = pjn,, where p, is prime and | <n, <n. If n, 1s 
prime, then we have our representation. In the contrary case, the argument is 
repeated to produce a second prime number p2 such that n; = p2n2; that is, 


n= Pi p2N2, l<ny <n. 
If n2 is a prime, then it is not necessary to go further. Otherwise, write nz = p3n3, 
with p3 a prime; hence, 
N= Pi P2P3N3, 1<7n3 <n. 
The decreasing sequence 
n>n>ng>-::>1 


cannot continue inde nitely, so that after a nite number of steps n,; is a prime, say 
Px. This leads to the prime factorization 


n= P\P2°** Pk- 
The second part of the proof—the uniqueness of the prime factorization—is more 
dif cult. To this purpose let us suppose that the integer n can be represented as a 
product of primes in two ways; say, 
N= Pi P2°** Pr = G192°° "Ys, rss, 
where the p; and q; are all primes, written in increasing order, so that 
Pisps::'sSPr and GQ 2925 °°* 4s. 

Because p\|9192--- ds, we know that p;|q, for some value of k. Being a prime, qx 
has only two divisors, | and itself. Because p; is greater than 1, we must conclude 
that p; = qx; but then it must be that p; > q;. An entirely similar argument (starting 
with q; rather than p;) yields g; > pj, so that in fact py = q,. We can cancel this 
common factor and obtain 

P2P3°** Pr = 9293 °°" s- 
Now repeat the process to get p2 = qo; cancel again, to see that 

P3P4°°* Pr = 9394°°"s- 
Continue in this fashion. If the inequality r < s held, we should eventually arrive at 
the equation 

l= Gr+19r4+2°°° ss 

which is absurd, since each g; > 1. It follows that r = s and that 


Pi = 1; P2 = 92,-+-++) Pr = Ors 


making the two factorizations of n identical. The proof is now complete. 


a 
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Of course, several of the primes that appear in the factorization of a given integer 
n may be repeated (as is the case with 360 = 2.2-2-3.-3-5). By collecting the equal 
primes and replacing them by a single factor, we could write n in the so-called standard 


form 


ky ko k, 
n — Py Pr cae Py A 
where each k; is a positive integer and each p; is a prime with p; < po <--: < py. 
To illustrate: The standard form of the integer 360 is 360 = 2° - 3*- 5. Further ex- 
amples are 


4725 =3°-5*-7 ~~ and 17,640 = 2? .37.5.7°. 


We cannot resist giving another proof of the irrationality of 2, this time using the 
fundamental theorem of arithmetic. 


The number \/2 is irrational. 


Proof. Suppose to the contrary that /2 is a rational number, say, /2 = a/b, where 
a and b are both integers with gcd (a, b) = 1. Squaring, we get a* = 2b’, so that 
bla’. If b > 1, then the fundamental theorem guarantees the existence of a prime p 
such that p|b. From p|b and bla’, it follows that p|a?; but then pla, hence 

gcd (a, b) > p. We therefore arrive at a contradiction, unless b = 1. If this happens, 
then a? = 2, which is impossible (we assume you are willing to grant that no integer 
can be multiplied by itself to give 2). Our original supposition that V2 is a rational 
number is untenable; so it must be an irrational number. 


a 


An Infinity of Primes 


By this time, you are probably asking, Is there a prime number that is the largest, 
or do the primes go on forever? The answer is to be found in a very ingenious, yet quite 
simple, proof given by Euclid (Proposition 20, Book IX) in his Elements. In general 
terms, what he showed is that beyond each prime another and larger prime can be found. 
The actual details follow; the argument is Euclid’s, although the words and modern 
notation are not. 


There are an infinite number of primes. 
Proof. Write the primes 2,3, 5,7, 11,... in ascending order. For any particular 
prime p, consider the number 
Ne O36 5s 7115-4 py 1. 


That is, form the product of all the primes from 2 to p, and increase this product by 
one. Because N > 1, we can use the fundamental theorem to conclude that N is 
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divisible by some prime q. But none of the primes 2,3,5,..., p divides N. For if g 
were one of these primes, then on combining the relation g|2-3-5---p with q|n, 
we would get g|(N — 2-3-5---p), or what is the same thing, g|1. The only 
positive divisor of the integer | is 1 itself, and since g > 1, the contradiction is 
obvious. Consequently, there exists a new prime q larger than p. 


a 


Euclid’s proof demonstrates the existence of some prime larger than p; but we do 
not necessarily arrive at the very next prime after p when we use the method indicated 
by his proof. For example, this process yields 59 as a prime beyond 13: 


N=(2-3-5-7-11-13)+ 1 = 30,031 = 59 - 509 


Frequently, there are a great many primes between the prime p considered and the one 
obtained in the manner the proof suggests. 

How can we determine, given a particular integer, whether it is prime or composite, 
and if it is composite, how can we actually nd a nontrivial divisor? The most obvi- 
ous approach is successive division of the integer in question by each of the numbers 
preceding it; if none of them (except 1) serves as a divisor, then the integer must be a 
prime. Although this method is very simple, it cannot be regarded as useful in practice. 
For even if one is undaunted by large calculations, the amount of work involved may be 
prohibitive. 

Composite numbers have a property that enables us to reduce materially the necessary 
computations. If an integer a > 1 is composite, it can be written as a = bc, where 
1 <b <a and 1 <c <a. Assuming that b < c, we get b* < bc =a, and so b < Ja. 
Because b > 1, there is for b at least one prime factor p. Then p < b < ./a; furthermore, 
because p|b and bja, it follows that p|a. The point is simply this: A composite number 
a will always possess a prime divisor p satisfying p < ./a. 

In testing the primality of a specic integer a > 1, it therefore sufces to divide a 
by those primes not exceeding ./a (presuming, of course, the availability of a list of 
primes up to ./a). This can be claried by considering the integer a = 509. Because 
22 < /509 < 23, we need only try out the primes that are not larger than 22 as possible 
divisors, namely, the primes 2, 3, 5, 7, 11, 13, 17, and 19. Dividing 509 by each of these 
in turn, we nd that none serves as a divisor of 509. The conclusion is that 509 is a 
prime number. 


Example. The foregoing technique provides a practical means for determining the 
standard form of an integer, say a = 2093. Because 45 < 2093 < 46, it is enough 
to examine the primes 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, and 43. By trial, 
the rst of these to divide 2093 is 7, with 2093 = 7 - 299. As regards the integer 299, 
the seven primes less than 18 (note that 17 < /299 < 18 are 2, 3, 5, 7, 11, 13, and 
17. The rst prime divisor of 299 is 13, and carrying out the required division, we 
obtain 299 = 13 - 23. But 23 is itself a prime, whence 2093 has exactly three prime 
factors, namely 7, 13, and 23: 


2093 = 7-13-23. 
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1. 


10. 


Given integers a, b, and c, verify that 


(a) Ifal|b, then albc. 

(b) Ifal|b and alc, then a*|be. 

(c) ab if and only if ac|bc, provided c # 0. 
(d) Ifa\(a +5), then alb. 

(e) Ifa|b and cid, then aclbd. 


Show that if a|b, then (—a)|b, a|(—b), and 
(—a)|(—5). 

For any positive number n, it can be shown that there 
exists an even integer a that is representable as the 
sum of two odd primes in n different ways. Con rm 
that the integers 66, 96, and 108 can be written as 
the sum of two primes in six, seven, and eight ways, 
respectively. 


A conjecture of Lagrange (1775) asserts that every 

odd integer greater than 5 can be written as a sum 

p+2q, where p and g are both primes. Verify that 
this holds for all such odd integers through 75. 


Find an example to show that the following 
conjecture is not true: Every positive integer can be 
written in the form p+ a’, where p is a prime (or 
else equal to 1) and a > 0. 


Prove that the only prime of the form n? — 1 is 7. 
[Hint: Factor n3 — 1 as (n — 1)(n? +n + 1).] 


Find a set of four consecutive odd integers of which 
three are primes, and a set of ve consecutive odd 
integers of which four are primes. 


Although the answer is not known, it appears that 
each positive multiple of 6 can be written as the 
difference of two primes. Conrm this as far as 90. 


Consider the primes arranged in their natural order 
2,3,5,7,.... It is conjectured that beginning with 3, 
every other prime can be composed of the addition 
and subtraction of all smaller primes (and 1), each 
taken once. For example: 
3=1+2, T=1—2+43-45, 
13=14+2-3-54+7+411 
=—-14+2+34+5-7+11. 
Show that this also holds for 19, 29, 37, and 43. 
Establish each of these statements. 


(a) The square of any integer is of the form either 
4n or 4n + 1. 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
20. 


21. 


22. 


(b) The square of any odd integer is of the form 
8n + 1. [Hint: Any odd integer is of the form 
4k + 1 or 4k 4+3.] 

(c) The square of any integer not divisible by 2 or 
3 is of the form 12” + 1. [Hint: By the division 
theorem, an integer can be represented in one of 
the forms 6k, 6k + 1, 6k + 2, 6k + 3, 6k + 4, or 
6k + 5.] 


For any arbitrary integer a, show that 2|a(a + 1) and 
3\a(a + 1)\(a +2). 

Prove that if a is an integer not divisible by 3, then 
3\(a* — 1). 


Verify that the difference of two consecutive squares 
is never divisible by 2; that is, 2 does not divide 
(a+ 1) — a? for any choice of a. 


For a positive integer a, show that gcd (a, 0) =a, 
gcd (a, 1) = 1, and ged (a,a) =a. 


Find ged (143, 277), ged (136, 232), and 
ged (272, 1479). 


Use the Euclidean algorithm to obtain integers x and 
y satisfying: 


(a) ged (56, 72) = 56x + 72y. 

(b) gcd (24, 138) = 24x + 138y. 

(c) ged (119, 272) = 119x + 272y. 

(d) ged (1769, 2378) = 1769x + 2378y. 


Prove that any two consecutive integers are relatively 
prime, that is, ged (a,a +1) =1 for any integer a. 


Establish that the product of any three consecutive 
integers is divisible by 6, and the product of any four 
consecutive integers is divisible by 24. 


Given that p is a prime and pla”, show that p"|a”. 


(a) Find all prime numbers that divide 40! (recall 
that 40! = 1-2-3.-4---40). 

(b) Find the prime factorization of the integers 
1234; 10,140; and 36,000. 


(a) An unanswered question is whether there are 
in nite ly many primes that are 1 more than a 
power of 2, such as 5 = 27 + 1. Find two more 
of these primes. 

(b) It is equally uncertain whether there are 
in nite ly many primes that are | less than a 
power of 2, such as 3 = 27 — 1. Find four more 
of these primes. 


Prove that the only prime p for which 3p + 1 isa 
perfect square is p = 5. [Hint: If 3p + 1 =a’, then 
3p =a*—1=(a+1)(a—1)] 
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23. It has been conjectured that every even integer can 24. Determine whether the integer 701 is prime by 
be written as the difference of two consecutive testing all primes p < 701 as possible divisors. Do 
primes in an in nite number of ways. For example, the same for the integer 1009. 


4=11-—7=17-13=23-19 


25. Prove that is irrational for any prime p. 
=47-—43=131-127=.... VP uP : 


Express the integer 6 as the difference of two 26. Use the division theorem to show that every prime 
consecutive primes in 10 ways. except 2 and 3 is of the form 6n + 1 or 6n +5. 


Another Alexandrian mathematician whose work in 


4.4 Eratosthenes, the Wise Man number theory remains signicant is Eratosthenes 
of Alexandria (276-194 B.c.). Eratosthenes was born in Cyrene, 

a Greek colony just west of Egypt and under Ptole- 

The Sieve of Eratosthenes maic domination, but spent most of his working days 


in Alexandria. At some time during his early life he 
studied at Plato’s school in Athens. When about 30 years of age Eratosthenes was invited 
to Alexandria by King Ptolemy III to serve as tutor for his son and heir. Later, Eratos- 
thenes assumed the most prestigious position in the Hellenistic world, chief librarian at 
the Museum, a post he was to hold for the last 40 years of his life. It is reported that in 
old age he lost his sight, and unwilling to live when he was no longer able to read, he 
committed suicide by refusing to eat. 

Eratosthenes was acknowledged to be the foremost scholar of his day and was un- 
doubtedly one of the most learned men of antiquity. An author of extraordinary versatility, 
he wrote works (of which only some fragments and summaries remain) on geography, 
philosophy, history, astronomy, mathematics, and literary criticism; and he also com- 
posed poetry. Eratosthenes was given two nicknames that are signi cant in light of the 
prodigious range of his interests. In honor of his varied accomplishments, his friends 
called him Pentathis, a name applied to the champion in ve athletic events—hence, 
to men who tried their hands at everything. His detractors felt that in attempting too 
many specialities, Eratosthenes failed to surpass his contemporaries in any one of them. 
They dubbed him Beta (the second letter of the Greek alphabet), insinuating that while 
Eratosthenes stood at least second in all elds, he was rst in none. Perhaps a kinder 
explanation of this second nickname is that certain lecture halls in the Museum were 
marked with letters, and Eratosthenes was given the name of the room in which he 
taught. 

Although Eratosthenes could be regarded as among a second echelon in many en- 
deavors, he was certainly not beta in the elds of geography and mathematics. His 
three-volume Geographica, now lost except for fragments, was the rst scientic at- 
tempt to put geographical studies on a sound mathematical basis. In this work, he 
discussed the arguments for a spherical earth and described the position of various 
land masses in the known world. Eratosthenes’ actual mapping of the populated quar- 
ters of the earth was based on hearsay and speculation, but it was the most accu- 
rate map of the world that had yet appeared and the rst to use a grid of meridi- 
ans of longitude and parallels of latitude. He regarded the inhabited lands as placed 
wholly in the northern hemisphere, surrounded by a continuous body of ocean. Er- 
atosthenes made the rst suggestion for the circumnavigation of the globe when he 
observed: “If it were not for the vast extent of the Atlantic Sea one might sail from 
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The habitable world according to Eratosthenes. (From A Short History of Scientific Ideas 
by Charles Singer. Reproduced by permission of Oxford University Press.) 


Iberia (Spain) to India along one and the same parallel.” The vast amount of quan- 
titative data accumulated by Eratosthenes as head of the largest library of antiquity 
made his Geographica the prime authority for centuries; the longitude and latitude 
of 8000 places on earth were given, as well as many estimates of distances between 
locations. 

As a mathematician, Eratosthenes produced as his chief work a solution of the famous 
Delian problem of doubling the cube and the invention of a method for nding prime 
numbers. His mechanical contrivance for effecting duplication, called a mesolabium, 
or mean-nder, consisted of a rectangular framework along which three rectangular 
plates (marked with their diagonals) of height equal to the width of the frame slide 
in three grooves, moving independently of one another and able to overlap. Suppose 
that the original positions of the rectangular plates are shown as in the gure, where 
AP and FQ are the sides of the frame and ARGF, RSHG, and STIH are the plates that 
slide. 


F G Pog £ 


If the rst plate remains stationary while the second slides under the rst, and the 
third under the second, to a position in which the points A, B, C, and D 
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are brought into line, then the result looks like the preceding gure. Draw a straight line 
through the collinear points A, B, C, and D, meeting the side FQ at E. From the theory 
of similar triangles, we then obtain 


HE BE GE 


GE AE FE’ 
while 
BG GE CH HE 
AF FE “\ BG GE’ 
Tying the various relations together, we see that 
CH BG 
BG AF’ 
By similar reasoning, 
DI CH 
CH BG’ 


and so DI, CH, BG, and AF are in continued proportion. On setting DJ = a, AF = 2a, 
CH = x, and BG = y, we get 

aq x y 

xy 2a’ 
which makes apparent the conclusion that x and y are the required mean proportionals 
between the lengths a and 2a. Put another way: If a is the length of the edge of a given 
cube, the cube that has edge x will have volume double the original one’s. 

Eratosthenes was so pleased with his contrivance for solving the Delian problem that 
he had a monument erected to Ptolemy II on which the proof was inscribed, and he also 
caused the mean- nder to be cast in bronze. What could be more curious behavior—the 
best way Eratosthenes could think of to thank and atter the king was to dedicate the 
solution of an esoteric mathematical problem to him! Of course, any mechanical solution 
was not as “pure” as straightedge and compass constructions would be, and as such 
would be abhorrent to the principles of Plato. 

We have seen that if an integer a > 1 is not divisible by a prime p < J/a, then a 
itself is necessarily a prime. Eratosthenes used this fact as the basis of a clever technique, 
called the sieve of Eratosthenes, for nding all primes less than a given integer 7. 
The scheme calls for writing down the integers from 2 to n in their natural order and 
then systematically eliminating all the composite numbers by striking out all multiples 
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2p, 3p, 4p,... of the primes p < ./n. The integers that are left on the list—that do not 
fall through the “sieve”—are primes. 

To see by example how this works, suppose that we want to nd all primes not 
exceeding 100. Recognizing that 2 is a prime, we begin by crossing out all even integers 
from our listing, except 2 itself. The rst of the remaining integers is 3, which must be 
a prime. We keep 3, but strike out all higher multiples of 3, so that 6,9, 12,... are now 
removed. The smallest integer after 3 not yet deleted is 5. It is not divisible by either 2 
or 3 (otherwise it would have been canceled), hence is also a prime. Because all proper 
multiples of 5 are composite numbers, we next remove 10, 15, 20,..., retaining 5 itself. 
The rst surviving integer 7 is a prime, for it is not divisible by 2, 3, or 5, the only primes 
that precede it. After the proper multiples of 7, the largest prime less than 100 = 10, 
have been eliminated, all composite integers in the sequence 2, 3,4, ..., 100 have fallen 
through the sieve. The positive integers that remain, to wit 2, 3,5, 7, 11, 13, 17, 19, 23, 29, 
31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97, are all the primes less than 100. 

The accompanying table represents the result of the completed sieve. The multiples 
of 2 are crossed out by \; the multiples of 3 are crossed out by /; the multiples of 5 are 
crossed out by —; the multiples of 7 are crossed out by ~. 


2 3 4 5 K 7 S ¥ +e 
1 &% 13 % 6 NW 17 we 19 28 
we 6% 23 4M 8% 2» WM %R ~ 29 34 
31 3 6 MM Sr 8H 37 BW W 3 
4. 2 4 M 45 % 47 MR 44> 30 
BM 3 4M 5 Fe SF BW 59 4 
61 & BY 65 5 67 68 69 AY 
71 KR 73 Mm HB 6 XH BR 79 3¢ 
BR 8 BR OS OHOCBTBBKOC*RD 99 
DL ee 9% 54 95. 9G 97 BR YW OO 


Measurement of the Earth 


Today Eratosthenes is best remembered for having devised a practical method for 
calculating the earth’s circumference. Although his was not the rst or last such estimate 
made in antiquity, it was far more accurate than all previous estimates. The extraordinary 
thing about Eratosthenes’ achievement is its simplicity. His procedure was based on 
estimates of the arc of the great circle through Alexandria and Syene, the city that today 
is called Aswan. The two cities had certain advantages. They were thought to be on 
the same meridian; the distance between them had been measured by a bematistes, or 
surveyor, trained to walk with equal steps and count them, and had been found to be 
5000 stadia; and travelers had commented on the curious fact that in Syene, at the time 
of the summer solstice, the sun at noon cast no shadow from an upright stick. This meant 
that Syene was directly under the Tropic of Cancer, or at least, nearly so. Story has it 
that Eratosthenes conrmed the position of the tropic by observing the water in a deep 
well. At noontime of the summer solstice, the bottom was completely illuminated by the 
sun’s rays, the edge of the well casting no shadow at all on the water below. 

Because the sun is so vastly distant from the earth, its rays may be regarded as 
striking the earth in parallel lines. Eratosthenes argued that at noon on the day of the 
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summer solstice, the continuation of a line through the well at Syene would pass through 
the center of the earth, the sun being directly overhead. At the same time at Alexandria, 
the sun was found to cast a shadow indicating that the sun’s angular position from 
zenith was a = 7° 12’ = a or s of a complete circle. In making this determination, 
Eratosthenes apparently used a sundial consisting of a hemispherical bowl with a vertical 
pointer at its center to cast a shadow; the direction and height of the sun could be read 
off by observing the sun’s shadow with lines drawn on the concave interior. Now an 
imaginary line drawn through the vertical pointer of the sundial would pass through 
the center of the earth and there form an angle with the line through the well at Syene. 
This central angle would have to equal a, according to the theorem that asserts that the 
alternate interior angles formed by a transversal cutting a pair of parallel lines are equal. 
In brief, the angle the sun’s rays would make with the pointer of the sundial would equal 
the angle subtended at the earth’s center by the arc connecting Alexandria and Syene. 


Pointer casts 
a shadow 


ae ae 
Alexandria Parallel rays 
Shadow to ae eo from sun a 


be measured a 


Center of 
earth 


Assuming that the sundial at Alexandria, the well at Syene, the center of the earth, 
and the center of the sun when directly over Syene all lay in the same plane, Eratosthenes 
inferred that 

a 5000 360° 


4 


360° circumference’ ° 50 


and there was but one unknown (the earth’s circumference) in the equation. This gave 
him 50 times the 5000 stadia, or 250,000 stadia, for the entire circumference of the 
earth. For some reason not known to us (perhaps to account for any error that existed in 
measuring the distance between Alexandria and Syene), he added an extra 2000 to this 
gure to conclude that the desired circumference was 252,000 stadia. Unfortunately, there 
was more than one kind of stadium used for measuring distance. If it is assumed that 
Eratosthenes used Egyptian stadia of 516.73 feet each, then his 252,000 stadia work out 
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to the incredibly excellent value of 24,662 miles, just 245 miles less than the true value. 
The ancient world certainly accepted Eratosthenes’ measurement as the best possible. 
Pliny (A.D. 23-79), the Roman naturalist, said it was so bold and subtle a feat that it 
would be a shame not to accept the gure, and he even recorded divine sanction of it. 

Such a close estimate must, however, be regarded as somewhat accidental. Although 
the method was sound in theory, the accuracy of the answer would have to depend on 
the precision with which the basic data could be determined. Eratosthenes made several 
compensating errors. The gure of = of the circle for the difference in latitude is near the 
truth, but Syene is not directly on the tropic, Alexandria is not on the same meridian (it 
lies about 3° to the west of Syene), and the direct distance between the two places is 4530 
stadia, not 5000. This does not matter very much, because Eratosthenes’ achievement 
lies in his method; for a man who was regarded as a “‘second-stringer” in the Alexandrian 
era of Greek mathematics, it showed the touch of genius. 


The Almagest of Claudius Ptolemy 


Any discussion of Alexandria must take into account the advances made in astron- 
omy, a branch of science completely dependent on mathematics. For fourteen centuries, 
the accepted blueprint of the solar system was that of the Alexandrian Claudius Ptolemy 
(A.D. 100-170). Ptolemy did for astronomy what Euclid did for geometry; by incorpo- 
rating a brilliant power of synthesis and exposition with original genius, he reduced the 
works of his predecessors to a matter of “historical interest” with little chance of survival. 
His great treatise Syntaxis Mathematica (The Mathematical System), or the Almagest, 
as it became known to the Arabs and medieval Europeans, was destined to remain the 
supreme authority on astronomy until the publication of Copernicus’s De Revolutionibus 
(1543). We are ignorant of most of the events in Ptolemy’s life, except for the knowl- 
edge that he was a native of Egypt and that his numerous astronomical observations were 
made in the period between A.D. 127 and 151, probably at the Museum. 

The very name of Ptolemy’s masterpiece has its own curious history. The Greeks 
called it Megale Syntaxis (the Great Collection). Later translators from Greek into Ara- 
bic, either through admiration or carelessness, combined the Arabic article a/ with the 
superlative megiste to form the hybrid word almagisti, “The greatest,” whence the Latin 
Almagestum and colloquial Almagest, by which name it has been known ever since. 

Ptolemy came at the end of a long line of Greek thinkers who viewed the earth 
as the xed and immovable center of the universe, around which the planets swung in 
concentric circles. To assert that the earth was at any place other than the center of the 
heavens was to deny humans their position of supremacy in the universe, to believe that 
human affairs were no more signicant to the gods than those of other planets. Some 
astronomers, notably Aristarchus of Samos, proposed the heliocentric hypothesis—that 
the earth and the planets all revolved in circles about a xed sun—but it was rejected 
for various reasons. One did not have to be trained in astronomy to observe that the 
earth seemed stable under the feet, that lighter bodies did not y into the air, or that 
projectiles shot straight upward did not fall farther to the west. Archimedes advanced 
the more scientic argument that if the earth were in motion, its distance from the stars 
would vary, and this apparently was not so. 

According to the Pythagorean prejudice for the beauty and perfection of the circle, 
the motion of the sun and planets had to be circular. However, their deviation from 
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Claudius Ptolemy 
(circa 145) 


(The Bettmann Archive.) 


circular orbits was great enough to have been observed and to require explanation. To 
reduce celestial motion to combinations of circular movements, the Greek astronomer 
Apollonius had worked out an ingenious scheme of epicycles, or small circles having 
their centers on the circumferences of other circles. In the epicycle system, each planet 
travels around the earth in a large circle, called a “deferent”; this circle does not represent 
the true path of the planet, but rather the path of the center of a small circle, the epicycle, 
around which the planet revolves. Claudius Ptolemy, to rationalize these ideas with his 
accumulated observations, proposed the notion of eccentric solar motion. His system 
as described by the Almagest was perhaps as complicated, relative to his own time, as 
Einstein’s relativity theory is to our time. 

It will be enough for our purposes to say that Ptolemy set the earth eccentrically 
within the main circle representing the deferent of the planet and made the center of the 
epicycle move with uniform velocity, not about the center of the deferent, but about an 
offset point. This latter point, called the “equant,” or equalizing point, lay at an equal 
distance from the earth on the opposite side of the circle. The equant was a remarkable 
invention that not only allowed Ptolemy to describe important features of planetary 
motion in terms of circles but also tted the observational data available in the second 
century. It obviously had the motion appear the fastest when the deferent was near the 
terrestrial observer and slowest at the opposite point; and that was the explanation of 
why the sun appeared sometimes near the earth and sometimes farther away. 

The chief aw in Ptolemy’s system lay in its mistaken premise of an earth-centered 
universe. Yet the heliocentric theory was not ignored. Ptolemy devoted a column or two 
to the refutation of this theory, thereby preserving it for the ages to ponder on and for 
Copernicus to develop. Copernicus was still plagued by epicycles and the matter was not 
resolved until Kepler (1609) observed that the planets moved, not in Pythagoras’s ideal 
circle, but in elliptical orbits. As soon as Kepler made this radical break with tradition, 
everything fell into place. 
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Planet 


Epicycle 
(orbit of planet) 


Moving center 


Deferent 
(orbit of moving center) 7 = 
/ - 
4 ,-—©@ Earth 
yan (fixed in space) 
_- 7 Equant 
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Ptolemy’s Geographical Dictionary 


A work that exerted almost as much in uence on succeeding centuries as the Al/- 
magest did was Ptolemy’s Geographike Syntaxis (Geographical Directory). Written in 
eight books, it is an attempt to summarize the geographical knowledge of the habitable 
world as known at that time, that is, the continents of Europe, Asia, and Africa. The Ge- 
ography was accompanied by a collection of maps, a general map of the world and 26 
others showing regional details. Ptolemy developed his own manner of representing the 
curved surface of the earth on a plane surface. He divided the circumference of the globe 
into 360 parts, or degrees, as they came to be called, and covered the surface with a net- 
work of meridians and parallels. In choosing an arbitrary prime meridian, Ptolemy drew 
a line passing through the westernmost of the Fortunate Islands (the Canaries), but was 
mistaken by about 7° in his idea of the distance of these islands from the mainland. On 
his world map, he sought to reproduce on a at surface the contour of the globe by repre- 
senting the parallels and meridians as curved lines, with the meridians converging to the 
poles; for the smaller regional maps, a simple rectangular grid was considered suf cient. 

A glance at Ptolemy’s map will reveal a somewhat misleading picture of the known 
world. Its length from his own zero meridian in the Fortunate Islands to the city of Sera in 
China covers 180° (as against 126° in reality), with the result that the westward distance 
from western Europe to eastern Asia is much less than it should be. He was ignorant of the 
peninsular shape of India, so Ptolemy completely distorted the southern coastline of Asia; 
and the island of Ceylon is exaggerated to 14 times its actual size. He somehow assumed 
that the land mass of China ran far to the south and then to the west until it joined the 
east coast of Africa, thereby making the Indian Ocean a landlocked sea. The distortion of 
Ptolemy’s world map is partly due to his rejection of Eratosthenes’ estimate of the earth’s 
circumference, and his adoption of the less appropriate estimate of 180,000 stadia. This 
gure is too small by nearly 5000 miles, or about one-quarter of the correct distance. 

The main part of the Geography is an exhaustive gazetteer of some 8000 places, 
arranged by regions, with their supposed latitudes and longitudes. Although Ptolemy 
gave the impression that his coordinates were based on astronomical observation, he 
relied largely on Roman road-itineraries (of cial lists of stopping-places on the roads 
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of the empire, with distances between them) and on reports accumulated from traders 
and travelers who came to Alexandria. Because he worked from this sketchy data, it 
is not surprising that the positions he gave for many localities outside the well-known 
Mediterranean area were grossly inaccurate. Paris, for instance, was put opposite the 
mouth of the Loire River. But Ptolemy came remarkably close to the truth when he 
described the Nile as formed by two rivers owing from two lakes a little south of 
the equator (these are Victoria and Albert Nyanza), a fact of geography that was not 
conrmed until the nineteenth century. 

Ptolemy’s geographical treatise had its effect on western Europe much later than his 
Almagest did. It was translated into Latin in 1409, not from an Arabic manuscript but 
from a Greek one brought from Constantinople. Although initially printed in 1475, the 
rst printed edition to be accompanied by maps, drawn by medieval cartographers from 
coordinates contained in the text, was published in Rome in 1478. Columbus possessed 
a copy of this latter edition. The Latin Geography was received with great deference, 
partly because the author represented the world approximately as it had been known 
for many centuries and partly because of the mistaken conception that he had used 
rigorous mathematical methods for determining places. Besides, the scholars of the early 
fteenth century had no reliable criteria for criticizing Ptolemy. The maps based on this 
information, despite their many errors, were vastly superior to those previously available 
and covered many areas not usually touched by marine charts of the day. 

Ptolemy’s diminution of the distance between Europe and Asia by some 50° lati- 
tude fortied Columbus’s belief that he could easily reach the Orient by sailing west- 
ward across the Atlantic—perhaps even induced him to undertake his great voyage of 
discovery. Indeed, Columbus died in the conviction that the land he had_ rst sighted was 
an outlying island of southeastern India; and the error is perpetuated in the application 
of the name “Indian” to the natives of the American continents. 


The habitable world according to Ptolemy. (From Ancient Times by James Henry Breasted, 
©1916 by James Henry Breasted. Reproduced by permission of Ginn and Company.) 
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4.4 Problems 


1. In the A/magest, Ptolemy proved a geometrical result 
known today as “Ptolemy’s theorem.” If ABCD is a 
(convex) quadrilateral inscribed in a circle, then the 
product of the diagonals is equal to the sum of the 
products of the two pairs of opposite sides. In 
symbols: 


AC: BD=AB:CD+ BC: AD. 
B 


D 


If BE is drawn so that ABE = ZDBC, complete the 
details of the following proof of Ptolemy’s theorem: 
(a) The triangles ABE and DBC are similar, 


whence 
AB AE 


BD CD’ 
(b) ZABD = LABE + LEBD 
= /DBC + LEBD = LEBC. 


(c) The triangles ABD and EBC are similar, 
whence 
AD BD 
EC BC 
(d) The result of adding AB. CD = AE - BD and 
BC-AD= EC: BD is 


AC: BD=AB-CD+ BC: AD. 


2. Let AB and AC, where AB < AC, be two chords of a 
circle terminating at an endpoint 4 of the diameter 
AD. 


If CDA =a and ZBDA = B, show that Ptolemy’s 
theorem leads to 
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— =sinacos f — cosa sin 
= p. 


a result which is reminiscent of the trigonometric 
formula for sin(a — £). 


Use Ptolemy’s theorem to prove that if P lies on the 
arc AB of the circumcircle of the equilateral triangle 
ABC, then PC = PA + PB. 


Like other Greek geometers, Ptolemy used chords of 
angles rather than sines. Sines were invented much 
later, around the fth century, by the Hindu 
astronomers. Book I of the A/magest contains a table 
giving the lengths of the chords of central angles in a 
circle of radius 60, increasing by half a degree at a 
time from 1/2° to 180°. 


(a) Derive the relation 
chord 2a = 120 sina 


between Ptolemy’s value for the length of a 
chord corresponding to angle 2a and the sine 
of a. 


WV 


(b) From Ptolemy’s value chord 1° = 1;2,50 and 
using an inscribed 360-gon to approximate the 
circumference of a circle, obtain his 


approximation to z. [Hint: x = 
. : (360 chord 1°) 
circumference/diameter ~ ———_————_—.. 
diameter 
(c) From Ptolemy’s value chord 120° = 103;55,23 
and using the fact that /3 = 2sin60°, obtain 


his approximation to /3. 


Supply the missing details in the following proof of 
the formula for the area K of a triangle in terms of 
its sides a, b, and c, namely 


K = V/s(s —a\j(s — b\(s —c), 
s= s(at+b+c). 


(This formula appears in Heron’s Metrica, and a 
proof is worked out in his Dioptra. According to 


Arc 


4.5 


himedes 


Arabic tradition the result was known earlier to 
Archimedes, who undoubtedly had a proof of it.) 


In triangle ABC, inscribe a circle with center O, 
touching the sides BC, AC, and AB at points D, E, 
and F,, respectively. Extend segment CB to H so that 
HB = AF; also draw OL perpendicular to OC to cut 
BC at J and meet the perpendicular to BC at B in 
the point L. Then 
(a) K = 3(BC\(OD) + $(AC)(OE) + $(4B)(OF) = 

s(OD) = (HC)(OD). 
(b) ZCLB+ ZBOC = 180° and 

ZBOC + LAOF = 180°, so that ZCLB = ZAOF. 
(c) Triangles AOF and CLB are similar, hence 

BC BC BL BL Bs 
BH AF OF OD’ JD 

BC BJ CH BD 

— +1 = — +1 implies that — = —. 
ga ogy oe oD 
) (CH) BD-CD  BD-CD 
e = — , 

CH.HB JD-CD (OD) 
() K? =(CH)(ODY 

= CH: HB-BD-DC 
= s(s —a)(s — b\(s —c), 


where a = BC, b = AC, and c = AB. 
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The Hindu mathematician Brahmagupta (circa 600) 
discovered a formula for the area K of a 
quadrilateral inscribed in a circle: 


K=vV(s —ays — bys 


c\(s — da), 


where a, b, c, and d are the sides of the quadrilateral 
and s = s(a +b6+c+d) is its semiperimeter. Prove 
that Heron’s formula is a special case of 
Brahmagupta’s formula. 


If a quadrilateral with sides a, b, c, and d is 
inscribed in one circle and circumscribed about 
another, show that its area K is given by 


K = Vabcd. 


[Hint: Use the fact that the tangents to a circle from 
an external point are equal in length to conclude that 
s=a+c=b+d. Now apply Brahmagupta’s 
formula. ] 

Establish the following result due to Brahmagupta: If 
a quadrilateral inscribed in a circle has perpendicular 
diagonals meeting at a point P, then any line through 
P that is perpendicular to a side of the quadrilateral 
will bisect the opposite side. 


Cc 


[Hint: If XY is perpendicular to BC, then 


ZDPX = ZBPY = LPCY = LACB 
= ADB = LXDP, 


so that triangle XPD is isosceles. Similarly, triangle 
XPA is isosceles. | 


The work of Archimedes (about 287-212 B.c.) 
epitomizes Alexandrian mathematics. Consid- 
ered the greatest creative genius of the ancient 
world, Archimedes lived a generation or two af- 
ter Euclid and was a contemporary of Eratos- 


thenes. We know few details of his life, though several fanciful stories have clustered 
around his name. Archimedes was the son of the astronomer Phidias and was born in 
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Syracuse, a Greek settlement on the southeastern coast of Sicily. At the time, it was the 
largest city in the Hellenistic world. According to Plutarch, Archimedes came from the 
same royal family as the city’s ruler, King Hieron II. This enlightened dictator reigned, 
according to the historian Polybius, for 54 years “without killing, exiling, or injuring a 
single citizen, which is indeed the most remarkable of all things.” Archimedes almost 
certainly visited Egypt, and because he corresponded regularly with several scholars at 
the Museum in Alexandria, it is likely that he studied at that center of Greek science. He 
spent most of his productive years in Syracuse, however, where under Hieron’s protection 
and patronage, he devoted himself wholeheartedly to study and experiment. Archimedes 
earned great renown in antiquity for his mathematical writings, his mechanical inventions, 
and the brilliant way in which he conducted the defense of his native city during the 
Second Punic War (218-201 B.c.). It is well attested that he perished in the indiscriminate 
slaughter that followed the sacking of Syracuse by Roman troops. 

Archimedes’ mechanical skill together with his theoretical knowledge enabled him to 
devise a series of ingenious contrivances. Of these the most famous is the Archimedean 
screw, a pump still used in parts of the world. Archimedes apparently invented it during 
his visit to Egypt for the purpose of raising canal water over levees into irrigated elds. 
It was later used for pumping water out of mines and from ships’ holds. The simple 
and useful device consists of a long tube, open at both ends and containing a continuous 
screw or spiral piece of metal of the same length as the cylinder. When the lower end of 
the tube is tilted into the standing water and the spiral insert is rotated, water is carried 
to the top and_ ows out of the cylinder’s upper opening. 

Several of the stories about Archimedes that have come down to us relate to his skill 
as an engineer, for it is natural that his mechanical inventions would have a broader appeal 
than his more specialized mathematical achievements. One familiar legend concerns his 
exploit in launching a large ship. When King Hieron was amazed at the great weights 
that Archimedes could move by means of levers, cogwheels, and pulleys, Archimedes 
is reported to have boasted that if he had a xed fulcrum to work with he could move 
anything: “Give me a place to stand and I will move the earth.” Hieron asked Archimedes 
to reduce the problem to practice, and pointed out the difculty that his men were 
experiencing with a ship so heavy that it could not be launched from the slips in the 
usual way. Archimedes designed a combination of levers and pulleys that (in the words 
of that man of letters, Plutarch) he alone “while sitting far off, with no great effort, but 
only holding the end of a compound pulley quietly in his hand and pulling at it, drew the 
ship along smoothly and safely as if she were moving through the water.” The same story 
was told by Proclus, who represented Hieron as operating the pulley himself and crying 
out in amazement, “From this day forth Archimedes is to be believed in everything that 
he may say.” 

Despite his mechanical talents, Archimedes was far more concerned with theo- 
retical studies than with discoveries connected with practical needs, regarding these 
as the “diversions of geometry at play.” In The Life of Marcellus, Plutarch went on 
to say: 


Though these inventions had obtained for him the reputation of more than human sagacity, he 
yet would not deign to leave behind him any written work on these subjects, but, regarding 
as ignoble and vulgar the business of mechanics and every sort of art which is directed 
towards use and prot, he placed his whole ambition in those speculations whose beauty 
and subtlety are untainted by any admixture of the common needs of life. 
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Although Archimedes was not greatly interested in the practical applications of his 
knowledge, he was usually willing to help his admiring friend and patron, King Hieron, 
with a problem. One of the best-known stories tells of his success in determining the 
purity of a golden crown. It appears that Hieron, on gaining power in Syracuse, had a 
crown of pure gold made as an offering to the gods. The weight of the completed crown 
matched the weight of the gold that had been assigned to the goldsmith; yet Hieron 
suspected that the maker had appropriated some of the gold, replacing it with an equal 
weight of silver. Being unable to verify his suspicion, Hieron consulted Archimedes. The 
story has it that the great scientist suddenly realized how to settle the question while he 
was at the public baths of the city. Getting into the tub, he observed that the lower his 
body submerged into the water the more water over owed the top of the tub. This gave 
him the idea that if the goldsmith had actually debased the crown by alloying it with 
silver, the crown would displace a greater volume when immersed in water than would 
a quantity of gold equal to the weight of the crown; for pure gold would be more dense 
than an alloy of gold and the lighter metal silver. The Roman architect Vitruvius related 
that Archimedes, recognizing the value of this method of solution, 


without a moment’s delay and transported with joy ... jumped out of the tub and rushed 
home naked, crying out in a loud voice that he had found what he was seeking; for as he 
ran, he shouted repeatedly in Greek, “Eureka, eureka!” [“I have found it, I have found it!’’] 


Whether Archimedes actually dashed naked through the streets of Syracuse, as alleged, is 
a matter of speculation; but the common people cheerfully believed such a story, because 
it made a great man look ridiculous. 

The widest fame Archimedes enjoyed in the classical world came from the active 
part he took in defending his city against the Romans. During the third century B.c., 
Rome and the African city-state Carthage were locked in the bitter Punic wars. It was 
clear to the Romans that their mastery of southern Italy would be threatened if ever a 
hostile power controlled Sicily. While King Hieron was still alive, Syracuse remained 
Rome’s loyal ally; but Hieron died in 215 B.C. and was succeeded by his 15-year- 
old grandson, who fell under the in uence of courtiers in the pay of Carthage. Roman 
forces under a tough and businesslike general named Marcellus, seizing the opportunity 
to annex the whole of Sicily, attacked Syracuse by land and sea. Geographically the site 
was a natural fortress, and Archimedes, then an old man of 75, personally directed the 
defense. 

A vivid account of this famous siege was given by Plutarch in his writing on the life 
of Marcellus. He told how Archimedes used his engineering skill to construct ingenious 
war machines, by which he inicted great losses on the Romans. The city walls were 
fortied with a series of powerful catapults and crossbows set to throw a hail of missiles 
at specied ranges, so that however close the attackers came, they were always under 
re. The assault by sea was repulsed by devices that could be run out from the walls to 
drop huge stones or masses of lead through the planking of the galleys beneath. Cranes 
caught the bows of the vessels with grapnels, lifted them out of the water, and dropped 
them stern- rst from a height. Plutarch wrote that the Roman soldiers were in abject 
terror and refused to advance. 


If they only saw a rope or piece of wood extending beyond the walls, they took ight 
exclaiming that Archimedes had once again invented a new machine for their destruction. 
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The death of Archimedes during the siege of Syracuse. (The Bettmann Archive.) 


But the tale that Archimedes set the enemy ships on re by concentrating the sun’s rays 
on them through the use of great concave mirror, though repeated by many later writers, 
is probably not true. (Such a device was, however, used in defending Constantinople 
in 514.) After a two-year siege, the Romans temporarily withdrew their forces and the 
overcon dent Syracusans relaxed their vigilance. When the defenders had feasted and 
drunk their Il at a religious festival, pro-Roman sympathizers inside the city directed 
the enemy to a weak point in the walls. Marcellus gave explicit orders to his of cers 
that the life and house of Archimedes should be spared; but before they could locate the 
great scientist, he had been slain by a common soldier. 

The account of how Archimedes met his death has been told in various forms. 
According to the traditional story, he was absorbed in a geometrical problem whose 
diagram was drawn in the sand. As the shadow of the approaching Roman soldier fell over 
his diagrams, the agitated mathematician called out, “Don’t spoil my circles!” The soldier, 
insulted at having orders thus given to him, retaliated by drawing his sword. Another 
legend has it that Archimedes was slain by looters who supposed that his astronomical 
instruments, constructed of polished brass, were actually made of gold. 

Marcellus deeply regretted the death of Archimedes and erected an elaborate mon- 
ument in his honor. Archimedes had expressed the wish to friends that his tomb should 
bear the gure of a sphere inscribed in a right cylinder, in memory of his discovery of 
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the relation between the two bodies (the volume of the sphere is equal to two-thirds that 
of the circumscribing cylinders). In building his tomb, the Romans complied with his 
wish. Many centuries later, the Roman orator Cicero identi ed the monument by means 
of this inscription. His account in Tuscalan Disputations of how he found it in a ruined 
state, neglected by the people of Syracuse, is worth repeating: 


When I was questor [B.C. 75] I hunted out his grave, which was unknown to the people 
of Syracuse, since they entirely denied its existence, and I found it completely covered 
and surrounded by brambles and thorn-bushes.... Slaves sent in with sickles cleared and 
uncovered the place. When a passage had been made to it, we approached the pedestal 
facing us: the epigram was apparent with about half of the little verse worn away. And thus 
one of the noblest cities of Greece, once indeed a very great seat of learning, would have 
been ignorant of the monument of its most brilliant citizen, except that it was revealed by 
a man of Arpinum [Cicero]. 


The tomb has since disappeared and its exact location is unknown. 


Estimating the Value of 


A survey of the contents of a few of Archimedes’ principal works is enough to 
reveal the wide range of subjects he studied and the surprising ingenuity with which he 
treated them. The dozen items that have come down to us were preserved by a school 
of Byzantine mathematicians in Constantinople; between the sixth and tenth centuries, 
they made it their objective to collect and copy the dispersed treatises of Archimedes. 
These have greatly lost their original form, having suffered the linguistic transformation 
from the Sicilian-Doric dialect into Attic Greek. Unlike the Elements of Euclid, the 
works that have immortalized Archimedes were never popular in antiquity; where Euclid 
worked up existing material into systematic treatises that any educated student would 
understand, Archimedes aimed at producing small tracts of limited scope addressed to 
the most eminent mathematicians of the day. “It is not possible,” wrote Plutarch several 
centuries later, “to nd in all geometry more dif cult and more intricate questions, or 
more simple and lucid explanations.” 

It was Archimedes’ practice rst to send statements of his results, with the request 
that the other mathematicians discover the proofs for themselves; the complete treatise, 
with its supporting evidence, would follow thereafter. He was not above enunciating 
theorems he knew to be false so that “those vain mathematicians who claim to discover 
everything, without ever giving their proofs, may be deceived into saying that they have 
discovered the impossible.” 

Of all his mathematical achievements, Archimedes seems to have taken chief pride 
in those contained in On the Sphere and Cylinder. Written in two books, some 53 propo- 
sitions in all, it begins with a prefatory letter announcing the main results obtained. 
Archimedes indicated that he was publishing them for the rst time so that expert math- 
ematicians could examine the proofs and judge their value. Those propositions selected 
for mention included: 


1. The surface of a sphere is four times the area of a great circle of the sphere [or as 
we would say, S = 4zr?]. 


2. If about a sphere there is circumscribed a cylinder whose height is equal to the 
diameter of the sphere, then the volume of the cylinder is three halves of the 
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volume of the sphere; and the surface of the circumscribing cylinder, including its 
bases, is three halves of the surface of the sphere. 


Then follow some de nitions and assumptions. Of the ve assumptions, there is a famous 
one, a property that Archimedes himself attributed to Eudoxus. This is usually known 
today as the postulate of Archimedes: of two unequal line segments, some nite multiple 
of the shorter one will exceed the longer. Using this, Archimedes derived the above 
results, plus numerous others relative to the area or volume of gures bounded by curved 
lines or surfaces. 

Book II of On the Sphere and Cylinder treats some problems and theorems suggested 
by the rst book. In his work on segments of a sphere, Archimedes was confronted with 
the solution of a cubic equation. This occurs in Proposition 4 of Book I, which poses 
one of the great problems of Greek geometry—to pass a plane through a sphere in such 
a way that the volumes of the segments cut off are in a given ratio. 

The problem can be analyzed as follows. Suppose 2r is the diameter of the given 
sphere. It is required to nd a plane cutting this diameter at right angles so that the 
segments into which the sphere is divided have their volumes in a given ratio, say m/n. 
Because the volume of a spherical segment of height h, cut from a sphere of radius r, is 
given by the formula V = 2h?(r — h/3), we must have 

h?(3r — h) _m 

hBr—k) n° 
If & is eliminated by the relation h + k = 2r, this becomes 

nh?(3r — h) = m(2r —hyY(r +h) 
= m(h? — 3h?r + 4r°), 
or what amounts to the same thing, 
(m +n)h? — 3r(m + n)h? + 4mr? = 0, 

a cubic equation in which the term containing / is missing. This can be written 


3r —h 4r? 
mr/(m+n) h2’ 
and Archimedes treated it as a particular instance of the more general equation 


a-x ¢ 
b x2! 

Archimedes promised to provide a complete solution to the equation and then to 
apply it to the particular case at hand; but either the explanation was omitted or else 
this part of the text has been lost. The details were found centuries later in a fragment 
of a manuscript, which is usually attributed to Archimedes because it was written in the 
Sicilian-Doric dialect he used. The reconstructed solution proceeds in much the same way 
that the geometer Menaechmus attacked the Delian problem—by nding the intersection 
of conics. That is, both members of (a — x)/b = c?/x* are equated to a/y. This leads 


to two equations, 


a 
= (<) vy; (a —x)y =ab, 
a 
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which represent, respectively, a parabola and a hyperbola. The points of intersection 
of these two conics will furnish the solutions of x?(a — x) = bc”. The fragment also 
proves that if bc? = 4a*/27, then the curves touch at the point for which x = 2a/3, 
while if bc? < 4a?/27, there are two solutions. Except for a simple cubic encountered by 
Diophantus of Alexandria in the rst half of the fourth century, interest in cubic equations 
disappeared after Archimedes, not to reappear in the history of European mathematics 
for more than a thousand years. 

Of the works of Archimedes known in the Middle Ages, the most popular, and the 
rst to be translated into Latin, was The Measurement of a Circle. It is a short treatise, 
perhaps a part of a longer work, comprising only three propositions. The object of the 
rst is to show that the area of a circle can be calculated as soon as its circumference is 
known. 


The area of any circle is equal to the area of a right triangle in which one of the sides 
about the right angle is equal to the radius, and the other to the circumference of the 
circle. 


a 


The next proposition (whose proof we include) establishes that if the circumference of a 
circle is 34 of the diameter, then the area of the circle is to the square of its diameter as 
11 is to 14. Archimedes could not have originally placed it before Proposition 3, because 
the approximation depends on the result of that proposition. 


The area of a circle is to the square on its diameter as \1 to 14, very nearly. 


Proof. Take a circle with diameter AB and let a square CDEF be circumscribed 
about it. Produce the side CD so that DG is twice CD and GH is one-seventh CD. 
Because the areas of triangle ACG and ACD are in the ratio 21:7 and ACD and 
AGH are in the ratio 7:1, triangle ACH and triangle ACD are in the ratio 22:7. But 
the square CDEF is four times the triangle ACD, and therefore the triangle ACH is 
to the square CDEF as 22:28, or 11:14. The triangle ACH equals the circle, since 
AC equals the radius and CH equals the circumference (which will be shown in 
Proposition 3 to be very nearly 34 of the diameter). Thus the circle and the square 
CDEF are in the ratio 11:14, very nearly. 


D 
C Coy 
A B 
F E 
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The most important proposition in The Measurement of a Circle contains 
Archimedes’ estimate of the numerical value of wz. He did not call it 7. The symbol z 
for the ratio of the circumference of a circle to its diameter was not used by Archimedes 
or any other Greek mathematician. It was introduced in 1706 by an obscure English 
writer, William Jones, in his Synopsis Palmariorum Matheseos, or a New Introduction 
to the Mathematics. In this book for beginners, Jones published the circumferences-to- 
diameter ratio to 100 decimal places, all correct. It was not until the usage given it by 
Leonhard Euler in the famous /ntroductio in Analysin Infinitorum (1748) that the letter 
mz was de nitely adopted for this ratio, no doubt because it is the rst letter of the Greek 
word perimetros (perimeter). 

The approach Archimedes took in obtaining a value for 2 was based on the following 
fact: the circumference of a circle lies between the perimeters of the inscribed and 
circumscribed regular polygons of n sides, and as n increases, the deviation of the 
circumference from the two perimeters becomes smaller. This type of demonstration 
has since become known as the “method of exhaustion”—not for what it does to the 
user, but because the difference in area between the polygons and the circle is gradually 
exhausted. Although it amounts to considering the circle as the limit of the inscribed 
(or circumscribed) polygons as the number of sides increases inde nitely, there is no 
direct passage to the limit. For the Greek mathematician never thought of the process 
as continued for an in nite number of steps; he considered it only carried out in nite 
stages to a desired degree of accuracy. 

In calculating a suitable approximation for 2, Archimedes successively inscribed 
and circumscribed regular polygons of 6, 12, 24, 48, and 96 sides within and without the 
circle. The choice for the number of sides was natural. Of all the regular polygons, the 
hexagon is most easily inscribed. Simply mark off from any point on the circumference 
chords of a length equal to the radius of the circle until all six vertices, say, 4, B, C, 
D, E, and F, are obtained. When tangents are drawn to the circle A, B, C, D, E, and 
F, another regular hexagon is produced, one that circumscribes the circle. 


From the regular hexagon, the regular inscribed 12-sided polygon is constructed by 
bisecting the arc subtended on the circumscribed circle by each side of the hexagon, 
using the additional points thus found and the original vertices to form the required 
dodecagon. Continuing in this way, by repeated bisection of arcs, Archimedes obtained 
the regular polygons of 12, 24, 48, and 96 sides from the hexagon. 
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If p, and P, represent the perimeters of the inscribed and circumscribed regular 
polygons of n sides, and C the circumference of the circle, it follows that 


Po < Pir < Pra < Pas < Pos < +++ < Pn <C 
< Py <+++< Poo < Pag < Pog < Pir < Po. 


Both of these sequences are bounded monotonic sequences, and hence each has a limit; 
and it can be proved that the limits are the same, with C their common value. Moreover, 
P>,, is the harmonic mean of p, and P,, and po, is the geometric mean of p, and P>,: 


2DnP, 
Po, = ——"—, Pon = V PnPr2n- 
Pn + P, 
Starting from the perimeters pg = 3d and Py = 2./3d, where d is the diameter of the 
circle, one can use these recursion relations to compute P2, and p2, successively until 
the values Pos and pos required by Archimedes are reached. Assuming the inequality 


265 1351 
a a2 
153 ~*~ ~ 780 
as known without further explanation, Archimedes found that 


10 96 - 66 96 - 153 10 
ii ee ae me bee ea hg 
( 7) “oo 6 < "1673 <( x) 


whence the nal result 
10 1 
3a <WT< 35. 


The result of Archimedes’ computation was expressed as this proposition. 


The circumference of any circle exceeds three times its diameter by a part that is less 
than 4 but more than = of the diameter. 


ee ————————————— | 


The approximation of 2 is often called the Archimedean value of 2. Because _ x 


3.1429 is less than 0.2 percent larger than the actual value of wz and is such a simple 
number for ordinary calculation, it was good enough for most purposes in antiquity. 
Archimedes could theoretically have provided a better estimate of 2 using polygons of 
192 or 384 sides, but the arithmetic—made dif cult in any case by the clumsy Greek 
alphabetic number symbols—would have been prohibitive. 

Historians of science have focused considerable attention on the attempts of early 
societies to arrive at an approximate value for the ratio of a circle’s circumference to its 
diameter (that is, the number z), perhaps because the increasing accuracy of the results 
seems to offer a measure of the mathematical skill of the culture at that time. The ancient 
Chinese were considerably more advanced in arithmetic calculation than their Western 
contemporaries, so it is not surprising that they obtained remarkably accurate values for 
a. Texts from the pre-Christian era generally used 3 as an approximation for z, but 
from the rst century, mathematicians in China were searching for better estimates. Liu 
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Hsin (circa 23) employed 3.1547, and Chang Heng (78-139) used the value 10, whose 
decimal approximation is 3.1622; or the fraction 92/29, whose decimal approximation is 
3.1724. 

By taking the ratio of the perimeter of a regular inscribed polygon to the diameter 
of a circle enclosing the polygon, third century mathematicians obtained more accurate 
approximations. Liu Hui, in his commentary on the Nine Chapters of the Mathematical 
Art, used a polygon of 384 sides to derive for z the bounds 


3.141024 < mw < 3.142904, 


and with a 3072-sided polygon found his best value for 7, namely 3.14159. In the fth 
century, the brilliant mathematician and astronomer Tsu Chung-Chi (430-501) re ned 
the method to obtain 


3.1415926 < mw < 3.1415927; 


and, from these, gave the fraction 22/7 as an “inaccurate” value for z and 355/113 as the 
“accurate” value. This latter value yields 2 correct to six decimal places. Comparable 
rational approximations were not attained in the Western world until the sixteenth century 
when the Dutch fortress engineer Adriaan Anthonizoon (1527-1607) derived anew the 
ratio 355/113. No fraction with denominator less than 113 gives a closer approximation 
to mz; in fact, 355/113 is such a good rational estimate that no better one is reached 
until 52,163/16,604. By using the Archimedean method on a polygon of 2% sides, the 
indefatigable Ludolph van Ceulen (1540-1610) carried the value of z correctly to 35 
decimal places. (This computational feat was considered so extraordinary that his widow 
had all 35 digits of the “Ludolphine number” carved upon his tombstone.) His was one 
of the last major attempts to evaluate 2 by the method of perimeters; thereafter, the 
techniques of calculus prevailed. 


The Sand-Reckoner 


The Sand-Reckoner of Archimedes was a computational accomplishment of another 
kind. It contained a new system of notation for expressing numbers in excess of one 
hundred million, for which Greek mathematics had not yet developed any characters. 
Archimedes contrived a procedure for counting in units of ten thousand myriads, 10° in 
our notation, and used exponents for ordering his classes of magnitudes. To demonstrate 
that his system would adequately describe enormously large numbers, he undertook to 
enumerate the grains of sand that the nite universe, bounded by the sphere of the xed 
stars, could hold. (Like other astronomers of the time, Archimedes believed the universe 
to be a sphere whose center was the immobile earth and whose radius equaled the distance 
from the earth to the sun.) 

To give a reasonable maximum bound on the dimension of the universe, Archimedes 
quoted certain earlier views on the size of the celestial bodies. Like most earlier as- 
tronomers, he assumed that the earth had a diameter greater than that of the moon but 
less than that of the sun, and that the diameter of the sun was 30 times the diameter of 
the moon. (The factor 30 was a convenient exaggeration of the traditional estimate of 
20.) If the diameters of the sun, moon, earth, and universe are represented by D with 
suitable subscripts, this means postulating that 


Dgun = 30D moon i 30 Dearth: 
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By a clever geometric argument, Archimedes proved that the perimeter of a regular 
polygon of 1000 sides inscribed in a circle of diameter Duniy was greater than 3 Dyniy and 
at the same time less than 1000D,,,; hence, 


3Duniv < 1000Dgun < 30,000 Dearth 


For the circumference of the earth, he took a then accepted value of 300,000 stadia, but 
in order to be on the safe side multiplied by a factor of 10, thereby assuming that 


Dearth < 1,000,000 stadia. 


Archimedes concluded from these assumptions that for the diameter of the universe as 
far as the sun, 


Dyniy < 10'° stadia. 


To make good his boast, Archimedes next supposed that a grain of sand had minute 
but de nite size. Underestimating the size of a grain of sand, he proposed that 10,000 
grains of sand would be needed to 11 the space of a poppy seed and that 40 poppy seeds 
lined up in a row would exceed one nger-breadth. Therefore (using V = im D> < D) 
a sphere of diameter one nger-breadth would contain at most 64,000 poppy seeds, 
consequently at most 640 million grains of sand—in any event, no more than | billion = 
10° grains. Taking one stadium to be less than 10,000 = 10* nger-breadths, Archimedes 
then found the number of grains of sand in a sphere of diameter | stadium to be fewer 
than 10°(10*)? = 107!. A secure upper bound for the grains in a sphere with diameter 
10!° stadia was 107!(10!°)? = 10°!, or as Archimedes put it, “one thousand units of the 
seventh order of numbers.” 

The gure just mentioned gives the number of grains of sand needed to Il up 
the “conventional universe.” To demonstrate the practicality of his method beyond any 
doubt, Archimedes also referred to the view of Aristarchus of Samos (sometimes called 
the Copernicus of antiquity) that the universe was heliocentric, with the earth revolving 
around the sun. He showed that a universe of the dimensions Aristarchus proposed in 
On the Size and Distance of the Sun and Moon had room for only fewer than 10° grains 
of sand. Archimedes concluded the discussion with the following words: 


These things will appear incredible to the numerous persons who have not studied mathe- 
matics; but to those who are conversant therewith and have given thought to the distances 
and the sizes of the earth, the sun, and the moon, and of the whole universe, the proof will 
carry conviction. 


The treatise On Spirals contains 28 propositions dealing with the properties of the 
curve now known appropriately as the spiral of Archimedes. It is described in the words 
of the inventor himself: 


If a straight line [half-ray] one extremity of which remains xed be made to revolve at a 
uniform rate in the plane until it returns to the position from which it started, and if, at the 
same time as the straight line is revolving, a point moves at a uniform rate along the straight 
line, starting from the xed extremity, the point will describe a spiral in the plane. 


In modern polar coordinates, the equation connecting the length r of the radius vector with 
the angle 0 through which the line has revolved from its initial position is 7 = a0, where 
a > 0 is some constant. For let OA be the revolving half-line, O the xed extremity, 
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and P the point that moves away from O along OA. If OP =r and AOP = 0, then the 
characteristic property of the Archimedean spiral requires r/@ to be constant. 


P=(r,60) 


DY) 


r 


In the view of the modern mathematician, perhaps the greatest mathematical achieve- 

ment of Archimedes, and certainly one of the most fascinating results, was his calculation 
of the area enclosed by the rst loop of the spiral (corresponding to 0 < 6 < 2zr) and the 
xed line. As he put it: “The space bounded by the spiral and the initial line after one 
complete revolution is equal to one-third of the circle described from the xed extremity 
as center, with radius that part of the initial line over which the moving point advances 
in one revolution.” This is equivalent to the modern formulation 4 = am (20a). Nowa- 
days, a problem of this kind is made easy by the use of integral calculus. Archimedes, 
in its stead, used the method of exhaustion; he divided the spiral curve into numerous 
equal parts and circumscribed and inscribed circular sectors, adding up their areas. 


The method of exhaustion is traditionally attributed to Eudoxus of Cnidos (390— 
337 B.c.), although Euclid and Archimedes used it most frequently and to greater advan- 
tage. The method plays a leading part in Book XII of the Elements, where it was used to 
prove that the areas of circles are to one another as the squares of their diameters, and 
also that the volumes of pyramids that are of the same height and have triangular bases 
are proportional to the areas of their bases. Archimedes subsequently exploited exhaus- 
tive techniques in nding the areas of curvilinear plane gures and volumes bounded by 
curved surfaces. The method is encountered in Archimedes’ work in two main forms. 
One version consists in enclosing the geometric gure whose area or volume is sought 
between two others, which can be calculated and can be shown to approach each other 
inde nitely. The essence of the other approach is to inscribe suitably chosen gures 
within the gure for which the area or volume is required; then in some fashion the area 
or volumes of the inscribed gures are increased until the difference between them and 
the quantity to be calculated becomes arbitrarily small. The phrase “method of exhaus- 
tion” was not used by the ancient Greeks to describe this procedure but introduced by 
the Jesuit mathematician Gregory St. Vincent in his Opus Geometricum (1647). 
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Quadrature of a Parabolic Segment 


Archimedes used the method in the Quadrature of a Parabola to nd_ the area of the 
segment formed by drawing any chord of the parabola. Archimedes begins to “exhaust” 
the area of the parabolic segment by inscribing in it a triangle of the same base as the 
segment and of a height equal to the height of the segment. (By “height of a parabolic 
segment” we mean the distance from the chord to the point on the parabola at which the 
tangent is parallel to the chord.) The other two sides of the inscribed triangle provide 
two new parabolic segments; in each of these another triangle is inscribed in the same 
way, with the process continued as far as desired to build up an inscribed polygon as the 
sum of a sequence of triangles. In this way, Archimedes found that the segment cut off 
by the chord had an area equal to 4 the area of the rst triangle constructed. 


B 


A 


Archimedes’ argument is typical of his general approach in determining areas or 
volumes by exhaustion, so it is worth looking at more closely. In the parabolic segment 
bounded by the chord AB, Archimedes constructed a triangle ABC having AB for its base 
and the point C for the third vertex. At C, the tangent to the parabola was parallel to the 
chord. (It is proved that C is the point on the curve that has the greatest perpendicular 
distance from the base AB.) Let the area of triangle ABC be denoted by A. In each of the 
two smaller segments cut off by the chords AC and CB, Archimedes similarly inscribed 
triangles ADC and CEB. From the properties of the parabola, he demonstrated that each 
of the two new triangles had an area equal to zA; hence, the area of ADC and CEB 
together equaled zA. Next, more triangles were constructed with vertices on the parabola 
and bases on the new chords AD, DC, CE, and EB. Each of these four triangles had an 
area equal to < that of triangle ADC, or equal to (1/8")A, so that this set of triangles 
added (1/47)A to the area of the inscribed gure. Continuing, Archimedes obtained a 
sequence of polygonal gures by adding an ever-increasing number of triangles to the 
original triangle ABC. The area of the nth such polygon is given by 


IN pee eee je 
4 7 4 4n }- 


This is a nite geometric progression of ratio ; whose sum, 
4 1/1\" 
Kame le : 
3 3\4 


measures areas closer and closer to the area required. At this point, the modern math- 
ematician would use the limit concept to conclude that the parabolic segment has an 
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area of tA. Archimedes, who did not have a symbol for this notion, instead proved by 
a double reductio ad absurdum argument that if the polygons exhausted the parabolic 
segment, then its area could be neither greater nor less than 4A. 

In 1906, the Greek text of yet another work by Archimedes was discovered almost 
by accident in the library of a monastery in Constantinople. A Danish philologist, Johan 
Ludvig Heiberg, was drawn there by the report of a tenth-century parchment manuscript 
that seemed originally to have had mathematical content (a so-called palimpsest). Some- 
time between the twelfth and fourteenth centuries, monks had washed off the earlier 
text to provide space for a collection of prayers and liturgies, a not uncommon practice 
caused by the high cost of parchment. Fortunately, most of the expunged contents could 
be deciphered with a magnifying glass. The manuscript contained fragments of many 
treatises of Archimedes that had suf ciently wide circulation to be preserved elsewhere; 
it also contained the only surviving copy of a largely unknown work entitled The Method. 
Historians had been aware of the existence of The Method through allusions by ancient 
writers, such as Heron, but it had been believed irretrievably lost. Sent as a letter to 
Eratosthenes, it recalled certain mathematical results that Archimedes had propounded 
without proof on a former occasion; and it went on to acquaint Eratosthenes with the 
method that had been used in reaching these and many other conclusions. Anticipating 
the view of modern integral calculus, Archimedes asserted that surfaces were to be con- 
sidered “made up” of an in nity of parallel lines and that solids of revolution were “ lled 
up” by circles. But Archimedes did not regard such intuitive reasoning as a proof, only 
as an investigation preliminary to a rigorous demonstration by the method of exhaustion. 
By this ingenious method, he found the surface areas, volumes, and centers of gravity 
of numerous solids of revolution. Although these achievements are remarkable anticipa- 
tions of results found later in the integral calculus, we must be careful not to impute to 
Archimedes the idea expressed in the calculus; for the concept of limit, which lies at the 
very heart of the subject, was entirely alien to his arguments. 

In the preface to The Method, Archimedes says, “I presume there will be some 
among the present as well as future generations who by means of the method here 
explained will be enabled to nd other theorems which have not yet fallen to our share.” 
Unfortunately, his hope of nding successors to continue his work remained unful Iled. 
After Archimedes’ time, the trend of Greek mathematics was in other directions; and 
more than eighteen centuries were to pass before Newton and Leibniz took up the task of 
developing the classical method of exhaustion into the principles constituting the calculus. 


Apollonius of Perga: The Conics 


The last of the three great geometers who ourished in the period of 300 to 200 
B.c. was Apollonius, a younger contemporary of Archimedes. Apollonius was born in 
the Greek city of Perga, close to the southeast coast of Asia Minor. As a youth, he 
went to Alexandria—perhaps to study at the Museum with the successors to Euclid— 
and resided there for many years to lecture and compose the rst draft of his famous 
Conics. Later, Apollonius moved to Pergamum, which had a newly founded university 
and library modeled after those in Alexandria. While there, he became acquainted with 
the geometer Eudemus of Pergamum, to whom he subsequently dedicated the rst three 
books of the Conics. 

Apollonius wrote 11 works, only 2 of which have survived, and he is particularly 
renowned for his Conics. It contains a wealth of 389 propositions organized into eight 
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books. The rst four books have come down to us in the original Greek, the next three are 
preserved in Arabic translation, while the last is lost. The study of the three curves that 
we call “conic sections” was not a new topic with Apollonius, although he did introduce 
the familiar names parabola, hyperbola, and ellipse. Proclus’s Commentary tells us that 
Menaechmus, a pupil of Eudoxus and a member of Plato’s Academy, discovered these 
curves some time around 350 B.c. The initial four books of the Conics make up a 
systematic exposition and improvement of much that was previously set forth, with the 
remaining books devoted to original material. Apollonius’s treatment of the theory of 
conics was so admired that it was he, rather than Euclid, who in antiquity earned the 
title The Great Geometer. 

Apollonius de ned a circular conic as being generated by a rotating line that traverses 
a circle, while also passing through a_ xed point not in the plane of the circle. A right 
circular cone is one whose axis is perpendicular to the circle’s plane. 

Prior to Apollonius, geometers treated the conic sections as arising from three types 
of right circular cones, distinguished by their vertex angles. They cut each cone by 
a plane perpendicular to the generating line. Depending on whether the cone’s vertex 
angle was right, obtuse, or acute, the resulting curve was a parabola, a hyperbola, or 
an ellipse. The earlier investigators called these curves the section of a right-angled 
cone, the section of an obtuse-angled cone, and the section of an acute-angled cone. 
Both Euclid and Archimedes are known to have approached the subject from this point 
of view. Apollonius’s decisive achievement was to show that all three curves could 
be obtained from any cone simply by varying the inclination at which the intersecting 
plane meets the generating line. 

Using the method of “application of areas” favored by Euclid, Apollonius derived 
the geometric counterparts of the Cartesian equations of the conics. Consider, for in- 
stance, the case of the parabola. Let A be the vertex and the line AB be the axis of 
symmetry. Suppose P is any point on the parabola and Q is the foot of the perpen- 
dicular from P to AB. Now at A erect a line L perpendicular to AB. On L mark off a 
segment AR equal in length to that of the latus rectum of the conic. (The latus rectum 
or parameter is the chord passing through the parabola’s focus F’ and is perpendicular 
to AB.) 


Apollonius was able geometrically to construct a rectangle of area (PO) having the 
segment AR as one side and AQ as the other side. This led him to the de ning equation 
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of the parabola, 
(PO) = (AR\(AQ). 


The expression can be formulated algebraically by naming the segments AQ and PQ as x 
and y, respectively, and denoting as p the constant length AR. We then have the modern 
equation y* = px for the parabola. 

Nowadays, a parabola is usually dened as the locus of all points equally distant 
from a xed point (the focus) and a xed line (the directrix). Other than oblique ref- 
erence, Apollonius never attached a name to the focus of a conic—it was introduced 
as a mathematical term by Johannes Kepler in 1604—nor was the notion of directrix 
mentioned in his writings. 

Apollonius is also credited with signi cant accomplishments in optics and astronomy, 
especially planetary theory. He is reported to have earned the nickname Epsilon, because 
the Greek letter € is shaped like the crescent of the moon to which he devoted considerable 
study. One early writer said that Apollonius determined the distance of the moon from 
earth to be ve million stadia, about 600,000 miles; but the gure seems unlikely, as it is 
some two and a half times too great. (The astronomer Hipparchus of Nicaea, ca. 190-120 
B.C., gave the moon’s distance as 605 earth’s radii of 242,000 miles, quite close to the 
modern gure of 239,770 miles.) To account for the asymmetry in the orbit of the planet 
Mars, Apollonius broke with tradition by asserting that its apparently circular orbit was 
not about the center of the earth but about some point far distant from the earth. His 
conjecture anticipates the work of Kepler, who showed that Mars travels in an elliptical 
path around the sun. 

Apollonius is often remembered for a celebrated geometrical problem that he posed 
in his lost treatise, On Tangencies. Known today as the Problem of Apollonius, it says: 
Given three circles, construct a fourth circle that is tangent to each of the given ones. 
When Francois Viéta reconstructed the contents of On Tangencies in 1600, the circles 
problem became a focus of activity for many of the outstanding mathematicians of the 
seventeenth century. 


as the side of the cone has to the radius of the 


45 Problems circle that is the base of the cone. 
(d) Proposition 33. The surface area of any sphere 
1. Verify the following results from Book I of is equal to four times the area of a great circle 
Archimedes’ On the Sphere and Cylinder : of the sphere. 


(e) Proposition 34. The volume of any sphere is 
equal to four times the volume of the cone 
whose base equals a great circle of the sphere, 
and whose height equals the radius of the 
sphere. 


(a) Proposition 13. The surface area of any right 
circular cylinder, excluding its bases, is equal to 
the area of a circle whose radius is the mean 
proportional between the side of the cylinder 
and the diameter of the base of the cylinder. 

(b) Proposition 14. The lateral area of any isosceles 
cone, excluding the base, is equal to the area of 
the circle whose radius is the mean proportional 


2. Prove that if a sphere is inscribed in a right circular 
cylinder whose height is equal to the diameter of the 
sphere, then: 


between the side of the cone and the radius of (a) The volume of the cylinder is the volume of 
the circle that is the base of the cone. the sphere. 
(c) Proposition 15. The lateral area of any isosceles (b) The surface area of the cylinder, including its 


cone has the same ratio to the area of its base bases, is the surface area of the sphere. 
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3. Prove Archimedes’ “theorem of the broken chord”: If 


AB and BC make up any broken chord in a circle 
(where BC > AB), and M is the midpoint of the arc 
ABC and MF the perpendicular to the longer chord, 
then F is the midpoint of the broken chord. That is, 
AB + BF = FC. [Hint: Extend chord BC to D, so 
that FD = FC; then AMBA is congruent to AMBD.] 


. To nd a formula for the length of the side of a 
regular inscribed polygon of 2n sides in terms of the 
length of the side of the regular polygon of 7 sides, 
proceed as follows. Let PR = S,, be the side of a 
regular n-gon inscribed in a circle of radius 1. 
Through the center O of the circle, draw a 
perpendicular to PR, bisecting PR at T and meeting 
the circle at QO; then PO = OR = S,,, are sides of the 
inscribed regular 2n-gon. Prove that 


2 
(a) OTF = OR? — TR? =1- a, 


2 
— @ 
(b) or = on = (1-45) 


(c) S3,=O7 + TR? =2-—/4-S8?. 


Q Son 
Pe _| R 


Ni 


5. For regular polygons inscribed in a circle of radius 1, 


use Sg = | to conclude that 


Sy =V2- V3, 


So = ¥2-V2+ V3, 
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and hence that 2 © 48S, + 3.14103 © 2, which 
was the value Archimedes found. 


. In the Book of Lemmas (a collection of 13 


geometrical propositions that has come down to us 
only in an Arabic translation), Archimedes 
introduced a_ gure that, owing to its shape, is known 
as the arbelos, or “shoemaker’s knife.” 


If a straight line AB is divided into two parts at C 
and if on one side of AB are described semicircles 
with AB, AC, and CB as diameters, then the region 
included between the circumferences of the three 
semicircles is the shoemaker’s knife. Prove that 
if PC is the straight line perpendicular to AB at C, 
then the area of the shoemaker’s knife equals 
the area of the circle whose diameter is PC. 
[Hint: AB* = AC? + BC? + 2AC- BC 

=AC 4+ BC 4..-4+2PC*] 


. Prove that if the common external tangent to the two 


smaller semicircles in the shoemaker’s knife touches 
these curves at R and S, then RS and PC bisect each 
other, and R, S, P, and C lie on the circle whose 
diameter is PC. 


. The Book of Lemmas also contains a geometrical 


gure called the “salinon,” or “salt cellar.” Take 

AC = DB on the diameter AB of a semicircle, as on 
the next page. Then describe semicircles, with AC 
and DB as the diameters, on the same side of AB as 
the given semicircle; also describe a semicircle, with 
CD as the diameter, on the other side of the given 
semicircle. The region bounded by the circumference 
of the semicircles is the salt cellar. Prove that if PO 
is the line of symmetry of the gure , then the area of 
the salt cellar equals the area of the circle whose 
diameter is PQ. 
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9. 


10. 


11. 


Use the techniques of calculus to show that the area 
bounded by the rst complete turn of the spiral 

r = a0 and the initial line is equal to one-third of the 
“ rst circle” (that is, the circle with radius 27a). 


Like Hippias’s quadratrix, the spiral of Archimedes 
can be used to trisect an angle and square the circle. 
Given a spiral, place the angle to be trisected so that 
the vertex and the initial side of the angle coincide 
with the initial point of the spiral and the initial 
position OA of the rotating ray. Let the terminal side 
of the angle intersect the spiral of P. Trisect the 
segment OP at the points Q and R, and draw circles 
with center at O and with OQ and OR as radii. 
Prove that if these circles meet the spiral in points U 
and V, then the lines OU and OV will trisect ZAOP. 


A clever solution to the problem of the quadrature of 
the circle is achieved by means of the spiral of 
Archimedes. Given a circle with center at O and 
radius a, draw the spiral whose equation in polar 
coordinates is r = a0 and whose initial point is O. 
Prove that when the rotating ray is revolved 
perpendicular to its initial position OA, the segment 
OP will have a length equal to one-fourth the 
circumference of the circle. Show how this resolves 
the quadrature problem. 
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12. If OA is the initial line and A the end of the rst 
revolution of the spiral, and if the tangent to the 
spiral at A is drawn, then the perpendicular to OA at 
O will meet the tangent at some point B. Establish 
that the length of the segment OB is equal to the 
circumference of the circle with radius OA; hence, 
the area of AAOB is equal to the area of this circle. 
[Hint: The slope of the tangent at A is 27.] 
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CHAPTER 5 


The Twilight of Greek Mathematics: 
Diophantus 


When we cannot use the compass of mathematics or the touch of experience . . . it is certain that we 
cannot take a single step forward. 
VOLTAIRE 


The end of the third century B.C. saw the 


The Decline of Alexandrian Mathematics close of the Golden Age of Greek math- 


ematics. As the next century wore on, 
The Waning of the Golden Age political strife and anarchic conditions in 

Egypt proved more and more stiing to 
original scientic work and scholarship at the Alexandrian Museum. Ptolemy VII, the 
victor in a power struggle in 146 B.c—unheedful of his predecessors’ enlightened poli- 
cies toward the arts and sciences—banished from Egypt all those scientists and scholars 
who had not demonstrated their loyalty to him. Alexandria’s loss enriched the rest of the 
Mediterranean world, for learning was noticeably stimulated in those places to which the 
exiled Alexandrian scholars ed. According to Athenaeus of Naucratis: 


The King sent many Alexandrians into exile, ling the islands and towns with men who had 
been close to his brother—philologists, philosophers, mathematicians, musicians, painters, 
physicians and other professional men. The refugees, reduced by poverty to teaching what 
they knew, instructed many other men. 


Until Diophantus once more brought fame to the Museum, Alexandria no longer enjoyed 
the primacy that it had once held over leading Eastern centers of learning. 

The last two centuries of the pre-Christian era saw the steady and relentless growth 
of Roman power. When Rome began to expand outside of peninsular Italy, it rst gained 
mastery over the western half of the Mediterranean basin. Syracuse, though protected by 
ingenious military machines that the mathematician Archimedes had devised, yielded to 
siege in 212 B.c.,, as Carthage did in 202 B.c. Then, after 200 B.c., the Roman armies 
turned eastward into Greece and Asia Minor. Greece proper was conquered in 146 B.c., 
and by 64 B.c. Mesopotamia had fallen before the Roman legions. On the Ides of March in 
44 B.c., the daggers of Brutus, Cassius, and their fellow conspirators brought an abrupt 
end to the reign of Julius Caesar. After Caesar’s death, the Roman world was ruled by 
Caesar’s grandnephew Octavian (who later received the honoric title Augustus) in the 
West; and in the East by Mark Antony in association with Cleopatra, Queen of Egypt. In 
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the inevitable clash with Antony, Octavian’s general Agrippa won a decisive naval battle 
at Actium off the west coast of Greece in 31 B.C. The suicides of Antony and Cleopatra 
in the following year ended the Ptolemaic dynasty. Nothing remained for Octavian but 
to incorporate Egypt into the dominions of the Roman people. 

On August 1, 30 B.Cc., Octavian entered Alexandria in triumph. He visited the tomb 
of Alexander the Great, laying a crown of gold upon the glass cof n and scattering 
owers to pay his respects. The Macedonian king whose body lay before him had lived 
only to the age of 32. Octavian at 32 was now the sole ruler of a world-state stretching 
from the Euphrates to Scotland and from the Danube to the Sahara. 

With the passing of Cleopatra, Egypt was reduced to the status of a province in the 
Roman Empire. During Octavian’s reign the empire consisted of Italy and more than 30 
provinces of varying size and importance. Egypt was a Roman province of a peculiar 
kind; it was like a vast private estate of the emperor. With the annual sailing of the 
grain eet from Alexandria, the country could send enough grain to satisfy Italy’s needs 
for four months of every year. Because an ambitious Egyptian governor might try to 
starve out Rome itself, Octavian decided that it would be unsafe to put such manifest 
temptation in the hands of a senator. He determined instead, against all tradition, to rule 
the land through a military commander, whom he titled the Prefect of Alexandria and 
Egypt. Further, he ordained that no senator should set foot in the new province without 
the emperor’s express permission. 

The beginning of Roman rule brought a period of tranquility to Alexandria, in which 
the city enjoyed reasonable prosperity. It was the second city of the empire and still the 
greatest port on the Mediterranean Sea, with an active trade reaching westward and north- 
ward to Italy, Greece, and Asia Minor, and eastward as far as India. With some justi - 
cation, Edward Gibbon, in his six-volume Decline and Fall of the Roman Empire (1776) 
could say, “If a man were called to x the period in the history of the world during which 
the condition of the human race was most happy and prosperous, he would, without hesi- 
tation, name that which elapsed from the death of Domitian to the accession of Commodus 
(96-180 A.D.).” For Rome at its height brought to the Mediterranean peoples the blessings 
of Pax Romana, a durable peace the like of which had not previously been seen over so 
large an area and has never been seen again. With the passage of time, unfortunately, 
this sense of security was rarely to exist in Alexandria. The story of Roman Egypt is 
a sad record of short-sighted exploitation by an absentee landlord, leading inevitably to 
economic distress, mismanagement, and constant civil unrest. The population of Alexan- 
dria was a mixture of different cultures and ethnic groups—Greeks, Christians, Jews, 
and native Egyptians—who, it became increasingly clear, were unable to live together 
in one society without the subjugation of one group by another. By A.D. 200, the city 
was plagued by large, unruly mobs who at the slightest provocation sought to vent their 
frustrations in brawls and bloodshed. The relative stability of the 300-year reign of the 
Ptolemies had given way to an era of street riots and political confusion, during which 
the commercial and intellectual glories of Alexandria slowly but surely deteriorated. 

The question of when and why Greek mathematics began to wane is both contro- 
versial and complex. Although it is always perilous to x dividing lines in the study 
of history, one may safely say that under Roman rule the overall picture was one of 
declining mathematical activity and originality. The new masters of the Mediterranean 
were a practical and utilitarian people, who never showed any inclination or aptitude 
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for extensive theoretical studies. It is remarkable that although the Roman and Greek 
civilizations existed over roughly the same centuries—750 B.C. to A.D. 450—in all that 
time there appeared no Roman mathematician of note. The chief Roman concern was 
the application of arithmetic and geometry to impressive engineering projects: viaducts, 
bridges, roads that survive even today, public buildings, and land surveys. Even among 
the Roman engineers, the small amount of mathematics they required could be applied 
in practice without any grasp of the theory behind it. Agrippa for instance, in carry- 
ing out Julius Caesar’s plan of surveying the empire, was obliged to call in specialists 
from Alexandria to carry out the measurements. Cicero’s attitude illustrated the Roman 
intellectuals’ contempt for theoretical knowledge. In Tuscalan Disputations he recorded: 


The Greeks held the geometer in the highest honor; accordingly nothing made more brilliant 
progress among them than mathematics. But we have established as the limits of this art its 
usefulness in measuring and counting. 


It would be wrong to conclude that Alexandrian mathematics immediately deteri- 
orated with Roman neglect, or that the intellectual stagnation could not be temporarily 
arrested by exceptional individuals working in particular elds. There were occasional 
rallies, as in the period 250-350, when the extraordinary talents of Diophantus and Pappus 
succeeded in making their age a “silver age” of Greek mathematics. But cultural interests 
in the Roman world were by this time so completely alienated from mathematics that 
their brilliant work aroused but slight and passing attention. 


The Spread of Christianity 


Soon after the foundation of the Roman Empire, a new movement developed in 
Alexandria, and also in many other parts of the empire, which was to accelerate the 
demise of Greek learning. This was the development of Christianity. The new religion 
began as a sect within Palestinian Judaism, spread throughout the Roman world in spite 
of sporadic but repeated imperial repression, and nally won of cial recognition as the 
religion of the empire. This reversal in condition, from enemy of the government to 
subsidized state religion subordinate to the emperor, was to transform the future of 
Europe and the Mediterranean world. 

It seems that initially the Christians were merely an annoyance to the Roman state in 
their stiff-necked refusal to acknowledge the divinity of the emperor, and the movement 
was allowed to develop with little interference. In the second and third centuries, as the 
Roman Empire was racked with internal crises and frequent invasions from without, the 
Church became a scapegoat on which to blame these catastrophes. As one Church father 
of the time, Tertulian, observed: 


If the Tiber reaches the walls, if the Nile fails to reach the elds, if the heaven withholds its 
rain, if the earth quakes, if there is famine, if there is pestilence, at once the cry is raised, 
“The Christians to the lions!” 


When in 249-250 the Germanic tribes momentarily broke through the frontier defenses 
(in 268 even taking Athens for a short time), the emperor issued an order that all citizens 
should worship the traditional gods of the Roman state to gain divine support in this time 
of trouble. The Christians could not make the necessary sacri ces; the result was a series 
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of savage outbursts of violence against them. The Church was still relatively small and 
unin uential, comprising not more than one-third of the population in the Greek-speaking 
eastern part of the empire and less than 10 percent of the Latin-speaking inhabitants in 
the west. Had the repressions continued for a longer time the growth of the Christian 
movement might well have been slowed or even stopped. As it was, most emperors felt 
that in desperate times it was better to conciliate factions than to identify scapegoats. 
Even the most extended and sweeping persecution, the Great Persecution (303) initiated 
by Diocletian, was almost entirely restricted to the eastern empire, lasting for 8 years in 
its European provinces and 10 years in North Africa and Asia. 

The fourth century saw the conversion to Christianity of a Roman emperor and the 
subsequent imposition of Christianity as the single of cial religion in the entire empire. 
One of the principal instigators of the Great Persecution, the Emperor Galerius, who died 
in 311, repented while mortally ill. Apparently thinking that the god of the Christians was 
punishing him, he issued an edict of universal toleration, which not only ended active 
persecution but also made Christianity a legal religion for the rst time. Constantine 
the Great, who came to the throne in 312, went further; he became the rst emperor to 
adhere personally to the Christian faith. Later in life Constantine recounted that while 
crossing the Alps, some time before his conquest of Italy, he had seen a aming cross 
in the sky with the words, “By this sign you shall conquer.” It was also reported that the 
day before his victorious battle of the Milvian Bridge outside Rome, he was bidden, in 
a dream, to mark the shields of his troops with some symbol of Christianity. Although 
Constantine made Christianity a favored religion, he realized that the vast majority of 
his subjects was pagan, and he did not try to make his religion the only recognized one. 
Like many Christians of the time, Constantine himself put off his baptism until he lay 
on his deathbed—when presumably he could sin no more. In 392, Emperor Theodosius, 
a devout Christian, promulgated laws closing all the pagan temples in the empire and 
forbidding the exercise of pagan ceremonies of any kind, even those conducted in the 
privacy of the home. By the time Theodosius died, in 395, the empire was of cially 
Christian. 

By the fourth century, the great days of Greek mathematical thought were past. 
Scholars were beginning to turn their intellectual interests and energies to the debates on 
theological questions. The spirit of the early Church was not a spirit of scienti c inquiry, 
for doctrines of faith were not demonstrable in terms of logic. Christianity looked inward 
to the mysteries of the soul, not outward to the mysteries of the natural world. Most of 
the signicant Christian thinkers of the fourth century ridiculed physical science and 
mathematics, promoting the Bible as the source of all knowledge. The position taken by 
Saint Augustine was emblematic of an age that preferred revelation to reason: “The words 
of the Scripture have more authority than the whole human intellect.” Certainly the idea 
that truth depends on divine revelation is not uniquely Christian; nevertheless, the recent 
success of the new religion created a climate of opinion increasingly hostile to pagan 
scientists and scholars. Whereas Christians were formerly persecuted, they now took steps 
to apply against paganism the proscriptions once enforced against them. Unfortunately, 
all Greek learning was identi ed with paganism, and Alexandrian mobs could rely on the 
encouragement of the Roman emperors as they looted libraries as well as pagan temples. 
In a period of growing antirationalism, the destruction of ancient learning was of little 
consequence to the majority of the people. The days of the Museum as an island of 
reason in a sea of ignorance were nally at an end. 
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Constantinople, A Refuge for Greek Learning 


The next few centuries were unhappy times for the empire as a whole. There were 
constant civil wars as one usurper after another rose to claim the title of emperor. Few 
successful claimants maintained themselves on the throne for as long as 10 years. No 
single emperor was strong enough to deal with external threats and internal usurpations 
in every part of the empire at the same time, so Constantine was forced in 330 to found a 
new Christian capital on the old site of Byzantium. He renamed it Constantinople, which 
remained its name until 1930; the city then became known as Istanbul. After 330, the em- 
pire was more or less permanently divided into an eastern and a western half. In the fth 
century, the Roman state in the west disintegrated before the onslaught of the invading 
Germanic peoples, the so-called barbarian invasions. First, Britain was overrun by the in- 
vading Saxons. Then the Vandals and kindred tribes ravaged Gaul and moved into Spain. 
Finally the Visigoths, followed by the Huns under Attila, sacked Italy. By this time the 
Church, in the person of the bishop of Rome, had taken the place of the emperor as the de- 
fender of the eternal city; twice, in 452 and again in 455, the pope went out from Rome to 
negotiate with the barbarian chiefs and implore them to spare the capital. The year 476 is 
taken by most historians as the symbolic end of the western empire. For then, the imperial 
forces (by now entirely German) elected one of their own generals to replace the reigning 
emperor and to rule under the title King of the Germans in Italy. In truth, the death knell 
of the empire had sounded years before; there was a visible lack of loyalty to empire 
and emperor, and by the fth century, few cared to save the Roman state in the West. 

The eastern territories around Constantinople, which had been largely spared these 
invasions, remained independent and isolated for nearly a thousand years after the empire 
in the West had slipped into the hands of the Germans. While Europe was blanketed with 
barbarism and general illiteracy, the spark of Greek learning was kept alive in the Eastern 
Roman, or Byzantine, Empire. Science and mathematics, to be sure, were as dormant in 
one half of the empire as the other. But a knowledge of the Alexandrian tradition never 
completely died out in the East; although Byzantine scholars did not attempt original 
research on their own account, they were actively engaged in preserving and multiplying 
copies of the works of antiquity. Eight centuries would elapse before Western Europe had 
a second opportunity to acquaint itself with the treasures of Greek civilization. Without 
the efforts of the Byzantine copyists, most of the ancient scientic and literary texts 
would have been lost forever. There might never have been a Renaissance. 


From the time of the discovery of irrational 


5 The Arithmetica numbers, Greek mathematics had veered away 


from the purely arithmetical approach. One re- 
Diophantus’s Number Theory sult was that all algebraic problems, even to 

the solution of simple equations, were cast in 
a clumsy and in exible geometric mold. With Diophantus, next to Pappus the last great 
mathematician of classical antiquity, came an emancipation of algebra. 

Practically nothing is known of Diophantus as an individual, save that he lived 
in Alexandria about the year 250. Although his works were written in Greek and he 
displayed the Greek genius for theoretical abstraction, Diophantus was most likely a 
Hellenized Babylonian. What personal particulars we have of his career come from the 
wording of an epigram problem (apparently dating from the fourth century) to the effect: 
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Woodcut of the temple of knowledge, showing the gradations from the Seven 
Liberal Arts to the theology of Peter Lombard. (From Margarita Philosophica (1508) of 
Gregor Reisch.) 


His boyhood lasted for : of his life; his beard grew after + more; after z more he 
married, and his son was born ve years later; the son lived to half his father’s age and 
the father died four years after his son. If x was the age at which Diophantus died, the 


equation becomes 
Dope etix $544e44e% 
6 12 7 2 te. 


and he must have reached an age of x = 84, but in what year or even in what century is 
not certain. 
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The great work on which the reputation of Diophantus rests in his Arithmetica, 
which may be described as the earliest treatise devoted to algebra. Only 6 books out of 
the original 13 have been preserved; the missing books were apparently lost at a very 
early date, probably before the tenth century, for there is no indication that the Arabs 
ever possessed them. Of the other works attributed to Diophantus, we know little except 
for their titles. Fragments of a tract on polygonal numbers have come down to us, and 
the Arithmetica alludes to the existence of a collection of theorems referred to as The 
Porisms, but this is lost in its entirety. 

Like the Rhind Papyrus, the Arithmetica is an assortment of individual problems, 
189 in all, with their solutions. The apparent object was to teach the method of solution 
of certain problems in which it is required to nd rational numbers satisfying prescribed 
conditions. First, a word about the notation. Before Diophantus, algebra was rhetorical, 
that is to say, the results were reached by verbal argument without recourse to symbols or 
abbreviations of any kind. One of Diophantus’s main contributions was the “syncopation” 
of algebra. “Syncopated algebra,” as it is called, is more a case of shorthand for expressing 
much used quantities and operations than of abstract symbolism in our sense. Diophantus 
had stenographic abbreviations for the unknown, successive powers of the unknown up 
through the sixth, equality, subtraction, and reciprocals. 

Instead of our customary x, he used the symbol ¢ for unknown quantities; this is 
perhaps a fusion of ap, the rst two letters of arithmos, the Greek word for “number.” 
The square of the unknown was denoted by A”, the rst two letters of the word dunamis, 
meaning “power.” Similarly, KY represented the cube of the unknown quantity, com- 
ing from the Greek word kubos, for “cube.” For higher powers, he used the following 
abbreviating symbols: 


A‘ A (for square-square) indicates x*, 
AK” (for square-cube) indicates x°, 


KK” (for cube-cube) indicates x°. 


Diophantus did not go beyond the sixth power, since he had no occasion to use a higher 
power in solving any of his problems. 

The sign for subtraction was 7 something like an inverted w; and ¢ acted as an equals 
sign, connecting two sides of an equation. He had no symbol for addition but relied on 
juxtaposition, that is, putting terms alongside one another. In Diophantus’s system of 
notation, the coef cients of the different powers of the unknown were represented by 
ordinary numerals following the power symbol: 


K"35 means 35x°. 


(To avoid confusion, we have retained Arabic numerals; Diophantus would have written 
K* Xe for 35x, where A& stands for 35.) When there were units in the addition, they were 
indicated by M—an abbreviation for the Greek work monades, meaning “units”—with 
the appropriate numeral: 


K*35M12 means 35x° + 12. 


Because Diophantus had no addition symbol, in an expression containing several terms 
with different signs, he had to place all negative terms together after the sign for 
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subtraction. Thus, the expression x? — 5x* + 8x — 2 would appear as 
K*1 684 A°5M2. 


Since most of the problems in the Arithmetica require the determination of several 
quantities, Diophantus worked under a serious notational handicap. For want of other 
symbols besides ¢ to represent variables, he was compelled to reduce all his problems, 
no matter how complicated, to equations in one unknown. Either he expressed the other 
unknown quantities in terms of the one symbol, or he assigned them arbitrary values con- 
sistent with the conditions of the problem. All these eliminations were done beforehand, 
as a preliminary to the actual work. 

Only positive rational answers were admitted, and Diophantus felt satis ed when 
he had found a single solution. (It made no difference to him whether the solution 
was integral or rational.) Diophantus had no concept of negative quantities, although 
he allowed for subtraction as an operation. Thus, in Problem 2 of Book V, we nd 
his description of the equation 4x + 20 = 4 as “absurd,” because it would lead to the 
“impossible” solution x = —4. As he said, “The 4 ought to be some number greater than 
20.” It will be seen that his methods varied from case to case, and there was not a trace 
in his work of a systematic theory. Each question required its own special technique, 
which would often not serve for the most closely related problems. 


Problems from the Arithmetica 


We shall now describe several typical problems from the Arithmetica, though in 
modern notation. These will tell you more about the ingenuity of Diophantus’s methods 
than any summary of this work could hope to do. 


1. Book I, Problem 17. Find four numbers such that when any three of them are 
added together, their sum is one of four given numbers. Say the given sums are 20, 
22, 24, and 27. 
Let x be the sum of all four numbers. Then the numbers are just x — 20, x — 22, 
x — 24, and x — 27. (For instance, if (1) + (2) + (3) = 20, then when (4) is added 
to both sides of this equation, x = (1) + (2) + (3) + (4) = 20+ (4) or 
(4) = x — 20.) It follows that 


2 20 = SH HEH} 


and so 3x = 93, or x = 31. The required numbers are therefore 11, 9, 7, and 4. 


2. Book II, Problem 8. Divide a given square number, say 16, into the sum of two 
squares. 
Let one of the required squares be x”. Then 16 — x” must be equal to a square. 
Here Diophantus was satis ed to choose a particular instance of a perfect square, 
in this case the number (2x — 4)’, so that 


16 — x” = (2x — 4). 


Diophantus’s choice of (2x — 4)? was designed to eliminate the constant terms 
from the foregoing equation; he could just as well have picked (3x — 4)”. The 
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result was the equation 


5x* = 16x, 
with (positive) solution x = . Therefore one square would be oe and the other, 
16 — 28 — 14, 
25 25 

Book II, Problem 20. Find two numbers such that the square of either added to the 
other gives a square. 

Diophantus chose the numbers to be x and 2x + 1. If these are used, the square 
of the rst plus the second automatically becomes a square, no matter what the 
value of x, thereby satisfying one condition: 


x? 4+ (2x +1)=(«4+ 1). 
The square of the second number plus the rst is 
(2x +1 +x = 4x7 4+ 5x41. 


To make this expression into a square, Diophantus assumed that it would equal 
(2x — 2). The effect would be to produce a linear equation in x, which would also 
happen if one used (2x — 3)* or (2x — 4)? instead of (2x — 2)*. Then 


4x? +5x +1 = (2x —2) = 4x? — 8x +4, 
leading to the equation 13x = 3, or x = * The desired numbers are ~ and z. 


Book II, Problem 13. Find a number such that if two given numbers, say 6 and 7, 
are subtracted from it, both remainders are squares. 

Call the number x, so that the problem is one of making x — 6 and x — 7 into 
perfect squares. Let 


? and x-7=b. 


x—-6=a 


Here, we see an approach that comes close to a “method” in Diophantus’s work: 
the use of the algebraic identity 


a> —b? =(a+b\(a —b). 


The difference a* — b* = (x — 6) — (x — 7) = 1 is resolved into two suitably 
chosen factors, from which a and b can be obtained. If one takes 2 and 5 as the 
factors, setting 

a+b=2 and a-b=}, 
then a = 3 and b= 3. It follows that 


x-6=%, x-7= 


1 16’ 


whence x = a is the number sought. 


Book III, Problem 17. Find two numbers such that their product added to either one 
or to their sum gives a square. 
Call the numbers in question x and 4x — 1. Then 


x(4x — 1) +x = 4x? = (2xy’, 
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so that one condition is satis ed immediately. Now it is also required that each of 
the expressions 

x(4x — 1)+ (4x — 1) +x = 4x7 4+ 4x -1 
and 
x(4x — 1) + (4x — 1) = 4x? 43x -1 


has to be a square. Diophantus’s method of solution again depends on using the 
identity 


a’ —b* =(a+b)a— bd). 


It involves taking the difference between 4x? + 4x — 1 and 4x? + 3x — 1, namely, 
x, separating this into the two factors 4x and +, and equating one factor with a +b 
and the other with a — b. But 


at+b=4x, a—-b= ‘ 
implies that a = 5(4x + 7) and b = 5(4x - i): Thus, 
4x? + 4x — 1 = [$(4x + 4)? 
4x? + 3x —1 = [4(4x — DP, 


from either of which equations we arrive at the value x = ae The two numbers 


65 36 
are therefore _ and x4 


Book III, Problem 21. Divide a given number, for instance 20, into two parts and 
nd a square whose addition to either of the parts produces a square. 
Let (1) and (2) be the two parts of 20, and take 


x? 4+2x+1l=(x+17 
to be the added square. The conditions require that each of the expressions 
(eG =e i 
and 
(2) + (x? + 2x +1) 


should be squares. Diophantus observed that when x? + 2x + 1 was added to either 
2x +3 or 4x + 8, a perfect square resulted: 


(2x +3) + (x? + 2x +1) = (x +29, 
and 
(4x + 8) +(x? + 2x +1) =(x +3/. 


Taking 2x + 3 and 4x + 8 as the two parts of 20 gives 6x + 11 = 20, whence 


a 3. The two parts of 20 are therefore 6 and 14, while the added square is 2. 
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a2 


There are other possibilities. If Diophantus had called the square to be added x 
and used the relations 


(4x + 4) +x? = (x +29 
(6x +9) +x? = («x +3), 


then the two required parts of 20 would be ss and ae 


Book VI, Problem 19. Find a right triangle such that its area added to one of its 
legs gives a square and its perimeter is a cube. 

Using the formula for right triangles attributed to Pythagoras, Diophantus called 
the sides 


Q2x+1, 2x? 42x, 9 2x? 42x41. 
The perimeter of the triangle would then be 
Ax? + 6x +2 = 2(2x + 1)(x + 1). 


It is dif cult to make a quadratic a cube, and Diophantus, noticing the factor x + 1 
in the expression for the perimeter, considered in turn the triangle 

2x +1 5 2x? 4+ 2x +1 

x+1’ : x+1 
obtained by dividing each of the sides by x + 1. This new triangle would have 
perimeter 2(2x + 1) and area (2x* + x)/(x + 1). Adding the latter value to 
(2x + 1)/(x + 1), one nds _ that 

2x? +x ax+1  Qx+)e+) 
x+1 > x+1 

The problem requires x to be chosen so that 2x + | is a square and 2(2x + 1) isa 


cube, that is, nding a cube that is twice a square. The obvious choice is 
2(2x +1) = 8, orx = 3, which leads to the triangle with sides g, 3, and q. 


=2x+1. 


Diophantus was not the rst to propose or solve 


Diophantine Equations in Greece, indeterminate problems of second degree. Arith- 
India, and China metical problems clothed in poetic garb were a 
common type of mathematical recreation long be- 

The Cattle Problem of fore his time. Perhaps the most dif cult of these— 

Are ceadce in the sense that it leads to excessively large 


numbers—is the famous “cattle problem.” This ap- 
pears in a memorandum that, according to its head- 


ing, Archimedes sent to Eratosthenes with instructions that it “be solved by those in 
Alexandria who occupy themselves with such matters.” In essence, the problem is to 
calculate “the number of oxen of the Sun, which once grazed upon the isle Thrinacia 
[Sicily].” The wording appears to hark back to the twelfth book of Homer’s Odyssey, in 
which the following line occurs: “Next you will reach the island of Thrinacia, where in 
great numbers feed many oxen and fat sheep of the Sun.” 


The cattle problem requires that one nd the number of bulls and cows of each 


of four colors—eight unknown quantities. The rst part of the problem connects the 
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unknowns by seven simple linear equations. To add to the problem’s complexity, the 
second part subjects the unknowns to the additional conditions that the sum of a certain 
pair must be a perfect square while the sum of another certain pair must be a triangular 
number. To be specic, if W, X, Y, and Z denote the numbers of white, black, spotted, 
and brown bulls, and if w, x, y, and z are the numbers of cows of the corresponding 
colors, then the relations among the numbers of bulls are 


5 9 13 
W=2X4+Z, X=ZY+Z, Y=2W+zZ, 
and among the numbers of cows, 
w= L(X +x), x= F(¥+y), y= (Z+2), z=2(W+w); 


and also W + X is a square number and Y + Z is a triangular number. When reduced 
to a single equation, the problem involves solving the equation 


x? — 4,729,494y" = 1, 


where y is a multiple of 9314. The problem led to what would later be known as the Pell 
equation. The name originated in the mistaken notion of Leonhard Euler that the English 
mathematician John Pell (1611-1685) was the author of the method of solution that was 
really the work of his countryman Lord Brouncker. Although the historical error has long 
been recognized, Pell’s name is the one that is indelibly attached to the equation. 

Many tried to solve the cattle problem, but the large numbers required to satisfy 
the nine conditions discouraged investigators. It was not until the 1880 publication of 
an article by A. Amthor that there was serious progress. By expanding ./4,729,494 as a 
continued fraction, Amthor concluded that the number of cattle must be 776..., where 
the dots represent 206,542 unknown decimal digits. In 1889 a surveyor and civil engineer, 
A. H. Bell, undertook to determine the exact gures needed to express Amthor’s result. 
After nearly four years of computation by himself and two others who constituted the 
Hillsboro Mathematics Club of Hillsboro, Illinois, Bell speci ed what he believed to be 
32 of the leftmost digits and 12 of the rightmost digits. The rst complete solution of 
Archimedes’ problem was given by H. C. Williams, R. A. German, and C. R. Zarnke in 
1965, using a computer. They conrmed that the total number of the “cattle of the sun” 
is an enormous integer written in 206,545 digits, the rst 30 and last 12 of which Bell 
correctly calculated. 

A clearer idea of the magnitude of the answer can be obtained by considering the 
space it would take to print it. If we assume that 15 printed digits take up 1 inch of 
space, the number would be over i of a mile long. The resulting value is so large that 
the island of Sicily, whose area is about 7 million acres, could not contain all the cattle. 
Moreover, there are 1397 bulls for each cow, a ratio that could lead to serious dif culties 
in herd management. 

We have seen that Archimedes speculated about very large numbers, for the Sand- 
Reckoner was an attempt to prove that his system could be used to express the number of 
grains of sand in a sphere the size of the universe. But given the magnitude of the values 
required to ful ll the conditions of the cattle problem and the great dif culty inherent 
in the work, it is hardly likely that the famous geometer of Syracuse or the Alexandrian 
mathematicians came anywhere near its solution. They probably displayed the equations 
involved and left the matter at that. 
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Early Mathematics in India 


Owing to its geometric signi cance, it is not surprising that the Pythagorean equa- 
tion x? + y* =z? received attention earlier than the conceptually simpler rst-degree 
equation ax + by =c, where a, b, and c are known integers. Although the theory re- 
quired for solving the latter equation is found in Euclid’s Elements, it does not appear 
in the extant works of subsequent Greek writers. Possibly Diophantus considered the 
equation too trivial to be included in the Arithmetica. Most of his problems involved 
making expressions of rst- or second-degree terms into squares or cubes. The earliest 
attempts to solve the indeterminate equation ax + by = c by a general method were made 
in India, beginning about the fth century, in the work of the Hindu mathematicians 
Aryabhata (born 476), Brahmagupta (circa 600), Mahavira (circa 850), and Bhaskara 
(1114-1185). 

Alexander’s invasion of India, and the founding of Greek kingdoms within India 
and on its borders, immensely stimulated the communication of ideas between Asia and 
the Mediterranean world. It seems likely that Indian mathematics was directly in uenced 
and inspired by the Greeks at an early stage and affected by Chinese traditions at a later 
time. The whole question of which methods were evolved by the Indians themselves is 
the subject of much conjecture. Initially, their mathematics developed as an outgrowth 
of astronomy, and it is no accident that a substantial part of what has come down 
to us appeared as chapters in works on astronomy. Indeed there seem to have been 
no separate mathematical texts. Because the writers lacked algebraic symbolism, they 
expressed problems in verse and with a owery style. This both pleased and attracted 
readers and aided the memory. Little emphasis was placed on demonstrations, so that 
sometimes there would be only an illustrating gure and the author’s comment, “Behold.” 

In the period from 400 to 1200, the Indians developed a system of mathematics 
superior, in everything except geometry, to that of the Greeks. Among those who con- 
tributed to the subject, the noted astronomer Aryabhata investigated the summation of 
arithmetic and geometric series, drew up a table of sines of angles in the rst quadrant, 
and tried to solve quadratic and linear indeterminate equations. In the Aryabhatiya, he 
calculated the value of z as follows: 


Add four to one hundred, multiply by eight and then add sixty-two thousand; the result is 
approximately the circumference of a circle of diameter twenty thousand. By this rule the 
relation of the circumference to diameter is given. 


In other words, 
__ circumference — 8(100+ 4) + 62,000 62,832 


"diameter 20,000 ~ 20,000 
a remarkably close approximation. Brahmagupta, who lived more than a century after 
Aryabhata, based his work largely on what his illustrious predecessor had done. His 
practice, however, of taking /10 as the “neat value” of 2 was somewhat of a step 
backward. He introduced negative numbers (the term mentioned was equivalent to our 
word negative) and developed a satisfactory rule for obtaining two roots of a quadratic 
equation, even in cases in which one of them was negative. Brahmagupta also gave the 
formula A = ./(s — a)(s — b)(s — c)(s — d) for the area of a cyclic quadrilateral whose 
sides are a, b, c, and d and whose semiperimeter is s. 


= 3.1416, 
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The most enduring contribution of Aryabhata and Brahmagupta was to the study 
of indeterminate equations, the favorite subject of Diophantus. Although they repeated 
many of Diophantus’s problems, the approach was different. Where Diophantus sought 
to solve equations in the rational numbers, the Indian mathematicians admitted only 
positive integers as solutions. Nowadays, in honor of Diophantus, any equation in one 
or more unknowns that is to be solved for integral values of the unknowns is called a 
diophantine equation. The term is somewhat misleading, for it seems to imply that a 
particular equation is under consideration, whereas what is important is the nature of the 
required solutions. 

Although Aryabhata apparently knew of a method for nding a solution of the 
linear diophantine equation ax + by = c, Brahmagupta was the rst to obtain all possible 
integral solutions. In this he advanced beyond Diophantus, who had been content to give 
one particular solution of an indeterminate equation. 

The condition for solvability of this equation is easy to state; the diophantine equation 
ax + by =c admits a solution if and only if d|c, where d = gcd (a, b). We know that 
there are integers r and s for which a = dr and b = ds. If a solution of ax + by =c 
exists, so that axo + byo = c for suitable xo and yo, then 


c= axo + byo = drxy + dsyp = d(rxo + syo), 


which simply says that d|c. Conversely, assume that d|c, say c = dt. Now, integers x 
and yo can be found satisfying d = axo + byo. When this relation is multiplied by t, we 
get 


c = dt = (axy + byo)t = a(txo) + b(tyo). 


Hence, the diophantine equation ax + by =c has x = tx and y = typ as a particular 
solution. This proves part of the following theorem. 


The linear diophantine equation ax + by = c has a solution if and only if d\c, where 
d = gcd (a, b). If xo, vo is any particular solution of this equation, then all other 
solutions are given by 


as ue 
x =x) +-t, i a 
0 ] y= V0 ] 


for some integer t. 


Proof. To establish the second assertion of the theorem, let us suppose that a 
solution xo, yo of the given equation is known. If x’, y’ is any other solution, then 


axy + by) = c = ax' + by’, 
which is equivalent to 
a(x’ — x9) = b(vo — y’). 


There exist integers r and s such that a = dr, b = ds, and 


ab 
d(r,s)=ged{—,—)=1. 
gcd (r,s) = ge (5 =) 
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Substituting these values into the last-written equation and canceling the common 
factor d, we nd that 


r(x’ —x9) = s(vo — y). 


The situation is now this: r|s(vo — y’), with r and s relatively prime. By Euclid’s 
lemma, it must be the case that r|(vo — y’); that is, yy — y’ =rt for some integer f. 
Substituting, we obtain 


x’ — x9 = st. 
This leads us to the formulas 


x’ =x — 2 
= a ae a 


j“<ewes 
d 


It is easy to see that these values satisfy the diophantine equation regardless of the 
choice of the integer ¢; for, 


b a 
ax'+ by'=a (x0+ a) +b (y— <1) 


ab 


ab 
= (axo + byo) + ( - “) t 


=c+0-t=c. 


Thus, there are in nitely numerous solutions of the given equation, one for each 
integral value of f. 


| 


Bhaskara (1114-1185) was the leading Indian mathematician of the twelfth century. 
His most celebrated work is the Siddhanta Siromani (Head Jewel of an Astronomical 
System), written in 1150. The contents became known to western Europe through its 
Arabic translation in 1587. The Siddhanta Siromani is arranged in four parts, of which 
the rst two, the Lilavati (The Beautiful) and the Vijaganita (Root Extractions), deal with 
arithmetic and algebra, respectively. The rst part is named after Bhaskara’s daughter, 
and many of his fanciful problems are propounded in the form of questions addressed to 
her. For instance: 


One- fth of a swarm of bees is resting on a kadaba bush and a third on a silindha bush; 
one-third of the difference between these two numbers is on a kutaja, and a single bee has 
own off in the breeze, drawn by the odor of a jasmine and a pandam. Tell me, beautiful 
maiden, how many bees are there? 


Bhaskara was celebrated as an astrologer no less than as a mathematician. There is 
a legend that astrologers predicted that his daughter Lilavati would never wed, but he 
calculated a lucky day and hour for her marriage. As the hour for this event approached, 
the young girl was bending over a water clock, when a pearl dropped unnoticed from her 
wedding headdress and chanced to stop the out ow of water. So the propitious moment 
passed, and since any other time was prophesied as being sure to bring misfortune, 
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Lilavati never married. To console the unhappy girl, Bhaskara promised to give her 
name to a book which, “will last to the latest times.” 

By Bhaskara’s time, mathematics in India had long since evolved from its purely 
utilitarian function, and problems were often posed simply for pleasure. The following 
is a typical problem taken from the Lilavati: 


Say quickly, mathematician, what is the multiplier by which 221 being multiplied, and 65 
added to the product, the sum divided by 195 becomes exhausted [leaves no remainder]? 


In present-day notation, the problem is equivalent to nding integers x and y that will 
satisfy the linear diophantine equation 221y + 65 = 195x, or 


195x — 221y = 65. 
Applying Euclid’s algorithm to the evaluation of gcd (195, 221), we nd that 
221 = 1-195 4+ 26, 
195 =7- 26+ 13, 
26 = 2-13, 


whence gcd (195, 221) = 13. Because 13|65, a solution of our equation exists. To obtain 
13 as a linear combination of 195 and 221, we work backward through the preceding 
calculations: 


13 = 195 — 7-26 
= 195 — 7(221 — 195) 
= 8-195 +(—7)221. 
On multiplying this relation by 5, we obtain 
65 = 40 - 195 + (—35)221, 


so that x = 40 and y = 35 provides one solution to the diophantine equation in question. 
All other solutions are expressed by 


x = 40+ (=34)r = 40-171, 


y = 35 — (8)t = 35 - 151, 


for any integer ¢. In the Lilavati, Bhaskara arrived at the values 6 and 5 for x and y, 
respectively, and noted that there were many solutions; the equation, he added, was also 
satis ed by x = 57 and y = SO. 


The Chinese Hundred Fowls Problem 


There is evidence of a close acquaintance between Hindu and Chinese mathematics at 
this time, although the question of priority has been disputed. A testimony to the algebraic 
abilities of Chinese scholars is provided by the contents of the Mathematical Classic of 
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Chang Ch’iu-chien (sixth century), a contemporary of Aryabhata. This elaborate treatise 
contains one of the most famous problems in indeterminate equations, in the sense of 
its transmission to other societies—the problem of the “hundred fowls.” This problem, 
which occurs in the works of Mahavira and Bhaskara, states: 


If a cock is worth 5 coins, a hen 3 coins, and three chickens together 1 coin, how many 
cocks, hens and chickens, totaling 100, can be bought for 100 coins? 


In terms of equations, the problem would be written (if x equals the number of cocks, y 
the number of hens, and z the number of chickens): 


1 
Sa x+y+z= 100. 


Eliminating one of the unknowns, we shall wind up with a linear diophantine equa- 
tion in the other two unknowns, which is the case just discussed. Speci cally, since 
z= 100—x — y, we have 5x + 3y + 4(100 — x — y) = 100, or 


Tx +4y = 100. 


This equation has the general solution x = 4f, y=25—7t, which then makes 
z = 75 + 3t, where ¢ is an arbitrary integer. Chang himself gave several answers: 


x = 4, y= 18, z= 78; 
x= 8, y=11, z= 81; 
os —a ee y=4, z= 84. 


A little further effort produces all solutions in the positive integers. For this, f must be 
chosen to satisfy simultaneously the inequalities 


4t > 0, 25—7t > 0, 754+ 3t > 0. 


The last two of these are equivalent to the requirement —25 < ft < 34. Because ¢ must 
have a positive value, we conclude that ¢ = 1, 2, 3, leading to precisely the values Chang 
obtained. 

The type of word puzzle that involves simultaneous linear diophantine equa- 
tions has a long history, appearing in the Chinese literature as early as the rst 
century. In what is the oldest known instance of the “remainder theorem,” Sun-Tsu 
asked: 


There are certain things whose number is unknown. When divided by 3, the remainder is 
2; when divided by 5, the remainder is 3; when divided by 7, the remainder is 2. What will 
be the number of things? 


Thus, we are to nd an integer N that simultaneously satis es the three equations: 
N=3x+4+2, 
N=S5y +3, 
N=724+2, 
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where x, y, and z are integers. With regard to the rst equation, N — 3x = 2, our theorem 
tells us that 


N=8-—3t, x= 2.=t, 


for any integer ¢. If this value of N is substituted in the second equation of the system, 
we obtain the relation 


Sy+3t=5S. 
Here, the general solution is given by 
y= —5+3s, t=10-—5s, 
where s is arbitrary. The implication is that N will be of the form 


N =8 —3t = 8 —3(10 —5s) = —22 4 I5s. 


For N to satisfy the last equation of the system, we must have 
7z — 15s = —24, 
which leads to 
z = 48 — 15r, s = 24—-—7r. 
This yields in turn 
N = —22+4 15s = —22 4+ 15(24 — 7r) = 338 — 105r. 


All in all, NV = 338 — 105r provides a solution to the system of diophantine equations for 
any integer r. Sun-Tsu seems not to have been aware that there were an in nite number 
of solutions to this indeterminate problem, for N = 23 is the only value he gave. In fact, 
it is not at all certain that he had a general method of solution in mind. 

To conclude, let us mention that the Indians devoted considerable effort to solving 
indeterminate quadratic equations, particularly the misnamed Pell equation x* = 1 + ay’, 
and more generally x7 = c + ay”, where a is a nonsquare integer. Diophantus was fre- 
quently led to special cases of this equation in solving problems in his Arithmetica; in 
Problem 28 of Book II, for instance, he made 9 + 9y? equal to a square x? by taking 
x = 3y — 4. Brahmagupta discussed the equation x* = 1 + ay’, but its solution was rst 
effected, as far as we know, by Bhaskara. Brahmagupta said that a person who could, 
within a year, solve the equation x? = 1+ 92y* would be a good mathematician; for 
those times he must at least have been an efcient arithmetician, because x = 1151, 
y = 120 is the smallest solution in the positive integers. In his Lilavati, Bhaskara found 
particular solutions of x=1t+ ay” for the ve cases a = 8, 11, 32, 61, and 67. In the 
case of x? = 1+ 61y’, for example, the answers that were given, 


x = 1,776,319,049, y = 22,615,390, 


were the least positive solution. From one solution, an in nite number of integral solutions 
can readily be obtained using a rule Brahmagupta discovered. It amounts to the following. 
If p and q are one set of values of x and y satisfying x7 = 1 + ay? and p’ and q’ are the 
same or another set, then x = pp’ + aqq’ and y = pq’ + p’q give another solution. Thus, 
the solution x = 17, y = 6 of x? = 1+ 8y” leads to a second pair of values x = 577, 
y = 204, satisfying the equation. 
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5.3 Problems 


Solve problems 1-12, which are from the Arithmetica of 
Diophantus. 


1. 


Book I, Problem 16. Find three numbers such that 
when any two of them are added, the sum is one of 
three given numbers. Say the given sums are 20, 30, 
and 40. 


Book I, Problem 18. Find three numbers such that the 
sum of any pair exceeds the third by a given amount; 
say the given excesses are 20, 30, and 40. [Hint: Let 
the sum of all three numbers be 2x. Add number (3) 
to both sides of the equation (1) + (2) = (3) + 20 to 
get (3) = x — 10. Obtain expressions for (1) and (2) 
similarly.] 


Book I, Problem 27. Find two numbers such that their 
sum and product are given numbers; say their sum is 
20 and their product is 96. [Hint: Call the numbers 
10+ x and 10 —x. Then one condition is already 
satis e d.] 


Book I, Problem 28. Find two numbers such that their 
sum and the sum of their squares are given numbers; 
say, their sum is 20 and the sum of their squares is 
208. 


Book II, Problem 10. Find two square numbers 
having a given difference; say their difference is 60. 
[Hint: Take x? for one of the squares and (x + a)? 
for the other, where a is an integer chosen so that a? 
is not greater than 60.] 


Book II, Problem 12. Find a number whose 
subtraction from two given numbers (say, 9 and 21) 
allows both differences to be squares. [Hint: Call the 
required number 9 — x7, so that one condition holds 
automatically. | 


Book II, Problem 22. Find two numbers such that the 
square of either added to the sum of both gives a 
square. [Hint: If the numbers are taken to be x and 
x + 1, then one condition is satis e d.] 


Book Ill, Problem 12. Find three numbers such that 
the product of any two added to the third gives a 
square. [Hint: Let the numbers be x, x + 6, and 9, so 
that one condition is satis e d.] 


Book Ill, Problem 14. Find three numbers such that 
the product of any two added to the square of the 
third gives a square. [Hint: Let the numbers be x, 
4x + 4, and 1, so that two of the conditions are 
satis e d.] 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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Book IV, Problem 2. Find two numbers such that 
their difference and also the difference of their cubes 
are given numbers; say, their difference is 6 and the 
difference of their cubes is 504. [Hint: Call the 
numbers x + 3 and x — 3.] 


Book IV, Problem 26. Find two numbers such that 
their product added to either one gives a cube. 
[Hint: If the numbers are called 8x and x? — 1, then 
one condition holds, for 8x(x* — 1) + 8x = (2x)>.] 


Book VI, Problem I. Find a right triangle such that 
the hypotenuse minus either side gives a cube. 
[Hint: Consider the triangle with sides x? — 4, 4x, 
and x? + 4.] 


Which of the following diophantine equations cannot 
be solved? 


(a) 6x+5ly = 22. 
(b) 33x+ 1l4y = 115. 
(c) 14x +35y = 93. 


Determine all solutions in the integers of the 
following diophantine equations. 


(a) 56x + 72y = 40. 
(b) 24x + 138y = 18. 
(c) 221x+35y = 11. 


Determine all solutions in the positive integers of the 
following diophantine equations. 


(a) 18x+5y=48. 
(b) 123x+57y = 30. 
(c) 123x + 360y = 99. 


Alcuin of York, 775. A hundred bushels of grain are 
distributed among 100 persons in such a way that 
each man receives 3 bushels, each woman 2 bushels, 
and each child half a bushel. How many men, 
women, and children are there? 


Mahavira, 850. There were 63 equal piles of plantain 
fruit put together and 7 single fruits. They were 
divided evenly among 23 travelers. What is the 
number of fruits in each pile? [Hint: Consider the 
diophantine equation 63x + 7 = 23y.] 


Yen Kung, 1372. We have an unknown number of 
coins. If you make 77 strings of them, you are 50 
coins short; but if you make 78 strings, it is exact. 
How many coins are there? [Hint: If N is the 
number of coins, then N = 77x — 50 = 78y for 
integers x and y.] 


Christoff Rudolff, 1526. Find the number of men, 
women, and children in a company of 20 persons, if 
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21. 
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together they pay 20 coins, each man paying 3, each 
woman 2, and each child ;. 


Euler, 1770. Divide 100 into two summands such 
that one is divisible by 7 and the other by 11. 


Bhaskara, 1150. What number divided by 6 
leaves a remainder of 5, divided by 5 leaves a 
remainder of 4, divided by 4 leaves a 
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22. 


23. 


remainder of 3, and divided by 3 leaves a 

remainder of 2? 

Fibonacci, 1202. Find a multiple of 7 having the 
remainders of 1, 2, 3, 4, and 5 when divided by 2, 3, 
4, 5, and 6. 

Regiomontanus, 1436-1476. Find a number having 
remainders 3, 11, and 15 when divided by 10, 13, 
and 17. 


The Later Commentators 


The time of Diophantus brings us to the nal stages 
of Hellenistic mathematics. Although nominally 
the School of Alexandria continued to exist for 
The Mathematical Collection several hundred years more, the days of creative 
of Pappus scholarship were over. Those who follow Diophan- 

tus are known mainly for their commentaries on 
earlier treatises. There are, however, a few mathematicians of this period whose names 
deserve particular mention. The most notable, the last in a long line of accomplished 
geometers, is Pappus of Alexandria. Although nowhere near the equal of Archimedes, 
Apollonius, or Euclid, who ourished ve centuries earlier, Pappus towered above his 
contemporaries. 

The great work on which his reputation rests is the Mathematical Collection, origi- 
nally written in eight books, of which the rst and part of the second are missing. Only 
one of Pappus’s other writings has survived, and that in fragmentary form, namely, his 
commentary on Ptolemy’s A/magest. We can fairly well date when Pappus lived, for in 
his commentary on the A/magest, Pappus referred to an eclipse of the sun that took place 
in the year 320, and he spoke as though it were an eclipse he had recently seen. Proclus 
(who died in Athens in 485) quoted Pappus several times in his Commentary on the First 
Book of Euclid’s Elements, so that it is reasonable to infer that Pappus thrived in the rst 
half of the fourth century A.D. 

The Mathematical Collection of Pappus was intended to be a consolidation of the ge- 
ometric knowledge of its time. The books contain theorems of all kinds about proportion, 
solid geometry, and higher plane curves, and also contributions to mechanics—which was 
at that period regarded as part of mathematics. The design of the Collection was to give 
a synopsis of the contents on the great mathematical works of the past and then to clarify 
any obscure passages through various alternative proofs and supplementary lemmas. Not 
content with mere description of an earlier treatise, Pappus went on at several points to 
extend and generalize the results of his predecessor. The rst proposition of Book IV, for 
instance, contains Pappus’s generalization of the Pythagorean theorem on the square of 
the hypotenuse. Pappus himself made many notable contributions. One of these was the 
discovery that the quadratrix of Hippias could be obtained as the intersection of a cone of 
revolution with a right cylinder whose base was the spiral of Archimedes. Most striking 
is his theorem on the generation of a solid by the revolution of a plane area about an 
axis: The volume of the solid of revolution is equal to the product of the rotating plane 
area and the distance traversed by its center of gravity. 

Part of Book IV of the Collection is devoted to the three classical problems of 
antiquity: the quadrature of the circle, the duplication of the cube, and the trisection of 
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an angle. After reproducing the “solutions” of the various ancient geometers, Pappus 
virtually stated that the problems were impossible of solution under the terms in which 
they had been formulated by the Greeks, for they belonged neither among the “plane 
problems” nor among the problems solvable by straightedge and compass: 


The earlier geometers were not able to solve the aforementioned problem about the angle, 
when they sought to do so by means of planes [plane methods], because by nature it is solid; 
for they were not familiar with the sections of a cone, and for this reason were at a loss. 


The nineteenth century saw the curtain fall on the three famous problems of Greek 
geometry with formal proof of their insolubility under Platonic conditions. 

Though the Collection of Pappus is not of the same order as the earlier classics of 
the Alexandrian School, it is an invaluable record of parts of mathematics that would 
otherwise be unknown. Of all the extant Greek works, the Collection is richest in in- 
formation on the lost treatises of the ancient geometers and particularly on the missing 
books of Euclid and Apollonius. Many results of ancient authors are available to us only 
in the form in which Pappus preserved them. Despite Pappus’s attempt to arouse interest 
in the traditional geometry of the Greeks, this study was never effectively revived and it 
practically ceased to be a living interest. 


Hypatia, the First Woman Mathematician 


An unhappy consequence of the conversion of the fourth-century Roman emperors 
to Christianity was that the role of the persecuted was now shifted to the pagans. When 
the Greek temples were ordered razed, the immense library housed in the Temple of 
Serapis, consisting of over 300,000 rolls of manuscripts, fell prey to the vandalism of 
fanatics. With it, a painfully accumulated record of centuries of genius was heedlessly 
wiped away. Not content with eradicating “pagan science” by the torch, Christian mobs 
murdered many of the Museum’s scholars in the streets of Alexandria. Such was the fate 
of the rst prominent woman mathematician, Hypatia, daughter and pupil of Theon of 
Alexandria. 

Hypatia (370-415) was distinguished in mathematics, medicine, and philosophy and 
is reported to have written a commentary on the rst six books of Diophantus’s Arith- 
metica, as well as a treatise on Apollonius’s Conic Sections. From her father she had 
obtained a knowledge of the astronomical discoveries of Claudius Ptolemy, and she 
edited the Almagest of this great astronomer. She was also the unquestioned leader of 
the neo-Platonic school of philosophy and took part in the last attempt to oppose the 
Christian religion. As a living symbol of the old culture, she was destined to be a pawn 
in a struggle for political mastery of Alexandria. 

Hypatia lectured at the Museum on mathematics and philosophy, and her classes 
attracted many distinguished listeners. Among these was the philosopher Synesius of 
Cyrene, who was later to become bishop of Ptolemais. In the letters of Synesius that 
have come down to us, he always spoke of Hypatia in the highest terms, calling her 
“mother, sister, reverend teacher,” and praising both her learning and her virtue. In spite 
of support from Synesius and other Christians, the Christian leaders regarded Hypa- 
tia’s neo-Platonic philosophy as heretical. Her position was further threatened by her 
friendship with Orestes, Roman governor of the city and the only countervailing force 
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to Cyril, Bishop of Alexandria. The followers of Cyril spread rumors that Hypatia was 
drawing large crowds to her lecture hall, where under the guise of scholarship, she 
was expounding paganism—and that moreover, her in uence over Orestes was the only 
obstacle to a reconciliation between the governor and himself. As she returned one 
day from her classes, Hypatia was waylaid by a mob of religious zealots, slashed by 
sharp oyster shells, and nally torn limb from limb, her remains delivered to the ames. 
With the death of Hypatia, the long and glorious history of Greek mathematics was at 
an end. 

Although it seems likely that the greater part of the Museum and its library was 
plundered by the Christians well before Alexandria was taken by the Moslems in 641, 
the Moslems burned what books were still left. Accounts say that the Arab military 
governor, perplexed by the hoard of writings gathering dust in the library, referred the 
matter to Mecca for advice. According to the Christian writer Bar-Hebiaeus, the reply 
came: “Either the manuscripts contain what is in the Koran, in which case we do not 
have to read them, or they contain what is contrary to the Koran, in which case we 
must not read them.” In either event, their destruction was decreed; the contents of the 
library were distributed among the public baths, of which there were some 4000, where 
they served to supply the res for the next six months. The story is probably based on 
truth, but the library was then only a ghost of the past. A great stock of writings appears 
to have been destroyed at the time of Julius Caesar’s siege of Alexandria in 48 B.C. 
Caesar, fearing that he would be cut off by sea, sent an incendiary crew to set re to the 
Egyptian eet, which had been left undefended in the harbor. The con agration spread 
to the wharves and warehouses, and before it could be brought under control, consumed 
the original library building. Part of the loss was recouped when Marc Antony presented 
200,000 volumes from the library of Pergamon to Cleopatra. 

The successors of Diophantus are noted mainly for translating and commenting on 
the writings of earlier scholars. The traditions of Alexandrian mathematics became more 
remote as one commentator after another skimmed the surface of his predecessors’ work 
to produce a volume that would gain a wider audience. At a time of general decay 
of learning, originality often gave way to fraudulent scholarship—the indiscriminate 
appropriation of materials without acknowledgement or the falsi cation of sources to gain 
the appearance of quoting Greek treatises. There were, to be sure, reputable commentators 
who showed some knowledge of the mathematics of the Golden Age, most notably 
Proclus (410-485) and Boethius (475-524); their own work, even though a pale re ection 
of what Greek mathematics was at its highest moment, furnishes a link between classical 
and medieval learning. 

Proclus received his early training in Alexandria but spent most of his life in Athens, 
where he was head of the Academy of Plato. The range and volume of his production 
was enormous, extending from philosophy and theology through mathematics, physics, 
and astronomy, to literary criticism and poetry. Although Proclus lived a good thousand 
years after the inception of Greek mathematics, he had access to numerous historical and 
critical works that have since vanished completely. The historical work whose loss is 
most deeply to be deplored is the great History of Geometry by Eudemus, the pupil of 
Aristotle. Luckily a brief outline (called the Eudemian Summary) has been preserved by 
Proclus in his Commentary on the First Book of Euclid’s Elements. Even in its fragmentary 
form, the history is of incomparable value as our main source of information on Greek 
geometry from Thales to Euclid. 
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Woodcut showing a contest between the old and new arithmetic, symbolized by 
Boethius (left) using the Hindu-Arabic numerals and Pythagoras (right) still 
reckoning with a counting board. (From Margarita Philosophica (1508) of Gregor Reisch.) 


Roman Mathematics: Boethius and Cassiodorus 


For those who are interested in tracing developments in theoretical mathematics, the 
Roman period is singularly barren of interest. However excellent the Romans may have 
been in the arts, literature, and law, they showed no disposition to master the Greek 
sciences, let alone to add to them. As the early Church emerged from the catacombs, 
people argued less about mathematics and more about salvation. Learning of any kind 
was deemed useful as it was necessary for the proper understanding of the Scriptures 
and the writings of the Church fathers. The trivium of liberal arts became the accepted 
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format of Christian education; and within the trivium, the study of grammar and rhetoric 
received far more attention than what was devoted to logic. Contributing to this lack 
of interest in theoretical studies was the fact that a knowledge of the Greek language, 
in which much of the scientic learning of antiquity remained, gradually faded in the 
Latin-speaking West. 

Perhaps the best known Roman commentator to interest himself in the Greek works 
then available was Anicius Boethius (circa 475-524). Memorably characterized as “the 
last of the Romans and the rst of the Scholastics,” Boethius provided a bridge between 
Antiquity and the Middle Ages. Born into one of the wealthy and illustrious families 
of senatorial rank, he received the best education to be had in those troubled times. 
Scholars disagree over where Boethius was educated, some favoring Athens and others 
Alexandria. As a young man Boethius entered the Roman administrative system. The 
Ostrogothic king Theodoric, who had ruled Italy since 493, needed the experience of the 
old Roman aristocracy in his task of governing. Holding a number of trusted positions, 
Boethius reached the height of his political power in 522 when he became Master of 
the Of ces, a post in which he functioned virtually as the king’s prime minister. Soon 
afterward, Boethius fell out of Theodoric’s favor and was accused of treasonable conduct. 
The of cial charge—widely accepted now as unjust—was that he corresponded with the 
Byzantine Emperor Justinian in a conspiracy to overthrow Theodoric. In prison awaiting 
execution, Boethius wrote The Consolation of Philosophy, one of the classics of Western 
thought. 

Boethius, realizing the sad state of the sciences and aware of his own command 
of Greek, had previously embarked on the ambitious program of providing the scholars 
of his day with textbooks on all four subjects of the quadrivium (arithmetic, geometry, 
music, and astronomy). His geometry consisted of nothing more than de nitions and 
statements of theorems—with no proofs—from Books I, IH, and IV of the Elements, 
along with various practical applications. Boethius’s popular work, De Jnstitutione 
Arithmetica, is actually a paraphrase, bordering on a translation, of the /ntroductio 
Arithmeticae of Nicomachus. Although occasionally adding material and condensing 
portions of the original, Boethius contributed nothing really new. He was not an expert 
mathematician, and in his departures from Nicomachus he was trying to exhibit his 
own talents, after the fashion of Latin commentators. Yet such was the poverty of 
mathematical learning of the time that it is mainly through Boethius that the Middle 
Ages came to know the principles of formal arithmetic. His Arithmetica remained for 
over a thousand years the authoritative text on the subject in monastic schools (that the 
Church proclaimed him a martyr no doubt helped too). Indeed, the last known edition 
of Boethius’s Arithmetica was published in Paris in 1521. In the East, meanwhile, 
the Greek masterpieces were being zealously preserved, studied, and recopied by each 
generation. After the original texts were rediscovered by the Latin West in the fteenth 
century, Boethius sank into an obscurity that became as great as his reputation once was. 

Magnus Aurelius Cassiodorus (circa 480-575), a younger friend of Boethius and not 
so great a scholar, yet made a more substantial contribution to the preservation of the 
classical heritage. Like Boethius, he was a Roman aristocrat who rose to high position 
in the government of Theodoric. Upon retiring from public life to his estate at Vivarium, 
Cassiodorus founded a large monastery with the conscious aim of making it a center 
of Christian learning and scholarship—the rst education-oriented monastic house. This 
involved the creation of a scriptorium for the translation into Latin of the classical texts 
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to be studied; copies were made for their own library and to be sent to monasteries that 
were less well equipped. For the education of his monks and to facilitate their teaching 
of others, Cassiodorus composed the /ntroduction to Divine and Human Writings. Most 
of it was devoted to holy scripture and the works of the church fathers, but he did offer 
a brief discussion of each of the seven liberal arts. This primitive textbook served as the 
basis of the curriculum of the church schools in the early Middle Ages. 


5.4 Problems 


Let the cubic equation ax* + bx? + cx +d =0 have 
integral coef ci ents a, b, c, and d. 


(a) Prove that if this equation has a rational root 
r/s, where r and s are relatively prime, then r 
divides d, and s divides a. [Hint: Substitute 
x =r/s in the equation, clear of fractions, and 
use Euclid’s lemma.] 

(b) Show that if a = 1, every rational root of the 
cubic must be an integer that divides the 
constant term d. 


Find the rational roots of the following cubic 
equations: 


(a) 2x3—5x?-—2x4+15=0. 
(b) 32x3-—6x-—1=0. 

(c) 6x3? —x*-—4x-1=0. 
(d) x3 —7x?+ 20x —24=0. 
(e) x? —2x?+7x+2=0. 


A real number r is said to be constructible if there 
exists a line segment of length |r| that can be 
constructed by straightedge and compass from a 
given line segment of unit length. (Because there is a 
one-to-one correspondence between constructions by 
straightedge and compass and algebraic operations 
that are purely rational or involve real square roots, 
this translates into: A real number r is constructible 
if it can be calculated from 0 and 1 by a _ nite 
number of additions, subtractions, multiplications, 
and divisions, and extractions of square roots.) The 
following theorem is well-known: If the cubic 
equation 


ax? + bx? +ex+d=0 
a,b,c,and d integers, a4~0 


has a constructible real number as a root, then it has a 
rational root. Show the impossibility of constructing, 
with straightedge and compass, the side of a cube of 


a volume twice that of a given cube. [Hint: If the 
original cube has side of length 1, the edge of the 
desired cube must satisfy x3 — 2 = 0. Show that this 
equation has no constructible real number as its root.] 


The relation between the angle a (that is, the angle 
whose measure is a) and the numbers sina and cosa 
is exhibited in the accompanying gure. 


Prove that it is possible to construct an angle a with 
the aid of a straightedge and compass if and only if 
either sina or cosq@ is a constructible real number. 


Establish that it is impossible by straightedge and 
compass alone to trisect the angle 60°. [Hint: A 60° 
angle can be trisected if and only if it is possible to 
construct a 20° angle. From the trigonometric identity 


cos 3a = 4cos' a — 3cosa 
applied to a = 20°, it is found that 
+ = 4 cos’ 20° — 3 cos 20°. 


If we let x = cos 20°, this can be rewritten as a cubic 
equation 8x* — 6x —1=0.] 


(a) Show by means of the trigonometric identity 
cos 3a = 4cos* a — 3cosa@ that the angle 90° 
can be trisected using only a straightedge and 
compass. 

(b) Show by means of the trigonometric identity 
sin3a = 3 sina — 4sin’ q that the angle 30° 
cannot be trisected. 
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The immediate cause of the seventh-century 


Mathematics in the Near and Far East surge in Arabic power was the emergence of 


a new faith, Islam, founded on the teachings 
The Algebra of al-Khowarizmi of Mohammed. United in religious fervor, the 

desert tribes of the Arabian peninsula met few 
obstacles as they spread swiftly through the Mediterranean world. The historic city of 
Damascus fell to Arab arms in 635, as did Jerusalem in 637. Egypt was conquered 
between 639 and 642. Advancing westward, the Arabs crossed the Straits of Gibraltar 
(711), moving up through Spain and into France as far as Poitiers. Meanwhile Arab 
armies swept in the other direction, through Syria and Persia, even reaching northern 
India. A hundred years after the death of Mohammed, his followers were the masters 
of an empire half again as large as was ever controlled by the Romans. Only Christian 
Europe, except for Spain, was barred to them. 

The rulers of the new empire, known as caliphs (that is, “successors”), governed 
from Damascus. However, in 762 they resolved to build a new capital on the Tigris 
River, at a place that bore the old Persian name of Baghdad. Quickly becoming a great 
commercial and cultural center, Baghdad boasted a population of 800,000 in the ninth 
century, making it larger than Constantinople. Arabic became the language of learning 
for a large part of the world—all Moslems were required to know it to read the holy 
Koran—and anything written in Arabic carried the prestige previously accorded to Greek 
works. 

The intellectual legacy of Greece was the most important treasure of the lands 
that came under the domination of the Arabs, and the main task of Arabic scholarship 
was its absorption. To this purpose, Caliph al-Ma’min set up the renowned House of 
Wisdom, a kind of academy comparable with the Museum at Alexandria. An intense and 
energetic effort was made to acquire Greek manuscripts, even to the extent of sending out 
emissaries to Constantinople to secure a copy of Euclid’s Elements from the Byzantine 
emperor. These were rendered into Arabic by a corps of translators at the House of 
Wisdom and placed in a library there for the use of scholars. By the beginning of the 
tenth century, practically the whole extant body of Greek scientic and philosophical 
writings had been recorded in the Arabic language. This classical heritage, along with 
re nements and extensions that the Arabs developed themselves, was eventually to reach 
the Latin West. 

The most illustrious of the Arab mathematicians was Mohammed ibn Misa al- 
Khowarizm’ (circa 780-850), who enjoyed the patronage and friendship of Caliph al- 
Ma’min. As court astronomer, he was doubtless one of the scholars early associated 
with the House of Wisdom. It was largely through his work, consisting mainly of two 
books—one on arithmetic and the other on algebra—that Europe became acquainted 
with the Hindu numerals and the algebraic approach to mathematics. Few details of 
al-Khowarizm”’s life are known. There is one story, grounded in several sources, that 
connects him with a later caliph: al-Khowarizm’ was called to the bedside of a seriously 
ill caliph and asked to cast his horoscope. As a result al-Khowarizm’ assured the patient 
that he was destined to live another 50 years, but unhappily he died within 10 days. 

Al-Khowarizm* compiled a small treatise on arithmetic with a title something like 
Book of Addition and Subtraction According to the Hindu Calculation. It is the earliest 
work in Arabic to explain the use of the Hindu decimal system of numerals. Although 
al-Khowarizm* mentions just “nine letters” (that is, symbols for the digits 1 through 9) 
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to be used for writing numbers, he does also make use of the zero: 


When nothing remains [in subtraction], put down a small circle so that the place be not 
empty, but the circle must occupy it. 


No copies of the original Arabic version of the book have survived; it has reached us 
only in a Latin translation Algoritmi de numero Indorum, made by John of Seville at the 
beginning of the twelfth century. Its in uence on European mathematical thought was 
so great that the new numerals were misnamed “Arabic” despite their Indian origin. In 
1857 a copy of the Latin translation was discovered at the Cambridge University library. 
It begins with the words “Dixit Algoritmi,” or “Thus spoke al-Khowarizm’.” 

Western Europeans rst learned about algebra from the work of al-Khowarizm’. It 
does not seem likely that his knowledge of algebraic techniques derives from Diophantus, 
whose Arithmetica was not translated until the end of the tenth century; besides that, 
Diophantine algebra has an entirely different character, being primarily concerned with 
the theory of numbers. 

The name “algebra” is the European corruption of al-jabr, part of the title of al- 
Khowéarizm’’s treatise Hisdb al-jabr w’al mugabalah. Apparently the title means “the 
science of reunion and reduction.” The words refer to the two principal operations the 
Arabs used in solving equations. “Reunion” refers to the transference of negative terms 
from one side of the equation to the other and “reduction” to the combination of like terms 
on the same side into a single term, or the cancellation of like terms on opposite sides of 
the equation. For example, in the equation (in modern notation) 6x? — 4x + 1 = 5x? +3, 
“reunion” gives 


6x? + 1 = 5x7 + 4x +3, 
and from “reduction,” 
x? = 4x42. 


In the twelfth century, the book was translated into Latin under the title Liber Alge- 
brae et Almucabola, which ultimately gave the name to that part of mathematics dealing 
with the solution of equations. The in uence of al-Khowarizm’ is also re ected in the 
fact that algorism (or algorithm), a Latin corruption of his name, for a long time meant 
the art of computing with Hindu-Arabic numerals. Today it is used for any method of 
calculation according to a set of established rules. 

The traditional explanation of the Arabic word jabr is that it means “the setting 
of a broken bone” (hence, “restoring” or “reunion”). When the Moors reached Spain in 
the Middle Ages, they introduced the word algebra, and there, in the form algebrista, it 
came to mean “a bonesetter.” At one time in Spain, it was not uncommon to see a sign 
reading Algebrista y Sangradoe (“bonesetting and bloodletting”) over the entrance to a 
barbershop; for until recent times, barbers performed many of the less skilled medical 
services as a sideline to their regular business. 

In speaking of al-Khowarizm’, we do not mean that he personally was the inventor 
of algebra, for no branch of mathematics sprang up, fully grown, through the work of 
one person. He was only the representative of an old Persian school who preserved 
its methods for posterity through his books. This early Arabic algebra was still at the 
primitive rhetorical stage—a phase characterized by the complete lack of mathematical 
symbols, in which the calculations were carried out by means of words (even numbers 
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were written out in words rather than presented as symbols). Algebraic rules of procedure 
were proclaimed as if they were divine revelations, which the reader was to accept and 
follow as a true believer. Whenever reasons and proofs were given, they were presented 
as geometric demonstrations; the Arabs, inspired by Euclid’s Elements, seemed to believe 
that an argument had to be geometric to be convincing. 

In dealing with quadratic equations, al-Khowarizm’ divided them into three funda- 
mental types: 


(1) x? +ax=b, 
(2) x? +b =ax, 
(3) x? =ax +b, 


with only positive coef cients admitted. (Negative quantities standing alone were still not 
accepted by these Arabic mathematicians.) All problems were reduced to these standard 
types and solved according to a few general rules. 

Al-Khowarizm”s geometric demonstration of the correctness of his algebraic rules 
for solving quadratic equations may be illustrated by his discussion of the equation 
x? + 10x = 39, a problem he solved by two different methods. This equation reappears 
frequently in later Arab and Christian texts, running “like a thread of gold through the 
algebras of several centuries.” The rst geometrical solution is explained as follows. 
Given x? + 10x = 39, construct a square ABCD having sides of length x to represent 
x’. Now one has to add 10x to the x. This is accomplished by dividing 10x into four 
parts, each part representing the area ( ®)x as a rectangle, and then applying these four 
rectangles to the four sides of the square. 


[4 (10/4)x ea 10/4 
D C 


This produces a_ gure representing x? + 10x = x? + 4(40x). To make the gure into a 
larger square of sides x + 2 we must add four small squares at the corners, each of 
which has an area equal to (42). That is, to “complete” the square, we add 4(2)? = (2). 
Then we have 


i\? 5 10\? 10\? 
+>) =@?+10x)+4(T) =39+ (=) =39425 = 64. 


Hence, the side of the square must be x + — = 8, from which it is found that x = 3. 


In the general setting, the quadratic equation x* + px = q is solved by this method 
of completion of squares by adding four squares, each of area (p/4)*, to the gure 
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representing x? + px, to get 


(x + py =x? + px+4(p/4P =q4+ (2) . 


This leads to the solution 


For al-Khowarizm”s second method of solving x* + 10x = 39, the starting point is 
a gure composed of a square of side x (and area x”) and two rectangles, each having 
length x and width 2. Because the area of each rectangle is x(2), the area of the entire 
gure is x7 + 2(2)x. To complete this gure so as to form a square, it is necessary 


to add a new square of area ( wy. The area of the completed square is (x + Ry, and 


consequently 
ne are pc eee ge 64 
— =X = EX — = : 
ug 2 2 


10/2 


This side of the square is then x + 2 = 8, whence the value of the unknown is x = 3. 
In solving the general equation x? + px = q in this manner, a square of side p/2 is 
added to the gure, which represents x? + 2(p/2)x, thereby making 


(+f) = a(B)e+ (BY <a (8). 


This yields, as before, the solution 


In looking at the solution of the quadratic equation, we have seen that geometry 
was undisputed mistress in Euclid’s Elements; the algebraic content was clothed in 
geometric language. With the work of al-Khowarizm’, however, we see the beginning 
of doing away with this limitation, as geometric explanations come to be auxiliary to a 
newly predominant algebraic reasoning. The old, ingenious Babylonian tricks and devices 
for solving individual problems are nally seen as part of al-Khowdarizm”s systematic 
reduction of the quadratics to their standard types, with each type solved according to its 
own rules. In the work of al-Khowarizm’, we discern a sure progress and an evolution 
from ancient mathematical practices to improved and more general methods. 
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The Arabic mathematicians of this period, besides transmitting Hellenistic learning to 
the West, made lasting contributions of their own. Indeed they revised and reconstructed 
many fundamental ideas in mathematics. The Arabs recognized, for example, irrational 
roots of quadratic equations, although these had been disregarded by the Greeks. 

Although the Arabs recognized the existence of two solutions of a quadratic 
equation—something never done by Euclid or the Babylonians—they listed only the 
positive ones. They did not perceive the reality of negative solutions to an equation. The 
very idea of a negative root implies the acknowledgment of negative numbers as indepen- 
dent entities having the same mathematical status as positive ones. This understanding 
is of more recent origins; in the works of al-Khowarizm’ and the other Arab algebraists, 
negative numbers are consistently avoided. The existence and validity of negative as well 
as positive roots was rst afrmed by the Hindu mathematician Bhaskara (born 1114). 
Europeans have admitted them only since the sixteenth or seventeenth century. 


Aba Kamil and Thabit ibn Qurra 


Abii Kamil (circa 850-930), often called “The Reckoner from Egypt,” was the second 
of the great Arabic writers on algebra. Little is known of his life and activities other 
than that he is apparently of Egyptian descent and wrote in the period following al- 
Khowarizm*. His Book of Algebra (Kitab fil-jabr w’al muqdbalah), a title commonly 
used by early Muslim algebraists, is essentially a commentary on and elaboration of 
al-Khowarizm”s work; in part for that reason and in part for its own merit, the book 
enjoyed widespread popularity in the Muslim world. A much more extensive treatise on 
algebra than that of al-Khowarizm’, Aba Kamil’s Algebra contains a total of 69 problems 
compared with the 40 of his famous predecessor. 

As would be expected with a commentary, Abii Kamil carried over intact many of 
the problems that al-Khowarizm* had explained. At the same time, he did not hesitate 
to add further methods of solution to those presented by the earlier author. This may be 
seen in Problem 8 of the Algebra. As expressed by Abt Kamil, it reads: “Divide 10 into 
two parts in such a way that when each of the two parts is divided by the other their 
sum will be 43.” In modern notation, the problem consists of nding two numbers that 
satisfy the equations 


xt+ty=10, =p = de. 
y 


The algebraic identity 


is used to convert the second of these to 
2 2 1 
x+y= a 


Abti Kamil rst solves the problem along the lines of al-Khowarizm’. That is, he puts 
y = 10 —x into the previous equation to obtain a standard type of quadratic equation, 


Mathematics in the Near and Far East 243 


namely 
ae +100 = ae 
4 2, 


with solution x = 2; hence, the corresponding value of y is 8. Abi Kamil then presents 
a method of solution of his own, one which involves the old Babylonian procedure of 
introducing a new unknown quantity z by letting 


x=5-z, y=5+2. 


When these values are substituted, the equation x7 + y* = 4ixy becomes 
2 1 2 
50+ 2z° = 4425 —2z*), 


which yields z? = 9. This gives z = 3 and, in turn, the numbers sought are 
KS S23S2, y=54+3=8. 


Abia Kamil developed a calculus of radicals that is quite distinctive. He managed 
the addition and subtraction of square roots, without using our symbols, by means of the 


equalities 
JatvVb=Va+b+2Vab. 


As with al-Khowarizm”s work, the Algebra is entirely rhetorical, with all computations 
(often quite complicated) described in words; the only notation in the text is of integers. 
For instance, the rule for subtracting the square root of 4 from the square root of 9 is 
expressed as 


If you wish to subtract the root of 4 from the root of 9 until what remains of the root of 
9 is a root of one number, then you add 9 to 4 to give 13. Retain it. Then multiply 9 by 
4 to give 36. Take 2 of its roots to give 12. Subtract it from the 13 that was retained. One 
remains. The root is 1. It is the root of 9 less the root of 4. 


This is just a verbal description of what we would write as 


J9—-V4=94+4-2/9-4=1. 

The major advance of Abi Kamil over earlier writers is in his use of irrational 
coef cients in indeterminate equations. A case in point is Problem 53 of the Algebra. In 
it he asks for a number such that, if the square root of 3 is added to it and the square 
root of 2 is added to it, then the product of the two sums will be 20. This, which today 
would be written as 

(x + ¥3)\(x + V2) = 20, 
leads to the quadratic equation 


x24 J6 4+ V3x2 + 72x? = 20. 


Abt Kamil gives the correct value 


ee 
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The introduction of irrational solutions for some quadratics is another point of departure 
from the foundation work of al-Khowarizm’. 

Abti Kamil’s Algebra holds an especially important place in the development of 
mathematics in the West through its in uence on the works of the Italian Leonardo of 
Pisa, better known as Fibonacci. When Fibonacci wrote his Liber Abaci (1202), he drew 
heavily on the Arabic author, reproducing some 29 problems from the Algebra with 
little or no change. Although Fibonacci was a borrower, he should not be regarded as a 
plagiarist; Abi Kamil’s methods were so well-known at the time that any mathematician 
felt free to use his results as common property. 

Another prominent scholar in the early history of Arabic mathematics is Thabit ibn 
Qurra (circa 836-901). He was a mathematician of diverse talents and accomplishments: 
distinguished physician, skilled translator, and the prolic writer of close to 150 works. 
Thabit’s gift with languages—resulting perhaps from his being a moneychanger in his 
youth—allowed him to prepare excellent translations of the bulk of the Greek math- 
ematical works. These include Apollonius’s Conic Sections, Nicomachus’s /ntroductio 
Arithmeticae, and The Measurement of a Circle and On the Sphere and Cylinder of 
Archimedes. Euclid was highly venerated by the Arabs, with the Elements being among 
the earliest translations made from the Greek: an abridged version appeared around 800, 
followed by two complete translations before the tenth century. Thabit’s thorough revision 
of one of the latter is the most faithful rendition into Arabic of the Elements, which in turn 
came to the West through Gerard of Cremona’s Latin adaptation in the twelfth century. 

Thabit’s Book on the Determination of Amicable Numbers concerns the study of 
numbers, as understood in the Pythagorean sense. It is usually regarded as the rst 
completely original mathematics written in Arabic. The work contains 10 propositions, 
including one of the construction of “amicable” number pairs (that is, pairs of numbers 
each of which is equal to the sum of the proper divisors of the other). In the /ntroductio 
Arithmeticae, Nicomachus mentions such numbers, but fails to derive any of the theory. 

Thabit’s remarkable rule is the following: if p= 3-2” —1, q=3-2""'—1, and 
r=9-27"-! _ 1 are all prime numbers, then M = 2"pq and N = 2"r form a pair of 
amicable numbers. For instance, let n = 2 so that p = 11, g = 5, and r = 71; then the 
pairs 220 = 2? - 5-11 and 284 = 27-71 are amicable. To see this, note that the proper 
divisors of 220 give rise to the sum 


14+24+44+5+410+11+20+4 22+ 44+ 554 110 = 284, 
while those of 284 yield 
14+24+4+4+71+4 142 = 220. 


In a contribution to geometry, Thabit gave a generalization of the Pythagorean the- 
orem that would apply to any triangle whatsoever. Suppose that from vertex A of a 
triangle ABC two lines are drawn intersecting BC in points B’ and C’ such that ZAC’ B’ 
and ZAB’C’ are each equal to ZA; then, according to Thabit, 


AB? + AC? = (BC)(BB’ + CC’). 


This was stated without proof, other than to say that it can be obtained with the aid 
of Euclid’s Elements. A solution based on what is called the law of cosines (Euclidean 
Propositions 12 and 13 of Book II) is as follows: 
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Since ZA = LC’ = ZB’, it follows that 
AB? + AC? = BC” + 2(AB)(AC)(cos A) 
= BC? + (AB)(AC)(cos C’ + cos B’) 
= BC? + (AB)(AC) wes + a 
7 AC’ AB’ 
FC 4FBR’ 
AB’ ; 
Now, from the similarity of AABC and AAB’B, we have AB/AB’ = BC/AC and so 


= BC? + (48,40) ( 


AB? + AC? = BC? + (BC)\(FC' + FB’) 
= (BC)(BC + C’B’) = (BC\(BB' + CC’). 


Thabit also presented several completely original dissection proofs of the 
Pythagorean theorem itself. He called his approach the method of reduction and compo- 
sition, or to be somewhat more precise, reduction to triangles and their rearrangement 
by juxtaposition. The gure for one such proof is indicated below. 


E 


F B 
C B 


Starting with a given right triangle ABC, squares 4A’B’'B and DFB'D’ on its legs are 
constructed; also ACDE is the square on the hypotenuse. The rst two squares, 4 4’B'B 
and DFB'D’, can be obtained by adding triangles I and II to the shaded region S in the 
diagram. Likewise, the square on AC DE is arrived at by adding triangles HI and IV to 
the same shaded region. But all four triangles are copies of each other, so that the sum 
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of the rst two squares must equal the third square: 
AA'B'B+ DFB'D' =14+114+S=111+1V+S=ACDE. 


Among Thabit’s mathematical writings is Book on the Measurement of the Conic 
Section Called Parabolic. In it, the determined the area of a parabolic segment to 
be two-thirds the product of its base and height. Because this involved inscribing 
triangles the length of whose bases were proportional to the sum of odd integers, Thabit 
ascertained the formula 1+3+5+7+---+(2n—1)=n?’. In another of his many 
works, entitled The Proof of the Well-known Postulate of Euclid, he sought to prove 
Euclid’s parallel postulate from the four remaining ones. Here he introduced for the rst 
time quadrilaterals in which two base angles are right angles, and the angle sides not 
common to these are of equal length. When the gure later gured in the development 
of non-Euclidean geometry, it became known as a Saccheri quadrilateral in honor of the 
eighteenth-century Italian geometer Giovanni Saccheri. Thabit argued that the other two 
angles, the summit angles, are both right angles also. 


D C 


A B 


He reasoned that if angle D were acute, then CB would be shorter than DA, while if 
angle D were obtuse, then CB would be greater than DA. These contradictions would 
imply that angle D must be a right angle. A similar demonstration applies to the angle C. 

Having lived in Baghdad during the early years of its observatory, Thabit wrote a 
number of works on astronomy. He was one of the rst to draw attention to the errors 
and discrepancies in Ptolemy’s Almagest. Particularly troublesome was Ptolemy’s useful 
device of the equant point, which caused a planet to speed up or slow down in its orbit. 

A notable mathematician who was active at the end of the tenth century is Abt 
Bakr al-Karaj* (d. 1029) or Al-Karkh’ as he is sometimes called. Little is known of the 
details of his life other than that he lived in Baghdad around the year 1000. Al-Karaj”’s 
major work titled al-Fakhri (The Marvelous) is dedicated to a local ruler of that name. 
Its signi cance lies in being the earliest detailed account of the algebra of polynomials. 
Expressing the view that “the monomials are in nite,” al-Karaj* studies the properties 
of the two sequences {x, x*,x3,...} and {1/x, Le, 1x, ...}. Each power is derived 
from its predecessor by multiplying by x or 1/x, so that x*-x =x? and 1/x?-1/x = 
1/x3. The usual arithmetical operations are then extended to polynomials. In the case 
of multiplication, this required giving the rhetorical equivalent of the modern law of 
exponents, x» x" = x™"t", 

Al-Karaj* included several results on binomial expansions. After observing the 
pattern of the coef cients formed in the development of (a +5) and (a +5)*, he 
quite remarkably deduced the rule that governed the C* in the expansion (a +b)" = 

1-0 Cka"—*bk; namely, that Ck = C?~*C*_,. The values of these coef cients were ar- 
ranged in a triangular table, which in seventeenth-century Europe became known as the 
Arithmetic Triangle, or Pascal’s Triangle. 
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Al-Karaj” also took up the question of the sums of the rst m squares and cubes, 
expressing his results in the form 


17427437 4...4n? = (Qn/3 41/90 +2434+---4+7) 
B+B434---+W=(14+2434+---4+20). 


A successor, al-Samaw’al (ca. 1180), rened these two identities by showing the right- 
hand sides to be equal to (1/6)n(n + 1)(2n +1) and (1/4)(n(n + 1))’, respectively. 
Al-Karaj”’s proof was only for the particular case n = 10, but the demonstration ap- 
plies to an arbitrary value. 

Let us consider al-Karaj”’s argument fo the sum of cubes, as it provides an illustration 
of the early use of recursive reasoning. He begins with a square with sides of length 
1+2+3-+---+ 10, which has been partitioned as shown below. 


A B 
10 
E 
9 
H 
2 
fe 
D JIG, ©€ 


The L-shaped region ABCGFE can be viewed as consisting of two congruent 
rectangles and a square. As such, it has an area equal to 


20 10014 2 Sv 40) 4 17 


or, upon simplifying, to 9 - 107 + 107 = 103. The same reasoning as applied to the next 
L-shaped region EF GJ/H shows its area to be 


2-91 42434---48)497% =8- 749 =—9, 


The process continues until the square in the corner is reached; its area is 17. But the 
area of the whole square is equal to the sum of the areas of its pieces, which is to say, 
equal to 17 + 274+ 37 +---+10°. 


Omar Khayyam 


During the second half of the eleventh century, an aggressive new power advanced 
into Persia. The Moslem Seljuk Turks seized Baghdad in 1055, ending the political 
domination of the caliphs over the eastern provinces of their empire. Under the narrowly 
orthodox Turks, Baghdad gradually withered away as an important intellectual center; but 
learning still ourished in the western caliphate, particularly in such cities as Cordova, 
Toledo, and Seville. 

One of the scholars to remain in Baghdad is more celebrated outside the Islamic 
world for his poetry than for his mathematical achievements: Umar al-Khayyam’, better 
known as Omar Khayyam (1048-1123). The Rubaiyat, in which he praises the delights 
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of the senses, has been offered in hundreds of editions. The most in uential of his 
mathematical works, in full title Treatise on Demonstrations of Problems of al-Jabra 
and al-Mugqabalah, considerably advanced the subject of algebra. Where al-Khowarizm* 
dealt with linear and quadratic equations, Khayyam constructs the solutions of all kinds 
of cubics by the use of intersecting conic sections. He also compiled Commentaries 
on the Difficulties in the Premises of Euclid’s Book, which is concerned with the theory 
of parallel lines. Khayyam accepted the truth of Euclid’s Postulate 5, but viewed it 
as being less obvious than many of the propositions that required proof; he attempted 
to derive the parallel postulate from what he felt was the more intuitive principle that 
“converging” lines intersect. 

The establishment of magni cent observatories, such as the one in Baghdad (829), 
and the inux of Greek astronomical texts, stimulated the interest of Arabic mathemati- 
cians in astronomy as well. Their skill in computation provided the accurate astronomical 
tables so necessary for orienting the mosques in the direction of Mecca. Khayyam was 
among those who made notable contributions to astronomy. About 1079, he was called 
on to adjust the Persian calendar in order that feasts and fasts might be kept at the 
proper times; the result was a remarkable work known as the Jalalian calendar, which 
is so accurate that it requires a day’s correction every 5000 years. (Our Gregorian cal- 
endar has an error of no more than one day in a span of 3330 years.) After his several 
accomplishments in scientic areas, Khayyam took up poetry and philosophy. His free 
thinking, so evident in the hedonistic verses of the Rubiayat, led to charges of impiety. 
The court-sponsored patronage that had supported his observatory was withdrawn, and it 
had to be closed. Khayyam later undertook a pilgrimage to Mecca as a way of clearing 
himself of the accusation of atheism. 

Khayyam claimed to be the rst mathematician to solve every type of cubic equation 
having a positive root. Since he did not recognize negative numbers as coef cients, it 
was necessary to consider 14 separate types of cubics that cannot be reduced to linear or 
quadratic equations on division by x or x”. In each case, a geometric demonstration of the 
solution was given. Let us illustrate Khayyam’s procedure for the equation x*7 + qx =r, 
which he chooses to write as x? + b?x = b’e. 


Referring to the gure, Khayyam takes the line segment AB to have length b. A 
perpendicular BC of length c is then drawn to AB. Next he constructs a parabola with 
vertex B, axis BF, and parameter b. In modern notation, the parabola has equation 
x? = by. Now, on BC as diameter, a semicircle is described. Its equation is 


ere) 
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or 
x(x -c)+y =0. 

The semicircle will meet the parabola at a point D whose abscissa, or x-coordinate, 
provides a root of the given cubic. Geometrically the root is represented by the line 
segment BE, with E determined by dropping a perpendicular from D to BC. 

To see this algebraically, let (xo, vo) be the coordinates of the point D. Since D lies 
on the parabola, we have x = byo, which implies that x} = b’yé. But D is also on the 
semicircle, whence yg = xo(c — xo). Combining these equations yields 


xg = b’ yg = b’xo(c — x0), 


or i = b*(c — xo). Thus, xo satis es the cubic equation x* + b’x = bc. 

In treating each of his 14 cases, Khayyam was aware that a cubic might possess 
two positive roots, depending on how the conics involved intersect. Bold as he was, 
he ignored negative and repeated roots. He also failed to discover the possibility of 
three roots occurring, as with an equation of the type x° + qx = px” +r (one concrete 
example being x* + 11x = 6x? + 6). Khayyam also erroneously concluded that it is not 
possible to nd an algebraic solution of the general cubic. But this should not detract 
from his mastery of the geometrical theory of third-degree equations, which may be 
regarded as the most successful accomplishment of an Arabic mathematician. 

Western Europe was the inheritor of this mathematical owering, and much of the 
classical Greek legacy as well—saved from oblivion by the work of the Arabic translators. 
Scholarship, like other aspects of Arabic civilization, never wholly recovered from the 
shock of the crusades of the twelfth and thirteenth centuries. The crusades at least helped 
acquaint Christendom with a culture far superior to its own. By the time this great period 
of Islamic vitality and originality had passed, Latin translation of Arabic scientic and 
philosophical works was well underway. These Latin translations provided the foundation 
of the intellectual renewal of the West. 


The Astronomers al-Tiist and al-Kashi 


A careful observation of the heavens was a major focus of Islamic mathematical 
activity, partly out of genuine scienti c interest but mainly to regulate religious practices. 
The local times for the ve daily prayers (sunrise, midday, afternoon, sunset, and evening) 
were calculated from the direction and height of the sun. Moreover, the Arabic lunar 
calendar consisted of 12 months in a year—about 11 days short of a solar year—with 
each month beginning when the crescent of the new moon made its rst appearance in the 
evening sky. The precise determination of these astronomically de ned events required 
the development of accurate timekeeping tables. Caliph al-Ma’min, who had a personal 
interest in science, set up an observatory in Damascus. It was followed shortly, in 829, 
by an even larger one in Baghdad. A number of mathematicians with a keen interest in 
astronomy, such as al-Khowarizm’ and ibn Qurtra, spent time there correcting the errors 
and discrepancies in Ptolemy’s planetary model. Al-Ma’min’s astronomers revised the 
astronomical tables in the Almagest by taking simultaneous observations in Damascus 
and Baghdad. Their own tables ultimately replaced those of Ptolemy. 

Over the next few centuries, Arabic astronomical activity underwent periods 
of decline and revival. Many observatories enjoyed only a short life, ending often 
with the death of a patron or hostility from religious authorities. The thirteenth and 
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fteenth centuries saw moments of renewal with the creation of two major Islamic 
observatories. 

The mathematician and astronomer Nas’ al-D’n al-Tiis* (1201-1274) ourished dur- 
ing the mid-thirteenth century, in one of the most tumultuous periods of Islamic history. 
Early in the century, the Mongol conqueror Genghis Khan had swept through central Asia 
and into eastern Europe. After his death in 1227, his vast domain was divided among his 
sons and grandsons. One grandson, Hulagu Khan, was subsequently charged with putting 
down a revolt in northern Persia in 1256. Al-Tas’, by midcentury a respected scholar, 
sought refuge in a mountain fortess of his patron, the ruler of the despised Assassin 
sect. Possibly he felt that this was the only group that could provide security in the face 
of the advancing Mongols. Many contemporaries believed al-Tis* to be an opportunist: 
when the fortress fell to Hulagu, it was frequently said that he had betrayed its defenses. 
Hulagu went on to sack Baghdad in 1258, thereby ending forever the rule of the caliphs. 
He then captured Damascus in 1260. 

After the fall of Baghdad, al-Tis* immediately switched his loyalties and became 
a scientic advisor to Hulagu. The new ruler’s deep interest in astrology prompted him 
to construct a magnicent observatory—its foundations still survive—at Maragha near 
the northwest corner of what has become Iran. The most renowned astronomers, some 
from as far away as Spain and China, were invited to work there under the leadership 
of al-Tas*. The instruments they employed were remarkable for their size and quality. 
A major achievement from 12 years of observation and calculation was the compilation, 
in 1271, of a new set of accurate astronomical tables. The Maragha astronomers also 
developed a non-Ptolemaic theory of planetary motion, mathematically similar to the 
coming great work of Copernicus. Three years later (1274), al-Tis* died on a trip 
to Baghdad. His death brought an end to the creative period at Maragha, although 
observations continued into the next century. 

Although al-Tas* is best known for his numerous astronomical works, he was a 
scholar of exceptionally wide learning who tried his hand at many things. He com- 
posed a variety of treatises on such subjects as logic, philosophy, ethics, arithmetic, and 
trigonometry. He brought out greatly improved translations of Euclid’s Elements, Apol- 
lonius’s Conics, and Ptolemy’s Almagest. Al-Tiis* did not follow Ptolemy slavishly, but 
questioned epicycle theory as not adequately describing uniform circular motion of a 
planet about the center of its orbit. In geometry, al-Tis* attempted to prove Euclid’s fth 
postulate using the following argument: Consider two lines AB and CD, which are so 
related that consecutive perpendiculars EF, GH, and L/ constructed from CD meet AB in 
unequal acute angles on the side toward B. 


A E G 


C D 
F H J 


If AB and CD do not meet in the direction of B and D, the perpendicular // will grow 
forever shorter than GH while EF will grow forever larger. 

About a century and a half after the building of the Maragha observatory, there was 
another brief resurgence of astronomical activity, again sponsored by the grandson of the 
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conqueror. Tamerlane, or Temur the Lame (ca. 1336-1405), a Tartar who claimed Mongol 
ancestry from Genghis Khan, ruled over a wide region in Asia Minor stretching from 
Persia to western India. His capital was situated at Samarkand in what is now Uzbekistan. 
The city was a center of scholarship and science until the late 1500s. Here, Tamerlane’s 
grandson Ulugh Beg (1347-1449), himself an astronomer and patron of learning, founded 
an observatory in 1420. The three-story building was equipped with the nest instruments 
then available. Next to it a trench contained a giant sextant of radius 130 feet, capable of 
measuring with great accuracy the elevation of celestial bodies over one-sixth of a circle. 

The Persian mathematician Ghiyath al-Din al-Kash* (d. 1429) was brought to 
Samarkand to take charge of the new observatory. Under his leadership, it became more 
than a place for astronomy but was an institution of higher learning where nearly every 
branch of science was taught. Al-Kash”’s own effort seems to have been directed toward 
producing more precise tables of sines and tangents for every minute of arc, along with 
calculations of the longitudinal motions of the sun and moon. He also revised Ptolemy’s 
star catalog based on fresh observations, to give more precise positions for over 1000 
stars. His The Calculator’s Key, a work on arithmetic, algebra, and measurement, was 
dedicated to Ulugh Beg. 

Al-Kash”’s Treatise on the Circumference expounded on the use of decimal fractions. 
His representation of z as 


w = 3.14159265358979324, 


correct to 16 decimal places, greatly exceeded all previous calculations and remained so 
until the sixteenth century when the Dutchman Ludolph Van Ceulen produced z to 35 
decimal places. Al-Kash* was also the author of The Key to Arithmetic, Treatise on the 
Chord and Sine as well as several works arising from his astronomical research. 

After the thirteenth century, scientic enquiry in general began to wane throughout 
the Arabic world. Current scholarly thought was more engaged in religious, philosophical, 
or legal studies. Their advocates had become suspicious of “foreign sciences” whose 
contributions they deemed to be useless in an Islamic culture. Ulugh Beg was assassinated 
in 1449, and with his death scientic pursuit lost its last great defender. Early in the 
next century, his observatory was reduced to rubble by religious fanatics, its personnel 
eeing for their lives for having pried into the secrets of nature. The observatory’s actual 
location remained unknown until it was rediscovered in 1908. A major consequence 
of this changing environment was that although western Europe underwent its modern 
scientic revolution in the sixteenth and seventeenth centuries, no comparable creative 
breakthrough took place in the Islamic East. 


The Ancient Chinese Nine Chapters 


The Greek mathematicians of classical times had a unique genius for geometry; 
no other society of this period developed the subject as an abstract deductive system. 
Their Chinese counterparts, for instance, concerned themselves with some geometric 
questions, but always in an empirical, nondemonstrative way. All we nd_ in the early 
Chinese mathematical handbooks are practical problems connected with everyday life, 
problems involving the calculation of areas of all kinds of shapes, and volumes of various 
vessels and dams. Chinese mathematics was profoundly algebraic, so geometric gures 
served only to transmute numerical information into algebraic form. 
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The earliest Chinese work having mathematical content is the Arithmetic Classic of 
the Gnomon and the Circular Paths of Heaven. Its main concern is astronomical calcu- 
lations, with the word “gnomon” in the title referring to an L-shaped instrument akin to 
a sundial for measuring elevations and distances. The date of composition is uncertain. 
The version that has come down to us is thought to have been written around 300 B.c., 
although certain portions may be based on a text several hundred years older. Some 
authorities speculate that the mathematical knowledge contained therein may go back as 
far as 1000 B.c. 

At the beginning of the book, which is viewed as its oldest part, there is a discussion 
of properties of right triangles. An accompanying diagram reveals that the ancient Chinese 
had a thorough understanding of the mathematical relationship between the legs and 
hypotenuse of a right triangle, and did so well before the time of Pythagoras. 


Ags] |S 


The derivation of the Pythagorean property is presented only in the case of a 3-4-5 
right triangle, but its validity for any right triangle is apparent. Typical of the period, the 
passage takes the form of a lengthy discourse between two people, in this case, a duke 
and his minister. Bhaskara’s extremely brief proof, which employs the same diagram, 
dissects the oblique square into four congruent right triangles plus a small square, and 
rearranges the pieces to represent the area of two squares. 

The Nine Chapters on the Mathematical Art marks the beginning of the mathematical 
tradition in China. The oldest textbook on arithmetic in existence, it is conspicuously 
free of the mystic cosmology of the earlier Arithmetic Classic of the Gnomon and the 
Circular Paths of Heaven. The date and origin of the Nine Chapters is unknown. We 
know that it represents the collective effort of many mathematical minds over several 
centuries. First assembled as a book about the same time that Euclid was drawing up 
his Elements, original copies were destroyed in the famous Burning of the Books of 213 
B.C. Fragments of the collection were later recovered, put in order, and augmented by a 
number of mathematicians before the work received its nal form. The text as it survives 
today is a commentary on the Nine Chapters, prepared by Liu Hui in A.D. 263. Liu 
Hui gave theoretical veri cations of each of the problems, at the same time extensively 
expanding and enriching the material with his own contributions. 

In its inuence on Chinese mathematical thought, Liu Hui’s commentary on the 
Nine Chapters is the most important of all ancient works, studied by generation after 
generation for more than a thousand years. Indeed, later mathematical writings bear its 
imprint, both as to ideas and terminology. Chinese mathematics was geared toward pro- 
ciency in algebraic manipulation and problem solving, so that there was little incentive 
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to change a procedure that worked; this serves in part to explain the longevity of the Nine 
Chapters. 

In the seventh century, the Nine Chapters was given wide currency when the gov- 
ernment decreed its use throughout the universities as a standard syllabus for students 
preparing for civil service examinations. It became one of the earliest printed textbooks 
when a printed version appeared in 1084, the product of a wood-block technique in which 
each whole page was separately carved from one wooden block. (The oldest known book 
produced by wood-block printing, the Buddhist Diamond Sutra, was cut in 868; the en- 
tire Buddhist canon, printed between 972 and 983, required the engraving of 130,000 
two-page blocks.) 

As its title indicates, the Nine Chapters consists of nine distinct sections with a total 
of 246 problems and their solutions. It may be likened to the Elements in being an orga- 
nization of the mathematical knowledge accumulated by the Chinese up to the middle of 
the third century. The Nine Chapters was not intended as a theoretical work in the Greek 
style but as a practical handbook with problems that the ruling of cials of the state were 
likely to encounter: measurement of cultivated land, construction of dikes and canals, 
capacity of granaries, rates of exchange, and taxation of foodstuffs. Thus, the chapters 
bear such titles as “Field Measurement,” “Distribution by Proportion,” and “Fair Taxes.” 

The early chapters give computational rules—some correct, others not—for obtaining 
the areas of rectangles, triangles, trapezoids, and segments of circles; and for the volumes 
of such familiar solids as spheres, cylinders, pyramids, and circular cones. For instance, 
the area of a circle is given by 3a’, where d is the diameter; the result would be 
correct if the value of a were taken to be 3. The correct formula for the volume of 
a truncated pyramid, which was also known to the Egyptians, appears here. There are 
detailed procedures and explanations for the extraction of square and cube roots using 
counting rods. Special attention is paid to the arithmetic of fractions, with an emphasis on 
nding common denominators. A chapter is devoted to the solution of linear equations 
in one unknown by means of the rule of false position. 

Let us quote four problems from the Nine Chapters. Being quite diverse, they may 
suggest the extent of the topics covered. Problem 32 of Chapter 1 (“Field Extensions”’) 
takes z to be 3 in the calculation. 


There is a circular el d, circumference 181 bu and diameter 604 bu. Find [the area of] the 
eld. 


Problem 3 of Chapter 3 (“Distribution by Proportion”) involves nding the sum of an 
arithmetic progression. 


There is a woman weaver who increases an [equal] amount each day. She weaves 5 chi on 
the rst day and in a month 9 pi 3 zang. Find her increase each day. 


Problem 1 of Chapter 7 (“Excess and De ciency”) requires the solution of two linear 
equations. 


A number [of persons] are buying goods. If a person pays 8 there is a surplus of 3, if a 
person pays 7 there is a de cit of 4. Find the number of persons and the cost of the goods. 


Problem 13 of Chapter 9 (“Right Angles”), which is an application of the Pythagorean 
Theorem, also occurs in early Hindu mathematical texts. 


There is a bamboo of 10 ch’ih high. It is broken and the upper end touches the ground 3 
ch’ih away from the root. Find the height of the break. 
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It should be mentioned that Chapter 3 also deals with what is often known as the 
Rule of Three. This useful procedure was highly regarded for commercial transactions 
and later appears in early Indian mathematics. It requires nding the fourth term in a 
simple progression, that is, to solve the equation a/b = c/x for x. Problem 3.20, for 
instance, states, 


There is a loan of 1000 gian with a monthly interest of 30 gian. Now there is a loan of 750 
qian which is returned in 9 days. Find the interest. 


As did mathematicians in other ancient cultures, those in China attempted to calculate 
as accurately as possible the ratio of a circle’s circumference to its diameter, that is, 
the value of z. Their best estimate by the third century was m = 142/45 or 3.1555, 
obtained by Wan Fan (219-257). In his commentary “Field Measurements,” Liu Hui 
was prompted to nda more precise gure through an approach similar to that used by 
Archimedes. His new method of “cutting the circle” determined the circumference by 
calculating perimeters of inscribed polygons, whereas Archimedes used both inscribed 
and circumscribed polygons. 

Liu Hui began by inscribing a regular hexagon in a circle of radius r; the length S¢ 
of each side is r. Next the hexagon was replaced by a regular 12-sided dodecagon with 
Siz denoting the length of a side. 


AX 
WAY 


If ae is the length of the perpendicular from the center of the circle to a side of the 
hexagon, then a2 = r? — (S¢/2)° and consequently 


Si2 = ¥ (S6/2)? + (r — a6). 


Continuing the process, let S, represent the length of a side of an inscribed regular n- 
sided polygon and A,, denote its area. Then the value $5, can be obtained once S, is 
known by means of the formula 


Son = ¥V (S,/2)° + (r = ay). 


Each of the 27 triangles that make up A>, can be viewed as having base r and height 
S,/2, hence possessing an area of r'S,/4. As a result, the area A of the circle can be 
approximated by A», =rnS,/2. Liu Hui successively doubled the number of sides of 
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the inscribed polygons until a 192-sided polygon was reached. Taking the radius r = 10, 
he derived the values 


So = 10, Siz = 5.17638, — So4 = 2.61052, 
S4g = 1.30806, Sop = 0.65438 


and therefore approximated A by 


1 
Ayo = 596 - 10(0.65438) = 314.1024. 


Because z = A/100, the result of these calculations was to estimate z to six decimal 
places as 3.141024. Using the area inequality 


Aon <A< A, - 2(Adn = An), 


Liu Hui went on to determine that 2 lay between 3.141024 and 3.142704. His later 
work with an inscribed polygon having 3072 sides led to a more accurate gure for z 
as 3.14159. 

Two hundred years later, the distinguished mathematician-astronomer Tsu Chung- 
chih (430-501), with the help of his son, arrived at a value of z expressed by the bounds 
3.1415926 < mw < 3.1415927. 

Correct to seven decimal places, this value was derived using Liu Hui’s method 
as applied to a 12288-sided regular polygon. In the fourteenth century, Chao Yu-chin 
persevered even further with a polygon of 16382 sides to get wm ~ 3.1415926, which 
simply conrmed the correctness of Tsu Chung-chih’s calculation. The Chinese degree 
of accuracy was not reached in the West until the end of the sixteenth century. 

As early as 200 B.c., Chinese mathematicians had developed a procedure for ob- 
taining square roots and even cube roots of a given number. Chapter 4 (“What Width’’) 
of the Nine Chapters contains several problems that require nding the side of a square 
with a known area. As an example, Problem 12 says, “One has a square area of 55225 
pu. What is the side of the square?” The method of extracting the root essentially uses 
the algebraic identity 


(atb+cyP =a* +(2a+t bb + [2(a + b) + cle. 


In the indicated problem, a = 200, b = 30, and c = 5, so that 55225 = 235. 

Liu Hui cloaks the foregoing algebraic equation in a geometric argument that can be 
used for square roots of any number. It involves dissecting a square of area N = 55225 
in a certain way. To obtain the rst decimal digit in VN = d\d>d3, one seeks the largest 
square of side S, = d, - 10° satisfying S? < N = 55225. Clearly, d; = 2. 
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The square Ss? (the square in the upper left-hand corner) is removed from N, leaving the 
area 15225. The second digit dz is found by selecting the largest value S2 = dz - 10 with 
the property that 


28, S. + S3 < N — S? = 15225; 


thus, d> = 3. We notice that the expression 2.$,$) + SS = 12900 represents the area of 
the L-shaped region of width S$; bordering two sides of the included square. This area is 
also removed from N, so that 2325 is all that remains. Finally the digit d3 is the largest 
value $3 = d3-1 for which 


2(S; + S2)S3 +S? < 2325. 


Here, d3 = 5. When the area represented by this last expression (another L-shaped gure 
of width $3) is removed from N, nothing is left. Consequently, /55225 = 235. 

Square root extraction did not always lead to a whole number, but could be 
continued to produce a series of decimal places, those “little nameless numbers.” 
When this occurred, the Chinese achieved greater precision by relying on the formula 
VN = (VN10%)/10*. Thus, 10.5625 would be replaced by 

1 325 
—v 105625 = — = 3.25 
100 100 

Another problem from the Nine Chapters deserves comment, for it has appeared in 

numerous mathematical works over the years. Problem 16 of Chapter 9 states: 


Given a right triangle of kou [width] 6 and ku [height] 8, nd the largest circle that can be 
inscribed in this triangle. 


The associated gure suggests that the solution involves areas of the regions making the 
triangle, namely, four smaller right triangles and a rectangle. 


But later mathematicians found that the problem could be more easily treated using the 
three triangles formed by joining the center of the given triangle with its vertices. This 
would lead directly to the equation 


1 Lats al 
—xy = —rx+=—r —rz, 
a a im 


Mathematics in the Near and Far East 257 


whose solution yields the diameter of the desired circle, 
2xy 


d = 2r = ——_.. 
X+y+Z 


The Nine Chapters provides the rst evidence that we have of a systematic method 
for solving simultaneous linear equations. The method occurs in the 18 problems of the 
eighth chapter, which is called “The Way of Calculating by Arrays.” Some idea of the 
general procedure may be obtained from the rst problem: 


There are three grades of corn. After threshing, three bundles of top grade, two bundles 
of medium grade, and one bundle of low grade make 39 dou [a measure of volume]. Two 
bundles of top grade, three bundles of medium grade, and one bundle of low grade will 
produce 34 dou. The yield of one bundle of top grade, two bundles of medium grade, and 
three bundles of low grade is 26 dou. How many dou are contained in each bundle of each 
grade? 


The relations of the problem are equivalent to a system of three linear equations in three 
unknowns x, y, and z, namely 


3x +2y +z = 39, 
2x +3y+z = 34, 
x +2y + 3z = 26. 


The equations were not written in this fashion, but the coef cients of the unknowns and 
the constants were represented by rods on a counting board as the array 


1 2 3 
2 3 2 
3 1 1 
26 34 39. 


By performing appropriate multiplications and subtractions, coef cients are eliminated 
in this array until it is reduced to 


0 0 3 
0. oi. 22 
36 «(1 


99 24 39. 
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We would represent the last array as the system of equations 36z = 99, Sy +z = 24, 
3x + 2y +z = 39. From this system, we could easily calculate, in turn, the values z = 23, 
y= 45, and x = 95. 

During the course of carrying out the array computations, negative numbers might 
occur within an array. In this earliest accepted use of negative numbers, the Chinese 
used red rods on the counting board to represent negative numbers, while black rods 
stood for positive numbers. A coef cient of zero was indicated by a blank space on the 
board. 

Computations with counting rods led to a counting-rod numeration system, with 
symbols derived from the arrangement patterns of rods on the board. Two symbol forms, 
patterns of vertically and horizontally placed marks, were used for each of the nine 
nonzero digits. Within a given number, the forms for digit symbols would alternate, 
depending on each digit’s place value. 


Place position 1 2 3 4 | 6 7 8 9 


Units - oe, eee Zee 


Hundreds | of ot ott it tot 


Ten thousands 


Tens 
Thousands —_—_ = 


Il 
Ill 
I|— 
II|— 


Vacant spaces stood where we would put zeros, so that a number such as 95,071 
was expressed by |] = tL |. At a later stage, a circular symbol for zero was intro- 
duced. Counting-rod notation continued in common computational use until its eventual 
displacement in the sixteenth or seventeenth century. 

Besides writing a commentary on the Nine Chapters, Liu Hui also produced the 
shorter Sea Island Mathematical Manual. A treatise on surveying containing only nine 
practical problems, it seems to have been intended to supplement the last section of the 
Nine Chapters that dealt with the properties of right triangles. By the seventh century, 
it was separated from the Nine Chapters to become an independent mathematical work. 
The problems in the Sea Island Mathematical Manual involve measuring distances to 
inaccessible points by using tall poles with sighting bars xed at right angles on them. 
Unlike Thales’s technique for nding the distance of a ship at sea from the shore, 
Liu Hui’s problems usually require two observations, and sometimes three or four. The 
Manual most likely takes its name from the rst problem of the collection, which begins 
with the statement, “There is a sea island that is to be measured.” 


There is a sea island that is to be measured. Two poles that are each 30 feet high are erected 
on the same level, 1000 paces [1 pace = 6 feet] apart, so that the rear pole is in a straight 
line with the island and the rst pole. If a man walks 123 paces back from the rst pole, 
the highest point of the island is just visible through the top of the pole when he views it 
from ground level. Should he move 127 paces back from the rear pole, the summit of the 
island is just visible through the top of the pole when seen from a point on ground level. It 
is required to nd the height of the island and its distance from the nearer pole. 
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The rules given for calculating the required unknowns involve recognizing that corre- 
sponding sides of similar triangles are proportional. The problem is geometric, but 


the solution is algebraic. Using modern notation, Hui’s explanation of the solution pro- 
ceeds as follows. Let EK = DJ, so that FK is parallel to GJ. Knowing that triangles 
CHG and FEK are similar, as are triangles CGF and FKA, we have the proportions 
CH HG CG _ GF 
FE EK FK AK 
Now, if CB =x, BD=y, DE=d, GD= FE=h, DJ = EK = q,, and EA = az, then 
these proportions yield 


x-h yd 


h a ay—ay 


from which the desired distances can be obtained: 


hd aid 
+h = 1255 paces, y = — = 30,750 paces. 
a2— a, az — ay 


Pa 


The other eight problems of the Manual also deal with distance measurements, with the 
solutions always based on properties of similar right triangles. 


Later Chinese Mathematical Works 


The thirteenth century is regarded as the high point in the development of traditional 
Chinese mathematics. It was a time of vigorous growth in the subject, when a wealth 
of original ideas took root—only to be too soon forgotten. The foremost mathematicians 
were not necessarily government functionaries, but more often wandering teachers and 
recluse scholars. Unhindered by bureaucratic constraints, they moved beyond the limits 
of practical application to a new level of abstraction. Surprisingly, this mathematical 
awakening owered in a period of great unrest in the Chinese empire. In the early part 
of the thirteenth century, China began to experience the rst incursions by the Mongol 
armies of Genghis Khan, who brought the whole country under his control by 1279. The 
Mongols extended their power in other directions, sweeping to the banks of the Indus, 
overrunning Baghdad, and even reaching westward into portions of Hungary and Poland; 
in due course, they created the most extensive empire the world had yet seen. 

One of the leading gures at this time was Ch’in Chu-shao (circa 1202-1261), who 
published his celebrated Mathematical Treatise in Nine Sections in 1247. In the preface, 
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he mentions that he learned mathematics from a certain itinerant teacher, though he fails 
to indicate his name. A soldier in his youth, Ch’in rose in of cialdom to become the 
governor of two provinces. He was an unsavory person, reputed to poison those he found 
disagreeable, and was dismissed from one governorship for bribery and corruption. 

The idea of negative numbers was familiar to Chinese authors well before its ac- 
ceptance in Europe during the fteenth century. Using the colors familiar on counting 
boards, Ch’in began the custom of printing negative numbers in black type and positive 
ones in red. Not only is the Nine Sections the oldest extant Chinese mathematical text 
to contain a round symbol for zero, but it is the rst in which numerical equations of 
degree higher than 3 occur. This is exemplied by the equation 


—x* + 736,200x* — 40,642,560,000 = 0. 


As with all Ch’in’s equations, the constant term is taken to be negative. 

In uenced no doubt by the prized Nine Chapters on the Mathematical Art, the Chi- 
nese mathematicians of the thirteenth century tended to favor algebra over geometry, 
using picturesque geometric situations as devices for deriving more dif cult equations. 
Attention was restricted to positive roots, perhaps because Chinese equations always 
arose from concrete settings. To give an illustration from the Nine Sections: 


There is a circular walled city of unknown diameter with four gates. A tree lies 3 li north 
of the northern gate. If one walks 9 li eastward from the southern gate, the tree becomes 
just visible. Find the diameter of the city. 


Taking x? to be the diameter of the city, Ch’in was led to the tenth-degree equation 
x10 4 15x8 + 72x — 864x* — 11,664x? — 34,992 = 0. 


The unknown diameter is found to be 9 li. 

Among the mathematicians who gured prominently during this period was Li Ye 
(1192-1279). He passed the civil service examination in 1230 and was appointed gov- 
ernor of a city in northern China. But when the province fell to the Mongol onslaught 
in 1234, Li Ye gave up his ambitions for an of cial post and instead devoted himself to 
mathematical study. The result was his Sea Mirror of Circle Measurements, written in 
1248 and followed by Old Mathematics in Expanded Sections in 1259. When the great 
Kublai Khan, grandson of Genghis Khan, ascended the throne in 1260, Li Ye was asked 
to serve as a government consultant; but he resigned within a year, using the pretext of 
old age and ill health. 

The Old Mathematics in Expanded Sections is a revision and clari cation of an 
eleventh-century work, Collection of Old Mathematics, which is itself a collection of 
pieces of earlier origin. The central theme of its 64 problems is the construction of 
quadratic equations derived from imaginative geometric con gurations of circles, squares, 
rectangles, and trapezoids. For instance, consider Problem 8: 


There is a circular pond centered in the middle of a square eld, and the area outside the 
pond is 3300 square pu. It is known only that the sum of the perimeters of the square and 
the circle is 300 pu. Find the perimeters of the square and circle. 


Li Ye takes the unknown (call it x) to be the diameter of the circular pond. Using 
the ancient value “mz = 3,” the pond’s circumference is 3x, making the perimeter of the 
square eld 300 — 3x. Now (300 — 3x)? would be the area of a square eld 16 times 
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larger than the given one, while 16(3x7/4) = 12x? is the area of 16 circular ponds. The 
difference (300 — 3x)? — 12x? represents 16 portions of the area outside the pond, so 
that 


(300 — 3x)? — 12x? = 16 - 3300 = 52,800. 


This yields the equation 37,200 — 1800x — 3x* = 0. The diameter x of the pond is 20; 
hence the perimeters of the square eld and circular pond are 240 pu and 60 pu, respec- 
tively. 

Li Ye’s original contribution to Chinese mathematical notation was to indicate neg- 
ative quantities by drawing a diagonal stroke through the last digit of the number in 
question. Thus, —8643 would appear as ek T == AL. This was an improvement on 
the earlier use of red and black colors, and soon became the accepted notation in printed 
works. When expressing a polynomial equation, Li Ye listed the coef cients in a horizon- 
tal column, placing the constant term uppermost and all the others below. For example, 
the quadratic 24x? — 70x + 1600 = 0 is represented by 


— [o o 
+t ® 
= lll 


The column system was no doubt a rendition on paper of the actual arrangement of 
the rods on the counting board, where the calculations were being carried out. 

The last of the notable thirteenth-century mathematicians, Chu Shih-chieh ( ourished 
1280-1303) wrote two treatises, the /ntroduction to Mathematical Studies in 1299 and 
four years later the Precious Mirror of the Four Elements. As to the circumstances of his 
life, we only know from the preface of the second work that he traveled extensively in 
China for more than 20 years, presumably earning a living as a teacher of mathematics. 
His Introduction to Mathematical Studies is essentially a textbook for beginners; it was 
lost for some time in China, but reappeared in print in Korea in 1433 and later (1658) in 
Japan, where its wide study greatly in uenced the development of Japanese mathematics. 

The Precious Mirror begins with a diagram showing the coef cients of binomial 
expansions (x + 1)” through the eighth power. The triangular arrangement later came to 
be known in the West as “Pascal’s triangle.” Like his predecessors, Chu Shih-chieh deals 
with numerical equations of higher degree, even up to the fourteenth degree. Many are 
solved by a procedure that was rediscovered by William Horner in 1819 and is now known 
as Horner’s method. In treating the equation x? + 252x — 5292 =0, Chu rst nds by 
trial that there is a root between 19 and 20; then he shifts the roots of this equation 
by 19 by making the substitution y = x — 19. The new equation y? + 290y — 143 = 0 
must have a root between 0 and 1, which Chu approximates to be 143/(1 + 290). Since 
a root of the original equation is 19 greater than one of the second, he takes x = 19+ 
143/(1 + 290) as his nal approximate solution. 

In this work are also found rules for the sums of piles of balls arranged so that they 
form triangles, pyramids, cones, and so on. Among the various series that are discussed 
are the following: 


n(n + 1) 


es ae alae 
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1 1 2 
fgapeijacce (TY . rer) 


2 6 , 
1 2 1 2 3 
(ja 6 ops ee 
6 24 
Pikes o we eg Oe 
a aie 


Following the Chinese custom of presenting knowledge without justi cation, no theoret- 
ical proofs of these rules are given. 

Chinese mathematical activity suffered a grave decline from the fourteenth century 
onward, with the accomplishments of the past almost completely forgotten. It is not 
clear whether this was due to a lack of adequate symbolism, entrenched use of the 
counting-board and abacus, or a failure of earlier writers to record the various stages 
of their computations and methods. The arrival of European works in translation served 
temporarily to awaken Chinese interest in their own mathematical heritage, but little that 
was new emerged from it. 

Western mathematics was introduced into China at the end of the sixteenth century 
by European missionaries, above all by the Jesuits. One of the rst Jesuits to arrive was 
the Italian Matteo Ricci (1537-1610), who had been well educated in mathematics and 
astronomy by Christoph Clavius at the Collegio Romano. Ricci rst settled in Southern 
China in 1583 and some 20 years later received permission to live in the Imperial Court in 
Peking, where he remained until his death. He earned the respect of many of the dominant 
scholar-of cials by adopting the dress and manners of the Chinese and mastering their 
language to perfection. 

The Jesuit missionary strategy called for identifying European knowledge more 
speci cally as “Christian,” as if to say that those who were interested in the knowl- 
edge should adhere to the religion. In spite of this predominantly religious aim, Chinese 
scholars showed a great receptivity to Western advances in mathematics and astronomy, 
branches of learning in which Europe then surpassed China. Ricci’s inuence at court 
helped other Jesuits to obtain of cial posts with the Imperial Astronomical Bureau, where 
they adjusted and reformed the traditional Chinese calendar. 

Aided by learned converts, Ricci brought out a number of translations of European 
mathematical treatises. The most signi cant, a Chinese version of the rst six books of 
Euclid’s Elements, was made in 1601 from Christoph Clavius’s Euclidis Elementorum 
(1574). Under the title A First Textbook in Geometry, it carried the ef gy of Christ on its 
cover. This work had no counterpart in the Chinese mathematical tradition, which had 
failed to develop an interest in the theoretical aspects of geometry as did the Greeks— 
possibly because they lacked a formal system of logic. As we have seen, the Chinese did 
concern themselves with geometric questions based on the right triangle, but they were 
always tied to the solution of practical problems. Ricci also dictated Clavius’s Epitome 
Arithmeticae Practicae to the Christian scholar Li Chih-Tsao, who published it in 1614 
as Rules of Arithmetic. It contained written algorithms for elementary operations which 
the Chinese at that time performed on the abacus. 

Ricci’s place in the history of geography is ensured by his maps of the world, which 
appeared in 1584, 1589, and 1602. The rst two of these were merely European maps in 
Chinese guise; but the famous 1602 map (printed in Peking from wood blocks on thin 
Chinese paper) was especially constructed to bring China toward the center, a measure 
likely to win approval in of cial circles. For the rst time, it gave the Chinese a complete 
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idea of the relative positions of the oceans and landmasses. The western hemisphere had 
not previously been known to them. 

Inevitably, there was a Chinese reaction against alien ideas and methods. The anti- 
Western scholars attempted to disparage the Jesuit mathematical contributions on the 
one hand, and to exonerate the slow progress of indigenous mathematics on the other. 
Particular attention was paid to the Arithmetic Classic of the Gnomon and the Nine 
Chapters on the Mathematical Art as containing the rudiments of all mathematics. In 
this view, only the neglect of these ancient texts had precluded the full owering of 
Chinese mathematics—a owering that would also have included the triumphant Western 
mathematics. In this way, the reverence for Chinese antiquity paved the way for partial 
acceptance of some of European mathematics. One Chinese traditionalist mathematician, 
for example, chose 3.16 as the value of zr, “this exactness being in agreement with those 
of ancient authorities.” 

Opposition to “barbarian ways” continued for several centuries with the inevitable 
outcome that Chinese mathematics languished until more recent times. Beyond serving 
the needs of the Empire through surveying, calendar-making, and astronomical observa- 
tion, mathematics was not considered important. An impulse to discover new results for 
their own sake was never to develop in China as it did in Renaissance Europe. 


4. Use Thabit’s rule to obtain another pair of amicable 


5.5 Problems numbers. 
5. In 1866, a sixteen-year-old Italian student, Nicolo 
1. Solve the following quadratic equations by the Paganini, discovered the second smallest amicable 
Arabic method of completing the square: pair: 1184 = 2° - 37 and 1210 =2-5- 11°. Conrm 
that this pair is indeed amicable but is not found by 
(a) x° + 8x =9. Thabit’s rule. 
(b) x? + 10x = 144. -? 
(c) x24 12x = 64. 6. Thabit gave a proof of the Pythagorean theorem that 
(d) 3x? + 10x = 32. [Hint: Multiply both sides by depends on calculating the area of the gure below in 


3 and let y = 3x.] two different ways. (Each way involves three 
. triangles.) Fill in the details. 
2. Find the unknown quantity in each of the following 


problems from al-Khowarizm”’s Algebra: 


(a) I multiply a third of a quantity plus a unit by a 
fourth of the quantity plus a unit, and it 
becomes 20 units. [Hint: If x is the quantity, 
then (x/3 + 1)(x/4+ 1) = 20.] 

(b) I multiply a third of a quantity by a fourth of 
the quantity in such a way as to give the 
quantity itself plus 24 units. 

(c) I divide 10 units into a quantity and a 
remaining part, so that the sum of the squares 
of the two portions is 58 units. 


A B 


7. (a) Show that the cubic equation x3 + b*c = b?x 
can be solved by nding the intersection of the 
3. Below are two equations that occur in the Algebra. parabola x? = by and the hyperbola 
Solve them. ye tex = x?, 
(b) Show that the cubic equation x3 + c = ax? can 
be solved by nding the intersection of the 
(b) 10—x —_— 3B parabola y? + cx = ac and the rectangular 


x 10-x 6° hyperbola xy =c. 


(a) (10—x+x?+(10—x)—x = 54. 
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8. Khayyam’s derivation of a geometric solution of the 


cubic equation x* + b*x + c} = ax” proceeded as 
follows: Take the line segments AB and BC to have 
lengths c?/b? and a, respectively. Erect the 
perpendicular to AC at B to cut the semicircle on AC 
at D. Along BD mark off BE equal to b and through 
E draw EF parallel to AC. Locate G on BC so that 
(BG)\(ED) = (BE)(AB) and complete the rectangle 
DBGH. Through #H construct the rectangular 
hyperbola having EF’ and ED extended as 
asymptotes and intersecting the semicircle in J. 
Draw the parallel to DE through J to meet EF in K 
and BC in L. Let GH cut EF in M. 


To show that BL is a solution of the cubic, verify the 
correctness of the following assertions: 


(a) (EK)(KJ) = (EM)(MH) = (BG)(ED) = 
(BE)(AB). (Hint: J and H lie on the 
hyperbola. ] 


(bo) (BL)(LJ) = (ERK)(BE + KJ) = 
(BE)(EK + AB) = (BE)(AL). 

(c) = = TL [Hint: Triangles AJL and CJL are 
similar. ] 

d (BEY _ (JLY _ LC 

@) (BL? (ALY AL 


(ce) (BE)(BL + AB) = (BL)*(BC — BL). 
(f) (BLP +B°(BL) + = a(BL). 


The next ve problems are from the A/gebra of Abt 
Kamil. 

. Problem 15. 10 dinar is divided equally among a 
group of men so that when 6 more men are added to 
their number and 40 dinar is divided equally among 
them, then each receives as much as he did 
previously. Find the original number of men. 
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10. 


11. 


12. 


13. 


14. 


15. 


Problem 19. Given that 3 roots of a number plus 

4 roots of the difference between the number and the 
3 roots equals 20, nd the number. [Hint: In the 
equation 3./x + 4,/x — 3./x = 20, let x = y” to 
obtain 20 — 3y = 4,/y? — 3y; then square both 
sides. ] 


Problem 26. Divide 10 into two parts in such a way 
that when a certain one of these parts is divided by 
the other and the fraction is multiplied by its 
numerator, the result gives 9. 


Problem 54. Find a number such that if 7 is added to 
it and the sum multiplied by the root of 3 times the 
number, then the result is 10 times the number. 
[Hint: To solve the equation (x + 7)/3x = 10x, put 


1 
x= ar this yields y? + 21 = 10y.] 


Problem 66. Divide 10 into two parts in such a way 
that when 50 is divided by one part and 40 by the 
other, and then the fractions are multiplied, 125 will 


result. [Hint: Algebraically, the equations are 


50 
x+y = 10 and —-— = 1251] 


Use Abii K4mil’s formula for the difference of two 


square roots to show that /18 — J/8 = /2; then 
express 18 + V8 as a single square root. 


Solve the following problems from the last section of 
the Nine Chapters: 


(a) A square, walled city measures 200 paces on 
each side. Gates are located at the centers of 
each side. If there is a tree 15 paces from the 
east gate, how far must a man travel out of 
the south gate to be able to see the tree? 


A 200 B 


(b) 


A square, walled city of unknown dimensions 
has four gates, one at the center of each side. 
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16. 


17. 


A tree stands 20 paces from the north gate. A 
man walks 14 paces southward from the south 
gate and then turns west and walks 1775 
paces before he can see the tree. What are the 
dimensions of the city? 


G 


The two problems that follow are found in the 
seventh chapter of the Nine Chapters. Solve them. 


(a) A certain number of people are purchasing 
some chickens, jointly. If each person 
contributes 9 wen there is a surplus of 11 wen, 
and if each person contributes 6 wen there is a 
de ciency of 16 wen. Find the number of 
people and the price of the chickens. [Hint: If p 
is the price and n is the number of people, then 
9n = p+ 11, and 6n = p — 16.] 

(b) There are 9 equal pieces of gold and 11 equal 
pieces of silver. The two lots weigh the same. If 
one piece is removed from each lot and put in 
the other, the lot containing mainly gold is 
found to weigh 13 ounces less than the lot 
containing mainly silver. Find the weight of 
each piece of gold and silver. 


Consider the following problem, adapted in modern 
mathematical language from the Sea Island 
Mathematical Manual: 


There is a square, walled city of unknown 
dimensions. A man erects two poles d feet 
apart in the east-west direction and joins them 
with a string at eye level. The eastern pole is 
in a straight line with the northeastern and 
southeastern corners of the city. By moving 
northward a, feet from the eastern pole, the 
man’s line of observation with the 
northwestern corner of the city intersects the 
string at a point b feet from its eastern end. He 
again goes north a, feet from the pole, until 


18. 


19. 
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the northwestern corner of the city is in line 
with the western pole. What is the length of a 
side of the square city? 


Show that the dimension of the city wall is 
_ (a2 = 4 1)b 


(ban) — 


[Hint: The poles are located at points C and F. 
Construct EJ parallel to GA. From the similarity of 
triangles FCA and EC/J it follows that a; + JB = 

= (AC\(EC)/FC. Now, triangles GBA and EBS 
are similar, as are triangles GDB and ECG, whence 


AB BG __ GD 
JB BE EC’ 
Thus x = GD = (AB)(EC)/JB.] 


Obtain a solution to the following Chinese problem 
(circa 400): There are three sisters, of whom the 
eldest comes home once every 5 days, the middle 
sister once every 4 days, and the youngest every 3 
days. In how many days will all three meet together? 


Solve the two problems below, found in the Old 
Mathematics in Expanded Sections. 


(a) There is a circular pond in the middle of a 
square eld, and the area outside of the pond is 
3300 square pu. The pond is 20 pu from the 
edge of the eld. Find the side of the eld and 
the diameter of the pond. [Hint: Let the 
unknown x be the diameter of the pond, and 
x = 3. Then (x + 40) represents the area of 
the square, including the pond.] 

(b) There is a rectangular pond in the middle of a 
circular eld, and the area outside of the pond 
is 7300 square pu. The sum of the length and 
width of the pond is smaller than the diameter 
of the eld by 55 pu, while the difference 
between the length and width is 35 pu. Find the 
sides of the pond and the diameter of the eld. 
[Hint: If the unknown x is the diameter of the 
e Id, show that x? + 55x — 15,500 = 0.] 
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20. Find a root of the equation x7 — 71,824 = 0, as was 
done in the Mathematical Treatise in Nine Sections, 
by carrying out the following steps: 


(a) Take 200 as an initial approximation and reduce 
the roots by 200 through the transformation 
y =x — 200. 

(b) With 60 as an approximation to the roots of the 
transformed equation, make a second 
substitution z = y — 60. 

(c) By trial, nd an integral root z of the third 
equation and use it to obtain the desired root x. 


21. Employ the procedure of the previous problem to 
locate a root of the equation x? — 1,860,867 = 0. 
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CHAPTER 6 


The First Awakening: Fibonacci 


Algebra is generous, she often gives more than is asked of her. 


D’ALEMBERT 
In western Europe the period from the bar- 
The Decline and Revival of Learning barian invasions of the fth century until the 
eleventh century, often called the Dark Ages, 
The Carolingian marked the low ebb of mathematics. Roman 
Pre-Renaissance culture and thought persisted in the early part 


of the sixth century—at least in certain places, 
such as Italy and southern Gaul, where the Latin language was still in use. But by the 
beginning of the seventh century, almost nothing remained of the civilization that had 
ourished for a millenium in the Mediterranean lands. This was the darkest part of Eu- 
rope’s Dark Ages. The time was one of complete intellectual stagnation, no less so in 
mathematics than in science and philosophy. As a cloud of ignorance began to settle over 
the greater part of Europe, the Christian Church became the sole custodian of intellec- 
tual life, preserving in the cloisters of a few monasteries what feeble spark of learning 
remained. 

The part played by the Church at this time cannot be too greatly stressed; for as the 
Roman Empire in the West collapsed, the Church emerged as the one stable institution 
among the ruins. It alone had the organization, the dedication, and the educated men to 
provide the leadership that was so badly needed by the new society that was coming 
into being. Although the Church did not immediately step into the place of the state as 
the provider and director of learning, it was not long before the Church was compelled, 
if only to provide a literate clergy, to concern itself with education. There ultimately 
grew up a system of monastery schools that by the end of the Dark Ages was almost 
as complete and comprehensive as the municipal system that had passed away with the 
Roman Empire. 

It cannot be said that the early monastery schools were ideal centers of learning. 
The essential aim of the schools was to produce ecclesiastical leaders, and intellectual 
life there was nourished less by the great writers of antiquity than by the works of the 
fathers of the Church. Thus they taught, within the boundaries xed by the Church’s 
interests and doctrines, the bare elements of reading and writing rudimentary Latin, and 
summary explanations of biblical texts. Suspicion of the ancient pagan authors, which 
had contributed to the decline of learning in the early Christian era, lingered. Secular 
books were no longer read, except possibly in the form of extracts illustrating some moral 
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or grammatical point. Even Latin literature was studied more as a means of educating 
the student in the writing of serviceable Latin than for the ideas involved. 

Yet the intellectual heritage of the Roman world did survive all the negligence of the 
darkest ages. The contents of old manuscripts were preserved for later times only because 
they were copied during this period, and many others became better known because 
they were reproduced in different regions by monastic scribes. Although the scribe may 
have lacked a deep appreciation of the classical texts he was copying, he nonetheless 
preserved nearly all that was valuable in the Latin writings of the ancient world. By 
rescuing the remains of classical literature from destruction, he became the conduit that 
preserved the ancient culture from extinction and enabled its emergence into a European 
civilization. 

Certain kings gave to educational work in the Dark Ages a support second in im- 
portance only to that given by the Church. For a brief spell at the close of the eighth 
century, and during the whole of the ninth, Europe witnessed a remarkable resurgence 
of its intellectual strength brought on by a combination of favorable circumstances and 
extraordinary individuals. This revival of learning, usually described as the Carolingian 
Renaissance, had its origin and focus at the court of the Frankish king Charlemagne 
(742-814). On Christmas Day of 800, while Charlemagne was kneeling in prayer at 
St. Peter’s, the pope suddenly placed on his head a golden crown and hailed him as 
Holy Roman Emperor. The acclamation was well deserved, for Charlemagne was not 
only the most powerful ruler in Europe, for all practical purposes he was the only ruler. 
His dominion encompassed what is now France, western Germany, parts of Austria, and 
Italy as far south as Rome. 

Early in his reign Charlemagne realized that drastic reforms would be necessary to 
alleviate the pitiful conditions of ignorance among the clergy and the civil servants in his 
government. An important necessity for this reconstruction was a new curriculum and 
a new educational system. Charlemagne therefore invited the most renowned scholar of 
the day, the Englishman Alcuin of York, to become his educational advisor. Alcuin was 
eminently successful in accomplishing the tasks Charlemagne set before him. About 789, 
he ordained that every abbey and monastery throughout the realm should have its own 
school, with the seven liberal arts as divided into the quadrivium (arithmetic, geometry, 
astronomy, and music) and the trivium (grammar, rhetoric, and logic) a 1m part of 
the curriculum. Alcuin even dictated how these subjects should be taught, by writing 
elementary textbooks for each of them. 

Because a necessary condition for a revival of learning was the wide distribution of 
manuscripts, he became an energetic searcher after books. Emissaries were sent to Ireland, 
Spain, and Italy for texts that could be copied for the students to use. He also encouraged 
a form of writing by the introduction of a rounded, well-proportioned, and perfectly 
legible script that could be easily written and read. This was known as “Carolingian 
minuscule,” and its advantages were so great that it was adopted by virtually all the 
Italian printers of the fteenth century—and is the source of our printed alphabet today. 
Under Alcuin’s tutelage, the palace school at Aachen was transformed from a prominent 
center of court etiquette into a genuine place of learning. According to tradition, the king 
himself attended classes there, along with all the members of his family and the young 
nobles that he had marked for high position in church and state. In 796, Alcuin retired 
from the court and became abbot of the preeminent monastery of St. Martin at Tours, 
where he continued to teach and collect manuscripts. 
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Mathematics was relatively nonessential to the needs of a society that was still strug- 
gling with basic literacy, so that learning imparted under the heading of the quadrivium, 
with its strong mathematical base, was vulnerable to neglect in Charlemagne’s schools. 
Indeed, within the trivium the study of grammar and rhetoric received far more attention 
than logic did. Thus, it is not surprising that while the ninth century’s “little Renaissance” 
produced marked educational advances, there was no notable change in the mathematical 
climate of western Europe. Even Euclid’s Elements was lacking in Western libraries, so 
that the standard authority of the day on geometry was Boethius, whose shortcomings 
have already been noted. On the practical side, little was taught beyond the arithmetical 
operations needed for calculating the ecclesiastical calendar, the most pressing problem 
of which was establishing the exact date of Easter and other movable feast days. The 
school curriculum also included lessons on logical and mathematical thinking. A work 
ascribed to Alcuin, Propositions for Sharpening Youthful Minds, presented 53 puzzles in 
arithmetic. Although some could be solved through elaborate calculations, many required 
mathematical ingenuity. The best-known puzzle is the problem of three men and their 
three sisters having to cross a river in a boat holding only two people, where it is assumed 
that to be safe each girl must have no other companion than her brother. 

The Carolingian revival was short-lived, for just when it seemed that Charlemagne 
had solved the problem of European political disunity, the nal wave of barbarian in- 
vasion broke over the West. In the ninth and tenth centuries, Vikings from the North, 
Magyars from the East, and Saracens from the South simultaneously plundered the coasts, 
plains, and river valleys of the Frankish kingdom. Weakened internally by the question 
of royal succession and assaulted from without by these new invaders, the Carolingian 
empire fell apart into pieces that would one day become the separate nations of France, 
Germany, and Italy. Of the three incursions, the one by the Vikings was the most persis- 
tent and the most serious; for 200 years the Northmen kept the whole of the West in a 
state of turmoil, laying waste the lands on the seaboard. A strong government might have 
repelled or lessened this evil, but the rising nations of western Europe were everywhere 
still weak and completely incapable of dealing with the marauders. 

The failure of Christian Europe to realize political unity under the Carolingian empire 
did not mean that just when it had started to develop as a cultural entity, it sank back 
into complete barbarism. Generally the educational establishment Charlemagne created 
continued to function during this period of torment, so that there always remained centers 
where learning was cherished. The losses that took place through the failure of some 
monastery schools were constantly made good by the fresh efforts of others. Never again 
would Europe face the possible extinction of literacy that had been the danger in the 
seventh century. 

The slow rise of science and mathematics to renewed prominence during the eleventh 
century corresponded to another transformation, that of the schools themselves. The 
monastic and palace schools of Carolingian Europe were the intellectual arenas for the 
revival of learning in the ninth century; here, all education aimed at a better understanding 
of Scripture and other sacred texts. Anyway, the monastery schools, irregularly and 
unpredictably staffed, were never intended for educating a large segment of society. The 
orientation of education changed as the foremost teachers and students of the time were 
attracted instead to the famous cathedral schools—among them Cologne, Tours, Liege, 
Chartres, Reims, and Paris. It was inevitable that in time the cathedral schools would 
themselves prove inadequate for the numbers who wished to attend them. Teachers who 
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were not members of the school settled in its vicinity and, with the sanction of the 
authorities, gave lectures on subjects that had no place in the circumscribed intellectual 
world of the Church. At rst a system of private initiative prevailed; teachers dispensed 
instruction in return for fees. Students transferred from one master to another at will, 
and bitter competition for students often took place. These early associations of students 
grouped around individual teachers paved the way for the universities of the thirteenth 
and fourteenth centuries. The acquisition of increased numbers of Latin versions of 
Arabic and Greek scientic texts transformed the curriculum, so that the trivium steadily 
declined in importance as logic and mathematics came to occupy the most prominent 
place in the scheme of study. But these radical changes in both the substance and social 
character of learning carry us somewhat ahead of our story. For in the eleventh century, 
the immediate future of learning lay not in the West but in the East, where the increasing 
splendor of Arabic civilization was set off against the continued intellectual darkness of 
Europe. 


Transmission of Arabic Learning to the West 


Two far-reaching movements of peoples had destroyed the last remnants of Mediter- 
ranean unity. The rst was the continuing in ux, across the Rhine and the Danube, of the 
Germanic tribes; the second, the rise of a new religious grouping of the Arab world, the 
religion of Islam. The expansion of Islam occurred over exactly one hundred years—from 
the death of Mohammed in 632 until the battle of Tours in 732, when the Arab armies, 
having penetrated the very heartland of France, were checked by Charles Martel (the 
grandfather of Charlemagne). The defeat at Tours put a stop to further Arab advances 
to the north, and the Arabs remained satis ed with bringing all Spain under their rule. 
In the year (711) in which they landed in Spain, the Arabs were battering, less success- 
fully, at the gates of Constantinople. The capital of the Byzantine Empire managed to 
survive Arab assaults until the fteenth century and thereby saved western Europe from 
Moslem conquest via the Balkan peninsula. As a result of the Arabic conquests, three 
sharply contrasting civilizations arose within the Mediterranean basin: the Byzantine, the 
Latin-European, and the Islamic. In varying degrees, each of these civilizations was heir 
to the late Roman Empire. 

The Arabs who overran the southern and eastern shores of the Mediterranean brought 
with them nothing that could be called scholarship; their science and philosophy, like their 
arts, came from the lands they had conquered. The Arabs, eager to absorb new ideas, 
began to collect the old manuscripts that had been reproduced in suf cient numbers 
to survive the wars attendant on the breakup of the Roman Empire and the lack of 
interest in the early Christians in antique learning. Thus the Arabs met the ideas of 
Aristotle, Euclid, Archimedes, and Ptolemy. They rendered a lasting service to Europe 
by industriously translating into their own tongue what one Arabic scribe after another 
would devoutly call the science of the Greeks. By the tenth century, nearly all the texts 
of Greek science and mathematics that were to become known to Western Christendom 
were available in Arabic copies. A complete version of Euclid’s Elements was obtained 
and translated about the year 800; and Ptolemy’s Megale Syntaxis—which became a 
preeminent, almost divine book—appeared in Arabic in 827 under the generally accepted 
name of the A/magest. The Arabs, by hastening to acquire the accumulated heritage of 
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late antiquity, preserved many a classic Greek work that would otherwise have been 
irretrievably lost to the Latin-speaking West. This, more than anything else, was Islam’s 
great and enduring contribution to the advancement of knowledge. 

Adding signicant material from Persia and India to the extensive foundation of 
Greek learning, the Arabs were able to build a structure of scientic and philosophical 
thought that was to make them the great scholars of the time. Baghdad, a new capital 
city established by the eastern Mohammedans, became for centuries one of the greatest 
centers of learning, quite surpassing any city in Western Christendom. At Baghdad were 
a library of immense proportions, an academy (known as the House of Wisdom) for 
teaching and study, and a host of translators able to take the writings of the classical past 
and turn them into accurate Arabic. By the tenth and eleventh centuries, mathematics 
was almost exclusively regarded as an Arabic science, as the perpetuation of the terms 
algebra and Arabic numerals indicates. 

The scholars of Islam were not so much making an original contribution, however, 
as they were more widely disseminating the developments in mathematics that had taken 
place among the Persians and Hindus. Hindu mathematics had evolved independent of the 
inuence of Greek mathematics; and unlike the Greeks who favored geometry, the Hindus 
had a lively interest in arithmetic and algebra. The so-called Arabic numerals, with the 
introduction of the all-important zero, constituted the most signi cant mathematical idea 
the Arabs borrowed from the East. The vast improvement their “new arithmetic” was over 
the arithmetic of the Latin world will be realized by anyone who tries to add, subtract, 
multiply, or divide using only Roman numerals. Arabic mathematicians also developed 
trigonometry for astronomical purposes, using the ratios we now call the trigonometric 
functions instead of the “chords of an angle” Ptolemy and the Alexandrians used. (The 
chord of an angle is the length of the chord standing on the arc of a circle whose radius 
is 60 and subtending a given angle at the center.) 

The roads by which Arabic learning came to the West ran not through the eastern 
Mediterranean (where the Christian Crusaders captured Jerusalem in 1099), but rather 
through Spain and Sicily. The scientic tradition was established later in Western Islam 
than in the East. The Moors (western Mohammedans from that part of North Africa once 
known as Mauritania) crossed over into Spain early in the seventh century, bringing 
with them the cultural resources of the Arab world. Cordoba, with its 600 mosques 
and its library of 600,000 volumes, by the middle of the century had risen to be the 
intellectual center of the western part of the Mohammedan empire, a counterpart of 
Baghdad in the East. The Greek scientic writings moved westward through the Islamic 
world and reached Spain by the ninth century. Thus, at a time when most learned men in 
Christian Europe were painfully studying secondhand abstracts—sometimes clear, more 
often confused accounts—of Greek works, students at the great Moorish schools of 
Cordoba, Toledo, Seville, and Granada were jealously guarding the originals. 

The impulse that Charlemagne gave to education, though losing force as time went 
on, suf ced to maintain a continuity of learning in Europe until the greater revival of 
the eleventh and twelfth centuries. The two centuries from 1050 until 1250 were ones 
of great intellectual excitement and social dynamism. Christendom, swollen with an 
increased population, armed with new feudal institutions, and inspired by the ideal of the 
Crusades, was everywhere pressing forward—over the Pyrenees into Moorish Spain, and 
into the Byzantine Empire and Palestine. Unlike the Carolingian renaissance, which was 
imposed arti cially from above, the renaissance of the twelfth century grew spontaneously 
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along with greatly changed material conditions. A passion for learning superseded the 
previous intellectual stagnation, as Europeans began to add to their inherited knowledge. 
The immediate problem for a Western scholar was to nd out where learning was to be 
had and to make the effort to go and get it. This was often a dif cult adventure, involving 
hardship, travel to remote and dangerous places, and perhaps an abjuration of faith. The 
discovery by Crusaders that the Moslems possessed a great store of knowledge set Europe 
buzzing, and to tap this new source of information scholars set out for those places at 
which contact between the Christian and Islamic civilizations was most intimate. 

The most obvious point of contact from which the Arabic materials were passed to 
the Latin West was the Spanish peninsula. Spain’s doors were opened by the Christian 
recovery of Toledo in 1085. Western students ocked to its centers of learning, eager to 
learn science as it was transmitted by the Arabs. As soon as it became known that the 
masterpieces of antiquity were locked up in the Arabic, many zealous scholars undertook 
to get access to them and render them into Latin. (It is useful to remember that Latin 
had become the exclusive vehicle for technical and intellectual subjects in the West, and 
remained the academic language until the eighteenth century.) The recovery of ancient 
science in the eleventh and twelfth centuries, augmented by what the Arabs themselves 
had contributed, marked a turning point in European intellectual history. 

At Toledo there arose a regular school of translation of Arabic books of science, 
drawing from many lands those who thirsted for this knowledge. Toledo was not the 
only intellectual clearinghouse. There were whole regions, such as the Norman kingdom 
of Southern Italy and Sicily, that became, because of their open character, forums for 
exchanging ideas and texts. The work of translation was extremely awkward. First the 
Arabic text had to be read aloud, and then rendered into Hebrew or current Spanish idiom. 
Finally a Christian translator turned it into Latin. The process was neither rapid nor free of 
error or misunderstanding, especially considering the intricacy of the scientic treatises. 
Moreover, medieval Latin was not yet equipped with an adequate supply of technical 
terms, so that the meaning of some of these in Arabic was imperfectly known to the 
translators themselves. At best, the Latin translations, having passed through the medium 
of two wholly different languages, were slavishly literal and reasonably accurate. At 
worst, the versions that nally reached the medieval student, with accumulated errors, 
bore but slight resemblance to the Greek originals. 


The Pioneer Translators: Gerard and Adelard 


The second half of the twelfth century saw the work of the most industrious and 
prolic of these pioneer translators from the Arabic, Gerard of Cremona (1114-1187). 
Although he had studied all the arts in Italy, he was especially interested in astronomy. 
Ptolemy’s works were not available to him, so Gerard was drawn to Toledo, where 
he learned Arabic from a native Christian teacher. There he produced a Latin version 
of Ptolemy’s great work on astronomy, the A/magest—probably with the new Arabic 
numerals. Gerard devoted his life to translating scientic works from the Arabic, and 
it is said that more of Arabic science passed into western Europe at his hands than in 
any other way. He is credited with having produced Latin versions of no fewer than 90 
complete Arabic texts, among them Archimedes’ Measurement of a Circle, Apollonius’s 
Conic Sections, and al-Khowarizm”s works on algebra. (It was known for some time 
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The pre-Copernican universe showing the earth as the center. From John 
Blagrave’s The Mathematical Jewel (1585). (Courtesy of Theatrum Orbis Terrarum Ltd.) 


that Gerard had translated Euclid’s Elements, but not until 1901 was the rst extant 
trace, Books X—XIII, found in the library of the Vatican.) The direction and scope of 
mathematical activity in the Middle Ages was very largely based on these translations. 
Another pioneer translator was the English monk Adelard of Bath (1090-1150), who 
traveled far and wide—to Spain, southern Italy, Sicily, Greece, Syria, and Palestine— 
seeking out the knowledge he had heard of. In the disguise of a Mohammedan student, 
Adelard attended lectures at Cordoba (about 1120) and succeeded in gaining an Arabic 
copy of Euclid’s Elements, which he subsequently translated. In this way, he made the 
geometry of the great Alexandrian known for the rst time in the Latin West. Some 
150 years after Adelard, Johannes Campanus brought out a new translation, which be- 
cause of its clarity and completeness, drove the earlier Latinized versions from the eld; 
it followed the original Greek text more closely than its predecessors, but still at some 
distance. This became the basis for the rst printed edition of Euclid’s Elements, which 
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The Copernican universe with the sun as the center. From Copernicus’s De 
Revolutionibus Orbium Coelestium (1543). (By courtesy of Editions Culture et Civilisation.) 


coming out in 1482, was the rst mathematical book of any importance to appear in print. 
(By then, books by ancient and modern authors were being printed daily, yet because of 
the dif culty of typesetting the gures, little or nothing mathematical had appeared.) 
By the late twelfth century what amounted to a torrent of translations from Arabic 
works had reached Europe, and by the thirteenth century many Greek works also had 
been translated. Of all that was obtained from Arabic sources, the philosophy of “the 
new Aristotle,” that is, the scientic works of Aristotle not previously available in Latin, 
was prized most. Aristotle’s Physics, Metaphysics, and New Logic (four advanced works 
on logic) had been translated and were beginning to be circulated. These writings were 
chie y responsible for a shift in educational interest toward speculative philosophy and 
science, which the churchman and scholar John of Salisbury (1115-1180) complained 
were becoming preferred to the history and poetry of his youth. In 1210, the teaching 
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of Aristotle was forbidden at the University of Paris, under pain of excommunication of 
the offending master. 

Arabic being the new language of science, it enjoyed a greater prestige during this 
period than Greek. Moreover, spoken Arabic was more accessible than spoken Greek, 
a knowledge of which had gradually faded in the Latin-speaking West. Consequently, 
the practice was to make translations from the Arabic versions of Greek works and 
not from the original Greek. Once Sicily had fallen into Norman hands (after Arab 
tule from 902 until 1091), it provided a point of contact by which the original Greek 
classics could nd their way into Europe. The region still retained a considerable Arabic- 
speaking population and had never broken off commercial relations with Constantinople, 
so conditions especially favored an exchange of ideas among Arabic, Greek, and Latin 
scholars. Thus, there appeared in Sicily, besides translations from the Arabic, some of 
the earliest retranslations to be made directly from the Greek. Ptolemy’s Almagest was 
rst translated into Latin from Greek in Sicily in 1163, some 12 years before it was 
rendered from Arabic by Gerard of Cremona at Toledo. Unfortunately, this version from 
the Greek gained no currency, and only the version from the Arabic was available in 
Europe until the fteenth century. 

These struggling translators received little or no remuneration and with few ex- 
ceptions enjoyed little or no fame. The only motive for their work was a devotion 
to truth and knowledge. Yet they accomplished a great feat; they renewed Greek sci- 
ence and philosophy in the West, adding to it the treasures of Arabic mathematics and 
medicine. Europe had never had this material before; the vast scientic and mathematical 
body of antique thought, from the Ionian philosophers and Aristotle to the Alexandrian 
mathematicians and Ptolemy, had never been translated into Latin at all. The late Ro- 
man Empire had almost abandoned the study of the Greek language, in which many 
of the masterpieces of antique learning remained, and Charlemagne’s scholars had been 
fully occupied in saving the Latin and Christian classics. The impossibility of draw- 
ing on the wellsprings of Greek culture had led to an impoverishment of knowledge 
and thought. But by the middle 1200s, when all that was worthwhile in Arabic learn- 
ing had been transmitted to Europe through Latin translations, Western scholars stood 
once again on the solid foundation of Hellenistic thought. It is wonderfully fortunate 
that the decline of Arabic scholarship and creativity did not occur before Europe’s 
own intellectual reawakening. When the real revival of learning came, and a genuine 
Renaissance took place in the 1400s, Islam had spent itself as a great force. But by 
then western Europe was prepared to accept the intellectual legacy bequeathed to it by 
earlier ages. 


The greatest mathematician of the Middle 


The Liber Abaci and Liber Quadratorum Ages was Leonardo of Pisa, better known by 


his other name, Fibonacci (a contraction of 
The Hindu-Arabic Numerals lius Bonaccio, “son of Bonaccio’”’). It is safe 

to say that the mathematical renaissance of 
the West dates from him. Fibonacci was born in Pisa about 1175 and educated in North 
Africa, where his father was in charge of a customshouse. As a young man, he traveled 
widely in the countries of the Mediterranean, observing and analyzing the arithmetical 
systems used in the commerce of the different countries. He quickly recognized the 
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enormous advantages of the Hindu-Arabic decimal systems, with its positional notation 
and zero symbol, over the clumsy Roman system still used in his own country. Returning 
to Pisa in 1202, Fibonacci wrote his famous Liber Abaci (Book of Counting), in which 
he explained the virtues of this number system “in order that the Latin race might no 
longer be de cient in that knowledge.” The rst chapter opens with the sentence: 


These are the nine gures of the Indians: 


9 8 765 43 2 1. 


With these nine gures, and with this sign 0... any number may be written, as will be 
demonstrated below. 


It was chie y by means of the second edition of this work, which appeared in 1228, that 
Christian Europe became acquainted with the Arabic numerals. 

Arabic numerals were not entirely new to Europe; Gerard had brought the system 
from Spain a half-century earlier. However, no book previously produced had shown by 
such a wealth of examples from every eld their superiority to the traditional Roman 
numeration. The Liber Abaci embodied virtually all the arithmetical knowledge of Fi- 
bonacci’s time, including much Arabic science, and gave original interpretations of all 
this material. As the mathematical masterwork of the Middle Ages, it remained a model 
and source for the next several hundred years. (Curiously, although the Liber Abaci cir- 
culated widely in manuscript, it was not printed in Italy until 1857, nor was it translated 
into English until 2002.) 

Although we have referred to our present number system as Hindu-Arabic, its origins 
are obscure and much disputed. The most widely accepted theory is that it originated in 
India about the third century, was carried to Baghdad in the eighth century, and_ nally 
was transmitted to western Europe by way of Moorish Spain. 

Written number symbols appeared in India before the dawn of the Christian era. One 
of the earliest preserved examples is found in records cut on the walls of a cave in a hill 
called Nana Ghat, near Bombay. If correctly interpreted, these include 


1 2 4? 6 7 9 (Third century B.C.) 


The next important trace of numeral appears in carved inscriptions at Nasik, India. These 
Brahmi numerals of the second century A.D. form a ciphered system with the following 
rst nine symbols: 


~==Y¥PU75? 


(Brahmi, second century) 
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Historical evidence indicates that the idea of positional notation with a zero was known 
in India by the fth century, if not a century earlier. (It is clear that the form for a symbol 
for zero underwent changes from a mere dot to a small circle.) The numerals used in 
the eighth century are termed “Devanagari,” or “sacred,” numerals and the characters are 
essentially as shown here: 


2284S 7C%E 


(Devanagari, eighth century) 


How and when these numerals rst reached the Arabs is a question that has never been 
satisfactorily settled. During the early Arabic expansion, public decrees were written in 
Greek as well as Arabic, because Greek was widely understood in the Near East. The 
ruling caliph, to promote his own language, passed a law in 706 that forbade the use 
of Greek in favor of Arabic, but nonetheless decreed that the Greek alphabetic system 
could be used in writing out numbers. This indicates that the Hindu symbols had not 
yet penetrated as far as Damascus, the seat of the caliphs. Around 800, the system was 
de nitely known to the Arabs. The mathematician al-Khowarizm’ prepared a small book 
explaining the use of the Hindu numbers, including the use of zero as a place holder. 
When this was translated into Latin by Adelard of Bath in the 1100s, the numerals were 
incorrectly assumed to be of Arabic origin. 

The outward appearance of the Hindu numerals went through a series of changes in 
transit from India, and the Arabs selected from the various shapes those most suitable for 
handwriting. The symbols ultimately adopted by the western Arabs, or Moors, are the 
so-called Gobar numerals, from the Arab word for “dust.” They acquired their peculiar 
name from the custom of the Arab pupils who, lacking other writing materials, sprinkled 
white dust on a black tablet and made their computations with a stylus. It will be seen 
that the Gobar numerals resemble our modern numerals much more closely than the 
Hindu ones do: 


J\2S3 5678 


(West Arab Gobar, tenth century) 


These primitive western forms appear in a tenth century edition of Boethius’s Geometry. 
Because their introduction breaks the continuity of the text, probably they were not part 
of the original work but inserted by a copyist at a later date. 

Coming closer to our present-day notation, the oldest de nitely dated European 
manuscript known to contain the Hindu-Arabic numerals is the Codex Vigilanus, written 
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in Spain in 976. The nine symbols used are 
1 2 3 4 5 6 7 8 9 


(Spain, 976) 


What is interesting is that during their long migration from culture to culture, the In- 
dian number signs remained astonishingly constant in form (look at the shapes for 
6, 7, and 9). 

At rst there was stubborn resistance to the spread of the new numerals. In 1299, the 
city of Florence issued an ordinance forbidding merchants from using Arabic numerals in 
bookkeeping, ordering them either to use Roman numerals or to write out the numerical 
adjectives in full. This decree was probably due to the great variety of shapes of certain 
digits, some quite different from those now in use, and the consequent opportunity for 
ambiguity, misunderstanding, and outright fraud. A 0 can be changed to a 6 or 9 without 
dif culty, but it is not so easy to falsify Roman numerals. If we add to this the confu- 
sion and insecurity that the zero produced in the minds of ordinary people (who could 
understand a symbol that meant nothing at all?), and the scarcity of scrap paper cheap 
enough to be thrown away after the computation was nished, it is easy to see why it 
took so long for Arabic numerals to come into general use. 

It did take a few more centuries, but the Arabic symbols were bound to win out 
in the end. Calculating with an abacus or a counting board and registering the results 
in Roman numerals was simply too slow a procedure. For the nal victory, no certain 
date can be set. Outside of Italy accounts were kept in Roman numerals until about 1550 
and, in the more conservative monasteries and universities, for a hundred years longer. 
After printed books were introduced in 1450, the form of the Arabic numerals became 
standardized. Indeed, so great was the stabilizing in uence of printing that the digits of 
today have essentially the same appearance as the digits of the fteenth century. 


Fibonacci’s Liber Quadratorum 


Fibonacci compiled another work of note, the Liber Quadratorum (Book of Squares). 
Although the Liber Abaci contains a few diophantine problems, the Liber Quadratorum is 
devoted entirely to diophantine equations of second degree. In the dedication, Fibonacci 
related that he had been presented to the Emperor Frederick II at court and that one 
of Frederick’s retinue, a certain John of Palermo, on that occasion prepounded several 
problems as a test of Fibonacci’s mathematical skill. One problem required that he nda 
number for which increasing or decreasing its square by 5 would give also a square as the 
result. It should be said that the problem was not original with John of Palermo, having 
been investigated by Arab writers with whom Fibonacci was unquestionably familiar. 
Fibonacci gave a correct answer, namely a: 


4 492 41 31 
yr+s=@y,  (r-5 =e. 
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Leonardo of Pisa (Fibonacci) 


(circa 1175-1250) 


(By courtesy of Columbia University, David Eugene 
Smith Collection.) 


Through considering this problem and others allied to it, Fibonacci was led to write the 
Liber Quadratorum (1225). 
For some idea of the contents of this remarkable work, let us consider a typical 


problem from it. Solve, in the rational numbers, the pair of equations 
x? bes u’, 


xr—x=v’, 


where x, u, and v are unknowns. A solution is obtained by taking any three squares that 
are in arithmetic progression, say, the squares a*, b*, and c’, and letting the common 
difference be d. Then 


a=b —d, C=ah +d. 


Fibonacci proposed a solution to the problem by giving x the value b?/d. For 


24, oP _ Pe +d) _ Be? _ (be\’ 
Se ge ge ee 

>. _ bt BP R= d) Ba? (ba\” 
eg gS ge Ne 


The simplest numerical example would be a? = 1, b? = 25, and c* = 49 (here, the com- 
mon difference is 24), and this illustration was furnished by Fibonacci. It leads to the 
solution x = 25/24: 


e423 = (3), x*>—x =(2). 


At no time did it seem to occur to Fibonacci that the real question in diophantine analysis 
was to nd all solutions, not just one. 
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In surveying Fibonacci’s activity, one must view him as a pioneer in the revival of 
mathematics in the Christian West. In mathematical content, his work does not surpass 
the work of his Arab predecessors. Fibonacci, far from being a slavish imitator of others, 
gave fresh consideration to the ancient knowledge and independently furthered it. Many 
of his proofs were original and, in some cases, his results were original also. 

Fibonacci’s work indicates a combination of inventive genius and a profound knowl- 
edge of earlier writers on mathematics. A striking illustration of Fibonacci’s ability was 
his observation that the classi cation of irrationals given by Euclid in Book X of the Ele- 
ments did not include all irrationals. This exists in a small treatise entitled F/os (meaning 
“blossom,” or “ ower’). Sent to Emperor Frederick II, the Flos was stimulated by the 
mathematical disputation held in the emperor’s presence at Pisa about 1224. It is an anal- 
ysis of 15 indeterminate problems, including two of the three questions posed by John 
of Palermo. Fibonacci stated that the second challenge put to him was nding a cube 
that with two squares and 10 roots should be equal to 20; in other words, the problem is 
to solve the equation 


x? + 2x? + 10x = 20. 


It is especially interesting that the rst mention of a cubic in Europe after the time of the 
Greeks should be the result of a mathematical joust, for (as we shall see) the solution 
of the general cubic equation came about in connection with another problem-solving 
contest. 

The specic cubic mentioned above can be written in the form 


x? x3 


5" 10 

To see that x cannot be a rational number, we use Fibonacci’s argument, which was 
substantially as follows. Suppose to the contrary that x were rational, say x = a/b, 
where gcd (a, b) = 1. The expression 

a a a? o a _ a(10b* + 2ab + a?) 

b 5b? 1063 103 
will not be an integer unless b> (and in turn b itself) divides 10b* + 2ab + a?. But this 
means that b must divide the difference 


(1067 + 2ab + a”) — (1067 + 2ab) = a’, 


which leads to the conclusion that b divides a. This contradicts the condition that 
gcd (a, b) = 1, so no rational root of the cubic equation exists. 

By checking each of the cases, Fibonacci next demonstrated that a root of the equa- 
tion could not be represented by any of the Euclidean irrational magnitudes 


atvb, Jatvb, Vative, or V Jat vb, 


where a and 5 denoted rational numbers. Hence, its construction could not be carried out 
with straightedge and compass only. This was the rst indication that there was more 
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to the number system than what could be constructed using the geometric algebra of 
the Greeks. Fibonacci contented himself with nding an accurate approximation to the 
required root. He gave it in sexagesimal notation, simply making the statement that 


x = 1322, 7, 42, 33, 4, 40, 


whose value, in decimal form, is 1.3688081075.... This was a remarkable estimate 
of the only real root of the cubic equation, correct to nine decimal places; and it was 
the most accurate European approximation to an irrational root of an algebraic equation 
that would exist for the next 300 years. But we are not told how the result was found. 
Although Fibonacci never revealed his sources, the possibility cannot be excluded that 
he had learned the solution in his travels. The same problem appears in the algebra of 
the great Persian poet and mathematician Omar Khayyam (circa 1050-1130), where it 
was solved geometrically by intersecting a circle and a hyperbola. 

Fibonacci, like the Arabic mathematicians before him, recognized that a quadratic 
equation can be satis ed by two values; yet he habitually rejected negative numbers as 
solutions. He took a step forward, however, in his Flos, when he interpreted a negative 
number in a nancial problem to mean a loss instead of a gain. 

Brahmagupta (circa 600) and Bhaskara (circa 1150), in writing common fractions, 
had used the scheme of placing the numerator above the denominator, without any line 
of separation. The Arabs at rst copied the Hindu notation, but later improved on it by 
inserting a horizontal bar between the two numbers. Fibonacci followed the Arab practice 
in the Liber Abaci. He habitually put the fractional part of a mixed number before the 
integral part, with juxtaposition used to imply their addition. A kind of ascending con- 
tinued fraction, which he called fractiones in gradibus (“step fractions”), was introduced 


by Fibonacci. His notation 


8, for example, was meant to be read 


10 10 
1 7 
* 70 * 10-10 
In the same way, the expression ———— signi es 
2 6 10 
7 5 1 


10” 10-6 10-6-2° 
His habit of indicating numbers from right to left was in uenced by the Arabs. 


The Works of Jordanus de Nemore 


A signicant, if less gifted, contemporary of Fibonacci was Jordanus Nemorarius, 
or Jordanus de Nemore (circa 1225). Virtually nothing is known with any certainty of 
his life or even his identity. His name appears four times in the Biblionomia, a library 
catalog compiled around 1250, so that it is reasonable to assume that he wrote during 
the rst part of the thirteenth century. A manuscript sometimes attributed to Jordanus 
contains the marginal note, “This is enough to say for the instruction of the students 
at Toulouse”; hence, he may have lectured at the University of Toulouse, which was 
founded in 1229. 

Such speculations aside, Jordanus is known to us only through his written works. 
Six of these are strictly mathematical treatises, dealing with arithmetic (number theory), 
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algebra, and astronomy. His De Triangulus in particular represents medieval geometry at 
its highest level by giving rigorous—and frequently new—proofs of Euclidean theorems. 
The proofs are derived largely from Arabic sources, which were themselves based on 
Greek mathematical texts. Worth noting are the three proofs Jordanus gives for the 
classical problem of trisecting an angle, two constructions for nding continued mean 
proportionals between two given lines, and a proof of Heron’s formula for the area of 
a triangle in terms of its sides (that is, A = \/s(s — a)(s — b)(s — c), where s is the 
semiperimeter). 

The largest and most original of Jordanus’s works is the De Numeris Datis. It is a 
text on advanced algebra that complements the Arabic treatises of al-Khowarizm* and 
Abi Kamil. There are 115 problems, divided into four books, offering a development of 
quadratic, simultaneous, and proportional equations; for the most part the material had not 
appeared elsewhere. The De Datis is wholly rhetorical, with letters of the alphabet used 
to represent general numbers. The format usually consists of a formal statement of the 
problem, a proof that more often than not appears as a series of instructions (tantamount to 
constructing equations), and then a speci c numerical illustration. The numbers occurring 
in the examples are written in cumbersome Roman numerals. Proposition 6 of Book IV 
illustrates Jordanus’s approach: 


If the ratio of two numbers together with the sum of their squares is given, then each of 
them is known. [Proof] Let the ratio of x and y be given. Let d be the square of x and c be 
the square of y; and let d+ c be known. Now the ratio of d to c is the square of the ratio 
of x and y. Hence the former is known. Consequently d and c are known. 


This can be expressed in modern algebraic notation as follows: If x/y = a,x? +)? =b 
are given and x7 =d and y* =c, then d/e = x?/y* =a’. But x? + y* =b implies 
that ((d/c) + 1)y* = b, which leads to y = /b/(a2 + 1). After giving his proof, Jor- 
danus offers the example x/y = 2 and x* + y* = 500. His rules provide the solution 
y = /500/(2? + 1) = 10, and so x = 20. 

Another problem is this: If the sum of the squares of the two parts of a given number 
added to their difference is known, then the two parts can be found. In modern notation, 
the two equations are x + y =a and x7 +y*+x—y=Db. Here, Jordanus’s example 
isx+y=10,x?+y*+x—y = 62, with solution x = 7 and y = 3. The text’s single 
cubic equation occurs in the concluding proposition: a/x? = b and a?/x = c produces 
the cubic x? = c/b’. 

As the rst Western mathematician consistently to employ letters of the alphabet 
to designate quantities, known as well as unknown, Jordanus advanced the evolution of 
algebraic symbolism. Yet this practice was overlooked by subsequent writers in algebra 
for some 350 years before Frangois Viéeta realized the facility to be gained through 
Jordanus’s lettering scheme. 

The two central mathematical gures of the European Middle Ages, Fibonacci and 
Jordanus, had a notable lack of successors during the next two centuries. Although the 
study of mathematics was not entirely abandoned in this so-called barren period, the sub- 
ject was in the hands of lesser talents who did not contribute work of lasting importance. 
For many of these, mathematics was a mere sideline to activities concerning the Church: 
in England, there was Thomas Bradwardine (1290-1349), who became Archbishop of 
Canterbury only a month before falling victim to the plague; in France, Nicole Oresme 
(1323-1382), whose career carried him from a professorship in Paris to a bishopric in 
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Brittany; and in Germany, Nicholas Cusa (1401-1465), appointed a cardinal by Pope 
Nicholas V. Mathematics was concerned with practical applications during this period, 
so the powerful mercantile cities fostered a growing use of the Hindu-Arabic numerals 
and the new arithmetic that went with them, whereas interest in the more advanced al- 
gebra promoted by Fibonacci and Jordanus languished. Contemporary Western scholars, 
more inclined to theology and metaphysics, did not care to invest in the labor required 
to learn mathematics. We shall shortly see that the ideas of Fibonacci and Jordanus were 
to enjoy a second life when revived by the Italian algebraists during the time that has 


come to be called the Renaissance. 


6.2 Problems 


The rst three problems appear in Fibonacci’s Liber 
Abaci. 


1. Two birds start ying from the tops of two towers 
50 feet apart; one tower is 30 feet high and the other 
40 feet high. Starting at the same time and ying at 
the same rate, the birds reach a fountain between the 
bases of the towers at the same moment. How far is 
the fountain from each tower? 


2. A merchant doing business in Lucca doubled his 
money there and then spent 12 denarii. On leaving, 
he went to Florence, where he also doubled his 
money and spent 12 denarii. Returning home to Pisa, 
he there doubled his money and again spent 12 
denarii, nothing remaining. How much did he have in 
the beginning? 


3. Three men, each having denarii, found a purse 
containing 23 denarii. The rst man said to the 
second, “If I take this purse, I will have twice as 
much as you.” The second said to the third, “If I take 
this purse, I will have three times as much as you.” 
The third man said to the rst, “If I take this purse, I 
will have four times as much as you.” How many 
denarii did each man have? 


The next three problems are taken from Fibonacci’s Liber 
Quadratorum. 


4. Given the squares of three successive odd numbers, 
show that the largest square exceeds the middle 
square by eight more than the middle square exceeds 
the smallest. 


5. Assuming that x and y are integers: 
(a) Find a number of the form 4xy(x + y)(x — y) 


that is divisible by 5, the quotient being 
a square. 


(b) Prove that if x + y is even, then the product 
xy(x + y)(x — y) is divisible by 24, and that 
without this restriction, 4xy(x — y)(x + y) is 
divisible by 24. [Hint: Consider that any integer 
is of the form 3k, 3k + 1, or 34 + 2 in showing 
that 3|xy(x + y)(x — y). Similarly, because any 
integer is of the form 8k, 8k + 1,..., or 
8k +7, then 8|xy(x — y)(x + y).] 


6. (a) Find a square number such that when twice its 


root is added to it or subtracted from it, one 
obtained other square numbers. In other words, 
solve a problem of the type 


9 
r4+2x =u’, x? —2x =v? 


in the rational numbers. 

(b) Find three square numbers such that the 
addition of the rst and second, and also the 
addition of all three squares, produces square 
numbers. In other words, solve a problem of 
the type 


roy=w, eoyoe=v 
in the rational numbers. [Hint: Let x and y be 
two relatively prime integers such that x? + y? 


equals a square, say, x* + y* = u?. Now note 
the identity 


ome ur —1 _ w+ti : 
u 5 = 5 . 


. Fibonacci proved that if the sum of two consecutive 


integers is a square (that is, ifn + (n — 1) = wu? for 
some wu), then the square of the larger integer will 
equal the sum of two nonzero squares. Verify this 
result and furnish several numerical examples. 


. The algebraic identity 


(a? + b’)\(c? +d’) = (ac + bdy + (ad — bey 
= (ad + bey + (ac — bdy 
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10. 


11. 


appears in the Liber Quadratorum. Establish this 
identity and use it to express the integer 

481 = 13-37 as the sum of two squares in two 
different ways. 


(a) Given rational numbers a and b, nd two other 
rational numbers x and y such that 
a’ +b? =x? + y’. [Hint: Choose any two 
integers c and d for which c? + d? is a square; 
now write (a* + b?)(c*? + d’) as a sum of two 
squares. | 


(b) Illustrate part (a) by expressing 61 = 5? + 6° as 
the sum of squares of two rational numbers. 


Solve the following problem, which is one of the 
tournament problems that John of Palermo posed to 
Fibonacci. Each of three men owned a share in a pile 
of money, their shares being ;, ;, and 2 of the total. 
Each man took some money at random until nothing 
was left. The rst man afterward returned ; of what 
he had taken, the second i, and the third i. When 
the amount thus returned was divided into three 
equal parts and given to each man, each one had 
what he was originally entitled to. How much money 
was there in the pile at the start, and how much did 
each man take? [Hint: Let t denote the original sum, 
u the amount each man received when the money left 
in the pile was divided equally, and x, y, and z the 
amounts the men took. Then 


lsx Zz 
rcerae) 


eee ee 
and 
Ook t 
—-+u=-, 
2 2 
2 t 
S+uss, 
oF t 
—+u=-, 
6 6 


which implies that 47u = 7¢.] 


The famous French scholar Gerbert d’ Aurillac 
(940-1003), who was later elected to the papal 
throne as Sylvester I, solved a problem that was 
considered remarkably dif cul t for the time, namely, 
to determine the sides of a right triangle whose 
hypotenuse a and area b? were given numbers. Use 
the technique employed in the Cairo papyrus (see 
page 77) to get Gerbert’s answer of 


faz + 4B + Ja? — 4B), 
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Gerbert, in his Geometry, determined the area of an 
equilateral triangle of side s to be 3s7/7. Show that 
he arrived at this conclusion by taking the value of 
3 to be equal to 2. 


The trisection of an angle can be accomplished by a 
construction described by Jordanus in his De 
Triangulis. Let LPOQ be a given angle. With its 
vertex as center, draw a circle with any radius r 
intersecting PO in A and QO in B. From O, draw a 
radius OC perpendicular to OB; now construct a 
chord AD cutting OC in a point E in such a way 
that DE = r. Finally, through O draw a line OF 
parallel to DA. 


To see that FOB is one-third ZPOQ, establish the 
following assertions: 


(a) OFED is a parallelogram, whence triangle OFE 
is isosceles. 

(b) LOAD = LODA = LOFE = LFOA =a. 

(c) The sum of the angles of triangle OFE equals 


2(90° — ZAOB + a) + a = 180°, 


or a = (2/3)ZAOB. 


Problems 14-17 are found in Jordanus’s De Numeris 
Datis. 


14. 


15. 


Book I, Problem 28. Solve the system of equations 


x+y=a, c/x+c/y = b. 


For example, if each of the two parts of 10 divide 40 
so that the sum is 25, what are the parts? [Hint: 
Notice that xy = (ca)/b.] 


Book I, Problem 15. Obtain x and y if 


x+y=a, vr+y4+x—-y=b. 

For example, if the sum of the squares of the parts of 
10 when increased by the difference of the parts 
equals 62, what are the parts? [Hint: First show that 
(x — yP +20 — y) = 2 — a] 
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16. Book II, Problem 22. Find the solution of the system 17. Book II, Problem 16. Solve the following equations 
of equations for x and y: 
x+y=a, (x +c)/y =b. a/x = y/b, x/y=c. 
For example, if 12 is separated into two parts so that For example, if the rst and fourth term of a given 
the rst increased by 2 is 3 of the other, what are the ratio are 18 and 2, and the ratio of the second and 
parts? third equals 4, what are the second and third terms? 


It is ironic that Fibonacci is remembered today mainly because 


6.3 The Fibonacci Sequence a nineteenth century French number theorist, Edouard Lucas, 
attached his name to a sequence that appeared in a trivial 
The Liber Abaci’s problem in the Liber Abaci. Fibonacci posed the following 
Rabbit Problem problem dealing with the number of offspring of a pair of 
rabbits. 


A man put one pair of rabbits in a certain place entirely surrounded by a wall. How many 
pairs of rabbits can be produced from that pair in a year, if the nature of these rabbits is 
such that every month each pair bears a new pair which from the second month on becomes 
productive? 


On the basis that none of the rabbits die, a pair is born during the rst month, so that 
there are two pairs present. During the second month, the original pair has produced 
another pair. One month later, both the original pair and the rstborn pair have produced 
new pairs, so that two adult and three young pairs are present, and so on. The gures 
are tabulated in the chart. 


Growth of Rabbit Colony 


Months Adult Pairs Young Pairs Total 
1 1 1 2 
2 2 1 3 
3 3 2 5 
4 5 3 8 
5 8 5 13 
6 13 8 21 
7 21 13 34 
8 34 21 55 
9 55 34 89 

10 89 55 144 
11 144 89 233 
12 233 144 377 


The point to remember is that each month the young pairs grow up and become adult 
pairs, making the new “adult” entry the previous one plus the previous “young” entry. 
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Each of the pairs that was adult last month produces one young pair, so that the new 
young entry is equal to the previous adult entry. 
When continued inde nitely, the sequence 


1, 1,2, 3,5, 8, 13, 21, 34,55, 89, 144, 233, 377,... 


is called the Fibonacci sequence and its terms the Fibonacci numbers. If we let F;, denote 
the nth Fibonacci number, then we can write this remarkable sequence as follows: 


2=14+1 or F3=F,(4+F, 
3=142 or Fy=h4+ Ff, 
§$=2+3 or F35=F34+ Fh, 
8=34+5 or Po =iy4+ Fs, 


In general, the rule for information is easily discernible: 
By =>. = 1, Fy Fao be for n > 3. 


That is, each term in the sequence (after the second) is the sum of the two that immediately 
precede it. Such sequences, in which from a certain point forward every term can be 
represented as a linear combination of preceding terms, are “recursive sequences.” The 
Fibonacci sequence is one of the earliest recursive sequences in mathematical work. 
Fibonacci himself was probably aware of the recursive nature of his sequence, but not 
until 1634—by which time mathematical notation had made suf cient progress—did 
Albert Girard write the formula in his posthumously published work L ’Arithmetique de 
Simon Stevin de Bruges. 

It may not have escaped your attention that successive terms of the Fibonacci se- 
quence are relatively prime. We will establish this fact next. 


No two consecutive Fibonacci numbers F, and F,4, have a factor d > | in common. 


Proof. Suppose that d > 1 divides F,, and F,,.,;. Then their difference 

Fu41 — Fy = Fy- will also be divisible by d. From this and the formula 

Fi, — Fy-1 = Fh-2, it can be concluded that d| F,-2. Working backward, we can 
show that F,-3, F,-4,..., and nally F) are all divisible by d. But F; = 1, which 
is certainly not divisible by any d > 1. This contradiction invalidates our supposition 
and therefore proves the theorem. 


| 


Because F3 = 2, Fs = 5, F7 = 13, and F\; = 89 are all prime numbers, one might 
be tempted to guess that F,, is prime whatever n > 2 is a prime. The conjecture fails at 
an early stage, for a little guring shows that 


Fig = 4181 = 37- 113. 


Not only is there no known device for predicting which F;, are prime but it is not even 
known whether the number of prime Fibonacci numbers is in nite. 
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On the positive side, one can prove that for any prime p, there are in nitely many 
Fibonacci numbers that are divisible by p and that lie at equal distances from one another 
in the Fibonacci sequence. For instance, 3 divides every fourth term in the Fibonacci 
sequence, 5 divides every fth term, and 7 divides every eighth term. 

We saw earlier that by the Euclidean algorithm, the greatest common divisor of 
two positive integers can be found after nitely many divisions. When the integers are 
suitably chosen, the number of divisions required can be made arbitrarily large. The 
precise result is this: for 7 > 0, there exist integers a and b for which exactly n divisions 
are needed in calculating gcd (a, b) by the Euclidean algorithm. 

To verify our contention, let us take a = F,,42 and b = F,,,,. The Euclidean algo- 
rithm for obtaining gcd (F;,42, F,+1) leads to the following system of equations: 


Fe =1- Fyai + Fa 
Fuai =1- F, + Fy-1 


fy=1-F3+ Fy 
F3=2-F)+0. 


Evidently the number of necessary divisions is n. For example, to nd the greatest 
common divisor of the numbers Fg = 21 and F7 = 13 by the Euclidean algorithm, one 
needs six divisions: 


21=1-1348 
13=1-8+5 
8=1-5+43 
S= 1-3-2 
3=1-241 
2=2-1+0. 


You no doubt recall that the last nonzero remainder appearing in the Euclidean 
algorithm for F;,2 and F;,,; furnishes the value of ged (Fi,42, Fn41). Hence, 


gcd (Fi42, Funai) = Fy = 1, 


which shows anew that consecutive Fibonacci numbers are relatively prime. 


Some Properties of Fibonacci Numbers 


The Fibonacci numbers have numerous easily derivable properties. One of the sim- 
plest, due to Edouard Lucas (1842-1891), is that the sum of the rst m Fibonacci numbers 
equals F,,42 — 1. For instance, when we add the rst eight Fibonacci numbers together, 
we get 


14+14+24+34+5+8+413+21=54=55-1, 
where 55 = Fi. 
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That this is typical of the general situation follows from the relation 


Pi, =F;- 
n=, — P; 


Fy = Fs — Fy 


scree = Fi+t — Fi 
aa = Frn+2 Fr Fri. 


When these equations are added, the left-hand side gives the sum of the rst n Fibonacci 
numbers, and on the right-hand side the terms cancel in pairs, leaving us only with 
Fij42 — F. The conclusion: 


Py Py Pap Py = Fe = Fo = Fee — 1 
Another Fibonacci property of interest is the identity 
(1) Fp = FraFnei t+ (“1,0 2. 


The last term means that the sign in front of the nal | alternates. This can be illustrated 
by taking, say, n = 6 and n = 7: 


Fe =8 =5-13-1=F;F;—-1, 
F7 = 13? =8-214+1=FoFe +1. 
To establish identity (1), let us start with the equation 
Fr — Fn Fae = FnFnat + Fn—2) — Fr Ft 
= (Fn — Frgi)Fn-1 + Fn Fn-2- 


Recalling that the rule of formation of the Fibonacci sequence gives Fy4; = Fy + Fri, 
the expression in parentheses can be replaced by —F;,_; to obtain 


Fo Fy Fay = (-DF? 


n n—-1 


= F,, Fy-2). 


Except for the initial sign, the right-hand side of this equation is the same as the left- 
hand side but with all the subscripts decreased by 1. By an entirely similar argument, 
F?_, — F,F,_2 can be shown to equal (-1)(F2_, — F,-1F,-3), whence 


n—1 
Fy — Fy—1Fn4i = (—1)(Fy_g — Fai Fa). 
After n — 2 such steps, we eventually arrive at 
Fr — Fn—1 Fav = (-1)" (Fy — Fi) 
= (iy te = 2 1) 
=(-1%7C 1) 
= (a1. 


which is what we sought to prove. 
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For n = 2k, where k is an integer, relation (1) becomes 
(2) Fy, = Fi Foy — 1. 


This identity is the basis of a well-known geometric deception whereby a square 8 units 
by 8 can be broken into pieces that seemingly t together to form a rectangle 5 by 13. 
To accomplish this, divide the square into four parts as shown in the left-hand diagram 
and rearrange them as indicated on the right. 


8 


The area of the square is 82 = 64, whereas the rectangle, which seems to have the same 
constituent parts, has an area 5 - 13 = 65, and so the area has apparently been increased 
by one square unit. The puzzle is easy to explain. The points a, b, c, and d do not all lie 
on the diagonal of the rectangle, but instead are the vertices of a parallelogram whose 
area is exactly equal to the extra unit of area. 

The construction can be carried out with any square whose sides are equal to the 
Fibonacci number F>,. When the square is partitioned as in the diagram, the pieces can be 
re-formed to produce a rectangle having a slot in the shape of a slim parallelogram (our 


Py, 
Py | 
Fry Pryy_ | 
Py | 
a 
Pope 


gure is exaggerated). The identity Fy,_;Fy,4; —1= Fe can be interpreted as asserting 
that the area of the rectangle minus the area of the parallelogram is precisely equal to the 
area of the original square. It can be shown that the height of the parallelogram—that is, 
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the width of the slot at its widest point—is 
1 


«Pee es 


When F >; is reasonably large (say, Fr, = 144, so that Fy,_2 = 55), the slot is so narrow 
as to be almost imperceptible to the eye. 

This is a convenient place to examine a remarkable connection between the Fibonacci 
numbers and what the Greeks called the go/den ratio. We start by forming the sequence 


Fi 
Un = z (n > 1) 
of the ratios of consecutive Fibonacci numbers. The rst few terms are 
m= p=l ws = = = 1.60 
uy=t=2 ug = & = 1.625 
i= 3 = 15 uy SF = 1.615 452 
u4=3=1.66... ug=#=1.619.... 


As the index increases, the sequence seems to tend to a number that falls between 1.61 
and 1.62. Let us assume that the limiting value actually exists; call it a. For any n > 1, 
we have 


Frat = Ei a Fy-1 _ Fy-1 
ae 2 7 i. 
which by virtue of our de nition of the u,’s, can be replaced by 
1 
Un = 1+ 2 
Un—-] 


As n increases, the left- and right-hand sides of the foregoing equation are getting closer 
and closer to a and | + 1/a, respectively, so that the equation as a whole is approaching 


1 3 
a=l4+- or a —-a—-1=0. 
a 


But the only positive root of this quadratic equation is 


a = 3(1 + J5) = 1.618033989..., 


the so-called golden ratio. Thus, the sequence of the ratios of consecutive Fibonacci 
numbers gives an approximation of the golden ratio, and the further out we go, the better 
the approximation becomes. 


5=3+ Fi, 
6=F, ++ fi, 
7T=F+th)4+ + Fy. 


1. It can be established that each positive integer is 
representable as a sum of Fibonacci numbers, none Write the integers 50, 75, 100, and 125 in this 
taken more than once; for example, manner. 
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2. 


6.4 


(a) Show that the sum of the rst n Fibonacci 5. One can prove that the greatest common divisor of 
numbers with odd indices is given by the two Fibonacci numbers is also a Fibonacci number; 
formula speci cal ly, 

PF, + Py + Fs +--+ Foy) = Pu. ged (F., F,) = Fi, whens =collGaae: 
[Hint: Add the equalities F; = Fy, 
FP, = F,— Fy, Fy = Fo — Fi,....] Verify this identity in the case of gcd(Fo, Fi) and 

(b) Show that the sum of the rst m Fibonacci gcd(Fi5, Fy9). 
numbers with even indices is given by the 6. Use Problem 5 to prove that for n > 2, F,,|F, if and 
formula only if n|m. 


[Hint: Use part (a) and the identity (b) 3/F 


Pot Fat Fe t--++ Fon 
= Foy — 1. 


7. Establish each of the following assertions: 


(a) 2\F, (that is, F, is even) if and only if 3|n. 
if and only if 4|n. 


Pit Fy t+ Fy t+) + Fon = Fong. — 1.) 


(c) Obtain the following formula for the alternating (c) 


4|F,, 1f and only if 6|n. 


sum of Fibonacci numbers: (d) 5|F,, if and only if 5|n. 


For any prime p # 2 or 5, it is known that either 


FP 


Fi- Fy 4+ Fy— Fate. +(-1)"'F, 
=(=1) "Fa 1: problem. ] 


[Hint: All these require the aid of the previous 


8. Show that the sum of the squares of the rst n 
Fibonacci numbers is given by the formula 


p-1 Or F,,1 is divisible by p. Conrm this in the 
case of the primes 7, 11, 13, and 17. 


Fey Fe 4 Fe 4.004? = Fy Fyat. 


From the formula F,,41/,-1 — F? = (—1)", conclude 
that consecutive Fibonacci numbers are relatively [Hint: Note that F? = F,(Fi41 — Fr—1) 


prime. 


= Fi Fast _ Hail 


In Section 3.3 we mentioned the ancient 


Fibonacci and the Pythagorean Problem problem of nding all right triangles whose 


sides are of integral length, the so-called 
Pythagorean Number Triples Pythagorean problem. From a number- 

theoretic point of view, solving this prob- 
lem amounts to determine formulas giving all triples (x, y, z) of positive integers that 
satisfy the equation x* + y? = z. Such a triple of integers x, y, and z is referred to as a 
Pythagorean triple. Both Euclid’s Elements (clothed in its geometric language) and the 
Arithmetica of Diophantus indicate a rule for making as many Pythagorean triples as 
you like. Choose any pair of integers, call them s and ¢, and let 


X= 266, y=s°-2, zas+e. 


It was left to later Arab mathematicians to show that all Pythagorean triples can be 
produced from these formulas, a result available to the well-traveled Fibonacci. 

Before giving Fibonacci’s argument, let us make several observations. First, notice 
that if (x,y,z) is a Pythagorean triple and & is any positive integer, then the triple 
(kx, ky, kz) arrived at by multiplying each of the entries by k is also a Pythagorean 
triple; for the relation x? + y? = z* implies that k?x? + k?y? = k?z?, or what is the same 
thing, (kx) + (ky) = (kz)?. Thus, from the triple (3, 4,5), we could get the triples 
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(6, 8, 10), (9, 12, 15), (12, 16, 20), and in nitely many more. But none of these is essen- 
tially different from the triple (3, 4, 5). It is more interesting to nd the basic Pythagorean 
triples—those that cannot be gotten by multiplying some other one by a suitable positive 
integer. These are termed primitive Pythagorean triples. 


Definition 


A Pythagorean triple (x,y,z) is said to be primitive if the three numbers x, y, and z 
have no common divisor d > 1. 


Some examples of primitive Pythagorean triples are (3,4,5), (5, 12,13), and 
(8, 15, 17), whereas (10, 24, 26) is not primitive. 

According to our de nition of a primitive Pythagorean triple (x,y,z), there is no 
divisor common to all three numbers. Actually, much more is true; no two of the numbers 
x, y, and z can have a common divisor d > 1. Phrased somewhat differently: the integers 
x, y, and z are relatively prime in pairs. To see this, let us suppose that gcd (x, y) = 
d > 1. By the fundamental theorem of arithmetic, there must exist some prime p with 
pld. Since d|x and dl|y, we should then have p|x and ply, which in turn imply p|x? 
and p|y*. But then p|(x? + y’) or plz”. An appeal to Euclid’s lemma now gives p|z. 
The implication of all this is that p is a common divisor of the three integers x, y, and 
z, a contradiction that (x, y, z) is a primitive triple. Because this contradiction arose out 
of the assumption that d > 1, we must conclude that d = 1. In the same way, one can 
verify that ged (y, z) = 1 and ged (x, z) = 1. 

Here is another property of primitive Pythagorean triples. 


If (x, y, Z) is a primitive Pythagorean triple, then one of the integers x and y is even, 
and the other is odd. 


Proof. By the result of the last paragraph, x and y cannot both be even, so that all 
we need to show here is that they cannot both be odd. As is well known, any odd 
number can be put in the form 2” + 1, where n is an integer. Thus, if x and y are 
odd, there exist appropriate choices of h and k for which 


x =2h+1 and y=2k+1. 
Then 
P=xrt+y=(2h+1%+2k+1¥ 
= 4h? 4+ 4h414+40+4k4+1 
= 40? +h+P+h)+2=4m 42. 


Because z is of the form 4m + 2 = 2(2m + 1), it is an even number. This, in its 
turn, forces z to be even (z cannot be odd, since the square of an odd number is also 
odd). But the square of any even number is divisible by 4, and 4m + 2 is clearly not 
divisible by 4. This situation being impossible, we see that x and y cannot both be 
odd. 


a 
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THEOREM 


By virtue of this lemma, there exist no primitive Pythagorean triples (x, y, z) all 
of whose values are prime numbers (you may supply your own argument). There are 
primitive Pythagorean triples in which z and one of x or y is prime, for instance, the 
triples (3, 4, 5), (11, 60, 61), and (19, 180, 181). It is not known whether in nitely many 
such triples exist. 

In a primitive Pythagorean triple (x, y, z), exactly one of x and y is an even integer. 
We shall hereafter write our triples so that x is even and y odd; then z must be odd 
(otherwise, ged (x, z) > 2). 

With the routine work out of the way, all primitive Pythagorean triples can be 
described in a straightforward manner. 


The triple (x, y, z) is a primitive Pythagorean triple if and only if there exist relatively 
prime integers s > t > 0 such that 


eH 28t. y=s"— 2, z=s4+?’, 


where one of s and t is even and the other is odd. 


Proof. To start, let (x, y,z) be any primitive Pythagorean triple. We have agreed to 
take x even, and y and z both odd, so it follows that z + y and z — y are even 
integers; say, z + y = 2u and z — y = 2v. Now the equation x? + y* =z? can be 
rewritten 


wary =(zt+ylz—y), 


whence on division by 4, 


G) = ()(G*)-™ 


Notice that u and v are relatively prime integers; for if gcd (u, v) = d > 1, then 
d|(u — v) and d|(u + v), or equivalently, d|y and d|z, which violates the condition 
that gcd (y, z) = 1. 

It can be proved that if the product of two relatively prime integers equals the 
square of an integer, then each of them is itself a perfect square. Granting this fact, 
we conclude that uw and v are each perfect squares. To be de nite, let us write 


u=s’, v=f, 


where s and ¢ are positive integers. The result of substituting these values of u and 
v is 
z=utvest+P, 
ysu-ves’-P, 
x? = 4uv = 48°’, 
or in the last case, x = 2st. Because any common divisor of s and ¢ divides both y 


and z, the relation gcd (y, z) = 1 forces gcd (s, ¢) = 1. It remains for us to observe 
that if s and t were both even, or both odd, then this would make each of y and z 
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even, an impossibility. Hence, exactly one of the pair s, ¢ is even, and the other is 
odd. 

It is easy to see that any triple (x, y, z) satisfying the conditions of this theorem 
is a Pythagorean triple. For if x = 2st, y = s* — t?, and z = s* + 2”, then the 
following identity holds: 


4 y =(Qst? +(92? PY 
= 4571? + 54 — 2571? + ¢4 
=st 429? 4H =(0° +7? =2?. 


Let us next show that the triple (x, y, z) is primitive. We assume to the contrary that 
x, y, and z have a common divisor d > | and argue until a contradiction is reached. 
Consider any prime divisor p of d. Observe rst that p ¥ 2, since it divides the odd 
integer z (one of s or ¢ is odd, and the other is even; hence, s? + ¢? = z must be 
odd). From p|y and p|z, we obtain p|(z + y) and p|(z — y), or put otherwise, p|2s* 
and p|2¢?. But then p|s and p|t, which is incompatible with gcd (s, f) = 1. In 
consequence, d = | and (x, y, z) is a primitive Pythagorean triple. 


CO 


We have a method for producing primitive Pythagorean triples, namely by means of 
the formulas 


x = 2st, y=s’— 2’, z=s°+P, 
and the theorem indicates that all primitive Pythagorean triples can be so obtained. The 
accompanying table lists some primitive Pythagorean triples arising from small values 
of s and ¢t. For each value of s = 2,3,4,..., 7, we have taken those values of ¢ that are 
relatively prime to s, less than s, and even whenever s is odd. 


S t x y Z 
2 i 4 3 = 
3 2 12 5 13 
4 1 8 15 17 
4 3 24 a 25 
5 2 20 21 29 
> 4 40 9 41 
6 1 12 35 37 
6 5 60 11 61 
7 2 28 45 53 
7 4 56 33 65 
7 6 84 13 85 


From this or a more extensive table, you might be led to suspect that if (x, y, z) is 
a primitive Pythagorean triple, then exactly one of the integers x or y is divisible by 3. 
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THEOREM 


Let us show that this is indeed the case. As members of a primitive Pythagorean triple, 
x, y, and z can be written 


x = 2st, y=s?-?, z=s°t+h? 


for suitable integers s and ¢. Recall that the square of any integer has either the form 
3k or the form 3k + 1. If either s? or ¢* happens to be of the form 3k (this is to say, 
if either 3|s* or 3|¢7), then 3|s or 3|t, in which case 3|x, and there is nothing more to 
prove. Thus, it sufces to assume that both s? and ?? take the form 3k + 1; to be speci c, 
let s* = 3k + 1 and f* = 3h + 1. Substituting, we get 


y=s—2? =3k+1)—GA4+1) = 3(k—A). 


This is simply the statement that 3|y. All in all, what we have proved is as follows. 


In a primitive Pythagorean triple (x,y,z), either x or y is divisible by 3. 


Let us turn to another of the famous tournament problems Fibonacci solved, namely, 
one equivalent to nding a number x such that both x? +5 and x? — 5 are squares of 
rational numbers; say, 


(1) x4+5=a? and x?-5=5". 


We shall see that the solution depends ultimately on knowing the general form of primitive 
Pythagorean triples. 

A solution is sought in the rational numbers, so let us express x, a, and 5 as fractions 
with a common denominator: 


Fibonacci’s Tournament Problem 


Substituting these values in equation (1) and clearing fractions gives 
(2) xi+5d=a?, x?-—5d? =b?. 
When the second equation is subtracted from the rst, we get 
10d? = a? — b? = (a, +b (ay — b)). 


The left-hand side is even, so that a, and b, must both be even or both odd. In either 
event, a; — b; is an even integer, say a; — bj = 2k, from which it can be inferred that 
a, +b; = 5d? /k. Now solve the last two equations simultaneously for a, and 5; to 
obtain 


If these two expressions are now substituted in equations (2), then one arrives at 


5a? (5d2\? 
x? + 5d? = (+4) — (=) +5d* +k’, and 
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5a? > (5d2\? 
2 a =(—_k a pa = a ke 
a? 2k ae ee ee 


which on addition yield the single condition 


5a2\" 
e+ (=) =x?, 


The point is precisely this: the three numbers k, 5d*/2k, and x; form a Pythagorean triple. 
As such, they must arise from a primitive Pythagorean triple and so can be written as 
5d? He 2 By 
k = (2mn)t, — =(m* —n’*)t, xX; =(m* +n°)t 


for some choice of m, n, and f. 
To eliminate , let us take the product of the rst two of these equations. The result is 


5d? = 4mn(m? — n”)t’. 

We are seeking values for the integers m and n that will make the right-hand side of this 
equation 5 times a perfect square. As a rst attempt, it is reasonable to set m = 5, so 
that the condition reduces to 

d? = 4n(5* — n7)t’. 
Evidently the right-hand side becomes a square when n = 4: 

ad’ =4.4(5* — 47)? = 16. 92? = (121). 
These values for m and n lead to 
xp =(m? 4+ r°)t = (2 +4) = 41. 
Putting the pieces together, we get 
x, 41t 41 


d  12t 12 
as a solution to Fibonacci’s tournament problem. 
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CHAPTER 7 


The Renaissance of Mathematics: 
Cardan and Tartaglia 


A mathematical problem should be difficult in order to entice us, yet not completely inaccessible, lest 
it mock at our efforts. 


DAVID HILBERT 


If the thirteenth century can be seen as the highest 


Europe in the Fourteenth point of medieval Europe, then perhaps the four- 
and Fifteenth Centuries teenth century was the lowest. Although the thirteenth 


century had given abundant promise for the future, 
The Italian Renaissance many events conspired to make the following century 

a period almost as dark as what followed the collapse 
of Rome. The af ictions were those classic riders of the Apocalypse: famine, plague, war, 
and death. The fourteenth century opened with a series of heavy rainfalls so constant and 
so widespread that chroniclers of the time compared it with the great ood of Genesis. 
Not only did the climate become wetter, but it turned signi cantly colder also, in what 
has been called the Little Ice Age. The cumulative effect was a disastrous crop failure and 
an attendant famine in which mortality increased alarmingly in the towns, some losing 
ten percent of their inhabitants in six months. Those who suffered malnutrition lacked 
resistance to disease. Upon a people weakened by hunger fell a worse calamity, the 
Black Death. The Black Death was bubonic plague, carried by brown rats—speci cally 
by a_ ea parasitic on brown rats—and easily spread in the crowded, dirty conditions of 
the medieval towns. The outbreak of the plague reached the Mediterranean in 1347, via 
Italian ships from the Crimea, the port center in the Black Sea. (Because the Crimea was 
the terminus of the greatest of the caravan routes, it is probable that the seeds of the epi- 
demic were brought from China.) The disease then swept in a great arc through western 
Europe, striking France in 1348 and af icting England a year later. Medical knowledge 
was hopelessly inadequate; nothing could be done to resist the attack. The Black Death 
raged at its worst for three years, and even when the worst was over it returned with 
lesser virulence at intervals of 12 to 15 years until the late seventeenth century. The Great 
Plague of London in 1665 was the last English eruption. In the absence of trustworthy 
vital statistics, it is impossible to make 1m estimates of the terrible mortality. At Paris, 
it is said, over 800 people died of it each day, and at Avignon 10,000 people were buried 
in a single mass grave in the rst six weeks. The few gures that we have indicate that in 
some towns half, in general perhaps a third, of the population was carried away, whereas 
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other regions were completely depopulated. Food shortages were aggravated by sickness 
in the agricultural districts. At Montpellier in France, so many inhabitants died that the 
town fathers invited repopulation from as far away as Italy. Peculiarly at peril were those 
whose occupations called for them to remain in the stricken towns: of cials who tried 
to preserve order, doctors and priests who stayed to aid and console the dying, scholars 
who continued their studies. These also perished in great numbers; and society, deprived 
of its natural leaders, was shaken and unstable for decades following. 

The smoke of war hung over the whole sad century. The most famous of these 
wars was that series of English invasions of France extending from 1338 until 1452 and 
known to us as the Hundred Years’ War. It dragged on for generations before either 
side won a permanent victory. Even the brief interludes of peace were far from tranquil. 
Thousands of soldiers refused to lay down their arms and instead formed wandering 
bands of brigands, the Free Companies of mercenaries, who pillaged the countryside and 
held for ransom those whom they captured. To this litany of afictions one must add the 
rst social revolts by the rural peasantry and the urban poor. Savage rebellions occurred 
in Flanders in 1323-1328, in northern France in 1358 (the famous Jacquerie, which gave 
its name to all other purely peasant risings), and in England, with the Peasants’ Revolt 
of 1381. 

People of the fourteenth century saw the future as an endless succession of evils; 
despair and defeat everywhere overwhelmed con dence and hope. The depressed mood 
of the time is preserved for us in the Danse Macabre, or Dance of Death, an actual dance 
in pantomime performed with public sermons, in which a_ gure from every walk of life 
confronts the corpse he must become. 

Yet the ultimate ruin by which Western civilization was threatened never materi- 
alized. By approximately 1450, the calamities of war, plague, and famine had tapered 
off, with the result that population increased, compensating for the losses from 1300 
on, and the towns began growing rapidly. Prosperity was once again possible, provided 
that public order could be restored. The great majority of the people of western Europe 
had become convinced that the ills of a strong monarchy were less to be feared than 
weakness of government, that rebellion was more dangerous to society than was royal 
tyranny. Thus, after two centuries of chaos, political security returned with the advent 
of the “new monarchies” of Louis XI in France (1461), Ferdinand and Isabella in Spain 
(1477), and Henry VII in England (1485). The rise of these strong national states marked 
the demise of feudalism and provided the solid foundation on which a new European 
civilization could be built. 

As the long-stagnant economy responded to the stimulus of the dramatic growth in 
population, western Europe experienced a recovery that seemed to many a remarkable 
rebirth. Europeans not only succeeded in restoring order, stability, and prosperity but 
also embarked on a series of undertakings that vastly expanded their literary and artistic 
horizons. To later generations this reawakening of the human intellect is known as the 
Renaissance. The word is the legacy of the great nineteenth-century historian Jacob 
Burkhardt, who in The Civilization of the Renaissance in Italy (1860) popularized the 
idea of the Italian Renaissance as a distinct epoch in cultural history, differentiated clearly 
from the preceding period and from the contemporary culture north of the Alps. In recent 
years, the whole concept of a “renaissance” has come under suspicion by those who 
claim that the greater period of cultural achievement came in the twelfth century. There 
is no longer any general agreement about the character of the Renaissance, its causes, 
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or even its geographical or chronological limits. Ultimately, the Renaissance cannot 
be disregarded; for medieval civilization—founded as it was on a basis of land tenure 
and an almost purely agricultural economy—could not continue inde nitely to absorb 
an expanding urban population and accommodate a money economy founded on trade 
without changing into something recognizably different. Thus, depending on context, we 
shall use the term Renaissance in either of its current senses: as a great revival of literature 
and the arts, with its reverence for classical culture, or as that period of transition (roughly, 
1350-1550) in which the decisive change from a largely feudal and ecclesiastical culture 
to a predominantly secular, lay, urban, and national culture took place. 

The reason that a cultural rebirth was experienced and nurtured rst in Italy was 
doubtless that Italy had not been as seriously affected by war and economic dislocation 
as the northern countries. (It had experienced many small wars but no great con ict.) At 
the beginning of the fteenth century, feudalism had disappeared in central and northern 
Italy, giving place to a vigorous urban society of politically independent city-states. The 
intellectuals and artists of these prosperous territorial states, hoping to bolster or replace 
the tottering traditions of medieval culture, thought they had found a model for their 
secular, individualistic society in the classical past. A cultivation of the Latin and Greek 
classics ourished with an intensity unknown since the decline of Rome. This “revival 
of classical culture” was one of the distinguishing characteristics of the Renaissance and 
one of the chief forces in its changing civilization. 

Two events helped to hasten this upsurge of interest in the literary remains of an- 
tiquity: the fall of Constantinople to the Turks (1453) and Johann Gutenberg’s invention 
of printing with movable, metallic type (about 1450). Long before the Arabs had sub- 
jugated Egypt, fugitive scholars from Alexandria had reached Constantinople with their 
books, making the fortress city the chief resting place of what was left of classical liter- 
ature in the original Greek. On May 29, 1453, the Ottoman Turks seized the great city; 
even though Constantinople had long been a mere enclave in Turkish territory, its fall 
stunned Christendom. This nal collapse of the Byzantine empire drove a host of Greek 
scholars to seek refuge on Italian soil, bringing with them a precious store of classical 
manuscripts. Many of the treasures of Greek learning, hitherto known indirectly through 
Arabic translations, could now be studied from the original sources. 


Artificial Writing: The Invention of Printing 


The invention of printing revolutionized the transmission and dissemination of ideas, 
thereby making the newly acquired knowledge accessible to a large audience. Handwrit- 
ten books were scarce and dear, and they had necessarily been the monopoly of the 
wealthy and scholars under their patronage. Those few books that were available to the 
public had to be chained down, and as further insurance against their loss, many bore 
maledictions damning anyone who stole, mutilated, or even approached them without 
washing his hands. When it became possible to issue books not in single copies, but in 
the hundreds or even thousands, the world of letters and learning was opened up to the 
moderately well-to-do classes everywhere. 

There is no need to belabor the importance of printing with movable type. Still, it 
should be stressed that the rst printing presses were made in the early fteenth century 
in medieval Germany, not in Renaissance Italy, and that Italian scholars for a long time 
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scorned the new process. Moreover, the stimulus that had led to the invention of printing 
was the typically medieval desire for quicker and cheaper ways of producing religious 
texts. There had been printing before Gutenberg, but Gutenberg’s Bible certainly heralded 
a new day. 

The rst form of printing in Europe, perhaps the rst form of printing on paper, 
was the block printing (the transference of ink from carved wooden blocks) of playing 
cards in the latter decades of the fourteenth century. Among the many block books 
produced, some attained considerable popularity. The best-known was the Poor Man’s 
Bible, a 40-page book of religious pictures with a minimum of inscriptions, intended for 
the instruction of the uneducated in the principal lessons of the Bible. What came in the 
fteenth century therefore was not the invention of printing but the notion of separate 
metal type for each letter. Of course, the production of paper made from linen helped 
popularize the new discovery; there would have been little use in a cheap method of 
duplication if the only material available had been expensive parchment. 

On this last point, a digression may be permitted. By the eighth century, when the 
advance of Islam produced the nal separation of East and West, Egyptian papyrus was 
no longer available. The monastic scholars therefore wrote on parchment made from the 
skins of animals, usually sheepskin or goatskin. Parchment was prepared for scribal use 
by a slow process that involved soaking the skin in a lye solution to dissolve organic 
materials, stretching it on a frame and rubbing it with a pumice stone for smoothness, 
and nally pressing the skin and cutting it to size. Parchment had many advantages over 
papyrus; it resisted dampness, and if a text were no longer required, it could be scraped 
off and the same writing surface could be used again. Even so, parchment was expensive, 
and without a cheaper material to print on, the invention of printing would not have been 
so useful and signi cant. 

The rst use of paper from hemp, tree bark, sh nets, and rags is carefully dated 
in Chinese dynastic records as belonging to the year 105, but this discovery like most 
was probably a gradual process. The secret of its manufacture was taught by Chinese 
prisoners to their Arab captors at Samarkand in the eighth century. For the next 500 
years, papermaking was an Arab monopoly until passed on by the Moors in Spain to the 
Christian conquerors. At the opening of the fourteenth century, paper was still a fairly 
rare material in Europe, imported from Damascus and turned out in small quantities from 
several newly established mills in Italy. By the end of the century, it was manufactured 
in Italy, Spain, France, and southern Germany and had largely displaced parchment as 
the standard writing material of all but the wealthy. Gutenberg’s famous Bible was one 
of the few early books printed on parchment, and each of his Bibles is said to have 
required the skins of 300 sheep. 

Once invented, the “divine art” of printing from cast movable type spread like 
wild re through central and western Europe, so that by the end of the century the names 
of 1500 printers were known. To ascertain accurately the number of books that all these 
presses produced before the year 1500 is impossible. According to the titles collected 
in various catalogs of incunabula, about 30,000 printed works appeared. Assuming that 
the editions were small, averaging about 300 copies, there would have been nearly 9 
million books (including pamphlets) in Europe by 1500, as against the few score thousand 
manuscripts that lately had held all the irrecoverable lore of the past. 

The rst printed books were little concerned with mathematics. Many mathematical 
works written in the mid-1400s, such as Regiomontanus’s treatise De Triangulis, did not 
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appear in print until very much later. The principal standbys of the earlier printers were 
the Bible (which appears in many editions, both in Latin and in the popular languages), 
books of meditation, and religious tracts of various sorts. Those mathematical works 
that did come off the presses were unoriginal, falling far below the level of the great 
thirteenth- and fourteenth-century mathematicians. The rst popular textbook, the Treviso 
Arithmetic, was published in 1478 at Treviso, an important mercantile town not far to the 
north of Venice. Essentially a list of rules for performing common calculations, it was 
written, claims the anonymous author, at the request of young people preparing to enter 
commercial careers. The Treviso Arithmetic was signi cant not so much for its content 
as for initiating a remarkable movement. Before the close of the fteenth century, over 
200 mathematical books had been printed in Italy alone. Euclid’s Elements, with the 
Latin commentary by Campanus of Novara, was published in 1482 at Venice and again 
in 1491 at Vicenza. Campanus lacked linguistic competence in Arabic, so this version 
contained numerous errors and barbarous terminology. In 1505, Zamberti brought out a 
new translation, working from a recovered Greek manuscript. 

One of the earliest European scholars to take advantage of the recovery of the origi- 
nal Greek texts was the mathematician-astronomer Johannes Miiller (1436-1476), better 
known as Regiomontanus, from the Latin name of his native town of K6nigsberg. The 
most distinguished scientic man of his time, Regiomontanus was active in translating 
and publishing the classical manuscripts available, including Ptolemy’s treatise on as- 
tronomy, the Al/magest. The fruits of this study were shown in his greatest publication, 
De Triangulis Omnimodis (On Triangles of All Kinds). The work was nished about 1464 
but remained unprinted until 1533. Trigonometry was one of the few branches of math- 
ematics to receive substantial development at the hands of the Greeks and the Arabs. In 
the De Triangulis, Regiomontanus systematically summed up the work of these pioneers 
and went on to solve all sorts of problems relating to plane and spherical triangles. The 
only trigonometric functions introduced were the sine and the cosine, but at a later date 
Regiomontanus computed a table of tangents. For all practical purposes, De Triangulis 
established trigonometry as a separate branch of mathematics, independent of astronomy. 

Calender revision was a growing concern at this time, particularly in regard to the 
calculation of the date of Easter. The Council of Nicaea (325 A.D.) stipulated that Easter 
must be celebrated on the rst Sunday following the rst full moon after the vernal 
equinox, and it xed the date of the vernal equinox, the rst day of spring, at March 
21 for all future years. The Roman or Julian calendar, introduced by Julius Caesar, was 
based on a year of 3654 days with a leap year every fourth year. This was not a precise 
enough measure, because the length of a solar year—the time it takes for the earth to 
complete an orbit around the sun—is apparently 365.2422 days. This small error meant 
that Easter receded a day from its solar norm every 128 years. 

Regiomontanus had set up an observatory and a private printing press in the city of 
Nuremberg. He published two calendars in 1472, one in Latin and the other in German. 
Although each calendar had appended to it the ecclesiastical dates of Easter for the years 
1475-1531, the Latin version also contained a differing set of dates calculated from 
Regiomontanus’s astronomical observations. His calendars enjoyed great popularity— 
calendars are among the oldest examples of printing with movable type—as evidenced 
by their sales and numerous reprintings. In 1475, Regiomontanus was invited to Rome by 
Pope Sextus IV to give advice on coordinating the calendar with astronomical events. He 
died shortly thereafter, suddenly and somewhat mysteriously. Some said he was poisoned 


306 


Chapter 7 The Renaissance of Mathematics: Cardan and Tartaglia 


by his enemies, but more likely he became a victim of a plague that was raging after the 
Tiber had over owed its banks. 

Calendar reform was forgotten after the death of Regiomontanus, and was not again 
viewed as imperative until the reign of Pope Gregory XIII. He brought together a large 
number of mathematicians, astronomers, and prelates in 1552 nally to remedy the defects 
in the Church’s reckoning of the dates of Easter. The Jesuit mathematician Christoph 
Clavius was put in charge of carrying out the necessary calculations. For this, he relied 
upon Erasmus Reinhold’s Tabulae Prutenicae (1551), named for Reinhold’s patron, the 
Duke of Prussia. These were far superior to any other astronomical tables available, 
having freely used observations that Copernicus had provided in his De Revolutionibus. 

The new calendar that was imposed on the predominantly Catholic countries in 
Europe, known as the Gregorian calendar, decreed that 10 days were to be omitted 
from the year 1582. This was accomplished by having October 15 immediately follow 
October 4 in that year. At the same time, Clavius amended the scheme for leap years: 
these would be the years divisible by four, except for those marking centuries; century 
years would be leap years only if they were divisible by four hundred. Because the edict 
came from Rome, Anglican England and her possessions resisted the changes. When 
England nally adopted the Gregorian calendar in 1752, the countryside erupted in riots 
as people demanded the return of their “lost days.” 

It is dif cult if not impossible to assess the in uence of this new trigonometric learn- 
ing on the great voyages of discovery in the late 1400s. At one time, historians thought 
that the Portuguese navigators in venturing south of the equator along the coast of Africa 
had used the tables of solar declination in Regiomontanus’s almanac, the Ephemerides 
Astronomicae; but it appears that the rst editions (1474) of this work contain no such 
tables. What is known is that Columbus carried a copy of the Ephemerides with him 
on his four trips to the New World. On one occasion, having read that Regiomontanus 
predicted a total eclipse of the moon for February 29, 1504, Columbus took advantage 
of this knowledge to frighten the natives into reprovisioning his ships. 

The period of Regiomontanus was also the time of Luca Pacioli (1445-1514), 
a Franciscan friar who was commonly called Fra Luca di Borga. Many scholars of 
this time felt the compulsive urge to bring together, within the pages of a large book, 
all known information in some given eld. There was a systematic compendium, or 
“summa,” for every interest and taste. Pacioli’s Summa de Arithmetica Geometria 
Proportioni et Proportionalita, published in Venice in 1494, was the most in uential 
mathematical book of that period. The rst comprehensive work to appear after the Liber 
Abaci of Fibonacci, it contained almost nothing that could not be found in Fibonacci’s 
treatise, which indicate how little European mathematics had progressed in nearly 300 
years. But as an encyclopedic account of the main mathematical facts inherited from the 
Middle Ages, the Summa goes far beyond what was taught in the universities. Written 
carelessly in Italian, it is notable historically for its wide circulation (perhaps due to the 
author’s explanation of the mechanics of double-entry bookkeeping). 


Founding of the Great Universities 


The universities that were being established were to become prominent in the culti- 
vation and spread of learning. The Latin universitas was originally a mere synonym for 
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communitas, a general word indicating a collection of individuals loosely associated for 
communicating ideas. Initially, the only educational centers were monasteries. Their pri- 
mary function was religious service, not intellectual, and they were disinclined to teach 
outsiders. They preserved, rather than added to, literature. As the number of laymen 
seeking education grew, schools attached to the churches of bishops became prominent 
as centers of learning. Cathedral schools were provided mainly for those who would 
enter the ranks of the “secular clergy” and carry on the work of the Church in the world, 
not apart from it. Such schools ourished as a sideline to the work of the bishop and 
were prone to be affected by the reputation of the local teachers, waxing and waning 
with the comings and goings of particular personalities. 

Cathedral schools were of course hardly conducive to the free ow of ideas. Thus, 
long before the formal beginnings of universities, assemblies of students gathered around 
an individual master or two who had no connection with the Church—who, however, still 
needed the permission of the bishop to teach. An excellent teacher became a celebrated 
gure, and students traveled from town to town in pursuit of some famous scholar whose 
reputation had reached their homelands. The force of personality of Peter Abelard (1079- 
1142) is said to have attracted students from every corner of Europe to his crowded lecture 
hall in Paris. Twenty of his pupils subsequently became cardinals, and more than 50 be- 
came bishops. There is an oft-told tale that when the theological writings of Abelard were 
condemned by the Church, the king of France suddenly forbade Abelard to teach in his 
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lands. On hearing the news, Abelard climbed a tree and his students ocked to hear him 
from below. When the king then prohibited him from teaching in the air, Abelard began 
lecturing in a boat; at this point the king relented. Abelard is especially important because 
his brilliance as a teacher popularized the cathedral school at Notre Dame as a center of 
higher learning, thus opening the way for the foundation of the University of Paris. 

The growth of the universities in the twelfth and thirteenth centuries was a natural 
consequence of a demand that the older cathedral and monastery schools were unable to 
satisfy. A developing body of secular knowledge that had a marked professional value 
(medicine, and especially law) and that required for its mastery protracted study under 
an eminent specialist began to make the university an indispensable institution. Students 
living in the centers made famous by the cathedral schools began to nd it necessary to 
organize in order to regulate their own conduct, to protect themselves from extortion by 
local citizens, and—because many were not native to the area—to secure legal rights. 
Thus, the students in voluntary association tended, like the merchants and craftsmen 
of those days, to form self-governing guilds and eventually to gain legal recognition 
through the charter of a king or a pope. The universities at Bologna (1158), Paris (1200), 
Padua (1222), Oxford (1214), and Cambridge (1231) can all trace their inception to 
this period. These embryonic universities bore little physical resemblance to what they 
later became, and there were great variations among the institutions of different towns. 
Not until the fteenth century did the universities acquire permanent buildings. Before 
this, teachers lectured in their own quarters or in rented halls, and general meetings 
took place in churches or monastic halls. Competition for eminent scholars gradually 
led to contracted salaries, so that as early as 1180 Bologna paid several professors from 
municipal funds; the selection of professors, however, remained a student prerogative. 
Students individually paid the master who taught liberal arts, because teaching skill was 
equivalent to the skill of any other tradesman. Teachers of theology, on the other hand, 
were forbidden to stipulate charges in advance—theology being a “spiritual gift’”—but 
were allowed to accept donations after a lecture was concluded. 

Paris and Bologna were the great “mother” universities, serving as models for the 
later universities that sprang up in every part of Europe during the next two centuries. 
The universities of Italy and southern France followed the academic pattern of Bologna, 
whereas those in northern Europe looked to Paris as the standard. Both schools devel- 
oped much the same methods of teaching and came to grant the same degrees, but they 
emphasized different studies and were organized differently. The rise of secular adminis- 
trative governments in Italy made legal studies the door to high civil of ce and pro table 
employment. Thus, at Bologna jurisprudence always dominated and little attention was 
given to theology and philosophy. For these subjects, the student went by preference to 
Paris, where canon law was secondary and civil law was not taught at all. Education in 
the north was everywhere still in the hands of the Church, so it was a matter of course 
that ecclesiastical studies should predominate at Paris and that the church authorities 
should claim a large share in university government. In Bologna, the university was a 
union of student guilds, which gained control over all academic affairs, save only the 
bestowing of degrees—which were licenses to teach. In Paris the system of organization 
was the reverse, with the governance of the university in the hands of the masters. One 
reason for this difference will be found in the differing ages of the students. At Bologna, 
with its interest in the “lucrative science” of law, many students were mature men who 
had already attained high civil position. The students at the faculty of arts at Paris, much 
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the largest faculty there, were too young (possibly 12 or 14 years old) and too poor to 
assert themselves in any similar way. 

Universities enjoyed an enormous prestige as custodians of learning in an age in 
which education was esteemed as at almost no other period in history. A city’s trade, 
population, and notability depended on the presence of a university, so that cities without 
schools were willing to underwrite universities that might secede from their established 
seats. The medieval universities had no permanent buildings and little corporate property, 
so it was simple for students to migrate to another city when for any reason they were 
dissatis ed. The masters, because they were entirely dependent for their livelihood on 
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meager tuition fees, had no choice but to follow. (Cambridge, for example, was raised 
to university status by a migration from Oxford in 1209.) 

Students obtained numerous privileges—including the right of trying their own mem- 
bers in practically all civil and criminal cases—through the potent threat of withdrawing 
to rival cities. A threatened secession from Bologna in 1321 was withdrawn on the terms 
that an offending magistrate should be publicly ogged and that the city should erect a 
chapel for the university. The students, having humbled the municipal authorities by a 
ruthless boycotting of recalcitrant teachers, went on to enforce a series of statutes gov- 
erning all phases of instruction. Each master had to give a certain number of lectures 
covering a prescribed minimum of work, might be ned for tardiness or for evading 
dif cult material on which he was supposed to expound, could not leave town without 
the permission of the student rector, and even on the occasion of his wedding was only 
allowed one day off. In the long run, the strength of the student body proved its un- 
doing. As the local authorities began to pay teachers’ salaries in order to propitiate the 
students and keep up the reputation of the local university, the state gradually gained the 
responsibility for appointments and supervision of faculty. 

Because the academic base consisted of the seven arts of the traditional trivium 
(grammar, logic, and rhetoric) and quadrivium (arithmetic, music, geometry, and astron- 
omy), super cially mathematics seemed to be important. Little attention was paid to the 
quadrivium, however, ostensibly because these studies had practical applications. Paris, 
Oxford, and Cambridge systematically discouraged all technical instruction, holding that 
a university education should be general and not technical. The real reason seems to 
have been that distinction could be more easily attained in theology and philosophy than 
in the sciences. By 1336, in an effort to stimulate interest in mathematics, a statute was 
passed at the University of Paris that no student could graduate without attending lec- 
tures on “some mathematical books.” It also appears that after 1452, candidates for the 
degree master of arts at Paris had to take an oath that they had read the rst six books 
of Euclid. Although the Renaissance was to prove to have been as much a landmark 
in mathematics as in other branches of learning, the university curriculum continued to 
provide for a literary rather than a scientic education. 


A Thirst for Classical Learning 


As the revival of commerce and the growth of town life in the fourteenth century 
gradually altered medieval culture, many efforts were made to shore it up or to replace 
it with something new. When neither the feudal nor the ecclesiastical tradition of the 
earlier period proved adequate, intellectuals of the Italian city-states looked to a more 
remote past to nd a congenial civilization. Most of the Latin authors—and as they later 
discovered, Greek authors—had written for an urban, secular, and individualistic society 
not unlike their own. Italian scholars devoted themselves with a passionate zeal to the 
study of classical writings, interpreting them in the light of the present age. Behind this 
“cult of the classics” lay the belief that antiquity, both Latin and Greek, offered a model 
of perfection by which to judge all civilizations; and in its literature could be found 
new solutions to all political, social, and ethical problems. There began a systematic 
and astonishingly successful search for mislaid or forgotten manuscripts, many of which 
still existed in only a few scattered copies. From one end of Europe to another, scholars 


Europe in the Fourteenth and Fifteenth Centuries 311 


rummaged through old libraries in towns and monasteries. The collecting, copying (at 
rst by hand and later, when the printing trade had developed, by press), and diffusion 
of the treasures they had unearthed was just the beginning. 

Manuscripts had to be edited to purify them from the many errors medieval copyists 
had made and to secure the correct form for each passage. Bibliophiles compiled gram- 
mars and lexicons and composed guides to ancient works, and commentaries on them. A 
tradition of critical judgment in dealing with authoritative texts emerged—a development 
quite impossible when the Church had had a monopoly on learning; this would be of 
great value once the interests of the educated turned toward scientic research. 

The Renaissance thirst for antique culture inspired a growing fashion of collecting 
libraries. This enthusiasm pervaded all branches of society, as the princes of the church, 
state, and commerce vied with one another in assembling books. Renaissance men ven- 
erated manuscripts just as their grandfathers had adored the relics of the Holy Land. (A 
sort of snobbery existed among some wealthy owners, who boasted that their collections 
contained no printed works.) The development of the Vatican Library in Rome during 
this period was largely the work of Pope Nicholas V (1395-1455), who had been, before 
his elevation to the papal throne, librarian to Cosimo de Medici. It cannot be said that 
the Vatican Library had substantial reality at this time, containing as it did a mere 350 
volumes in various states of repair. Nicholas dispatched agents all over Europe to collect 
manuscripts, with the authority to excommunicate those who refused to give them up. 
At the same time, some of the most distinguished scholars in Rome were set to making 
translations of the Greek works into Latin. By the time of his death, Nicholas had built 
the library to over 5000 volumes and made it one of the nest in Italy. While the primary 
purpose was to collect and preserve works on the history and doctrines of the Church, 
an increasing number of secular works found their way into the collection. Vespasiano 
da Bisticci, a writer of the time, said with some exaggeration, “Never since the time of 
Ptolemy had half so large a number of books of every kind been brought together.” 

For a time it seemed that the people of the Renaissance had far less intention of 
creating something new than of reviving something old, less an idea of moving forward to 
the future than of returning to the past. Like every fad, this exaltation of ancient life was 
carried to absurd extremes by some of its devotees. Literary clubs, called “academies” in 
the ancient Greek fashion, were formed, at which discourses on classical subjects were 
read and followed by discussion and debate. Greek was the language of the meetings 
and Greek names were adopted by the members. In imitation of the ancient custom, 
successful poets were crowned with wreaths of laurel. Classical ways of feeling, thinking, 
and writing cast such a deep spell over some scholars that they slipped into the habit 
of pretending to be Greeks or Romans, even going through the motions of reviving 
pagan religious rituals. Despite these excesses, the Renaissance Italians of the fteenth 
century performed an invaluable service to future generations by restoring the whole 
surviving heritage of Greek literature, editing all of it, and nally bringing out printed 
editions of the entirety. The accomplishment becomes even more impressive when we 
recall that the knowledge of ancient Greek script had almost disappeared in the West 
during the Middle Ages. As the range of Hellenistic prose and verse was brought back 
into the mainstream of Western scholarship, there developed an ideal of education for 
general human cultivation. This new attitude toward learning differed so markedly from 
the strictly utilitarian or professional objective of study that had dominated the centuries 
just preceding that it engendered an entirely new experience. 
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All this activity on behalf of the classics directly in uenced the universities, gradu- 
ally transforming the prevailing curriculum to the humanities. The term “humanities” is 
simply a translation of the ancient Latin phrase studia humanitatis and was used in the 
Renaissance to mean a clearly dened set of scholarly disciplines (grammar, rhetoric, 
poetry, history, and moral philosophy) based on the study of the classics of Greece and 
Rome. The humanities of the Renaissance were not the seven liberal arts of the Middle 
Ages under another name; for the humanities omitted not only the mathematical disci- 
plines of the quadrivium, but also logic, adding three subjects that are best implied in the 
trivium, namely poetry, history, and moral philosophy. Thus, the Renaissance thinkers 
created a version of the classical curriculum that in all its variations was to become one 
of the great staples of the university, until pushed aside in the nineteenth and twentieth 
centuries by science, modern languages, and social science. 

Another sign of the breakup of the traditional disciplines was the conscious and 
deliberate creation of a new educational model, the gentleman. Gentlemanly training 
demanded that one be schooled in the classical writings, graceful in deportment, proper 
in style of dress, and of discriminating taste in music, painting, and the literary arts. In 
the universities an atmosphere of largely verbal scholarship arose, resting primarily on 
grammar—which meant reading, writing, and rigorous analysis of language and style 
of literary works—and on rhetoric, the art of persuasion and eloquence in speaking. 
Elegant Latin was regarded as essential for public documents, and Ciceronian phrases 
were henceforth reckoned among the tools of diplomacy. Close study and imitation of 
the ancients were held necessary to achieve this style. 

What distinguished the Greek revival of the Renaissance from its medieval fore- 
runners was not simply that Greek became part of the general curriculum of studies, 
but that the whole focus of interest was on the literary and historical masterpieces of 
Greek literature. By emphasizing the scholarly worth of the humanities as a molder of 
the gentlemanly character, the Renaissance educators subordinated, and sometimes even 
impugned, learning from experience and direct observation. The effect was to impede the 
study of the physical sciences and mathematics, which were beyond the scope of literary 
treatment, and if anything, offended the aesthetic senses of these men of letters. Although 
the Renaissance movement as a whole made relatively little progress in science, it never- 
theless indirectly opened the way to the Scientic Revolution of the 1600s by recovering 
more of the ancient learning than the medieval scholars had possessed. Although Euclid 
and Ptolemy, and even much of Archimedes, were known in the Middle Ages, such 
advanced authors as Diophantus and Pappus were rst translated during the Renaissance. 
By the 1600s, almost all the extant corpus of Greek mathematics was easily available 
to those interested in the subject. The result, apparent from the middle 1500s on, was a 
rapid and noticeable rise in the level of sophistication of European mathematics. 


The Renaissance produced little brilliant mathemat- 


The Battle of the Scholars ics commensurate with the achievements in litera- 
ture, painting, and architecture. The generally low 

Restoring the Algebraic level of prevailing mathematical knowledge stood 

Tradition: Robert Recorde in the way of any intellectual breakthrough. Al- 


though mathematics was included in the curriculum 
of most universities, it was maintained only in a halfhearted manner. Indeed, during the 
late 1400s, Bologna was practically the only place where the teaching of the subject 
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was properly organized, and even there it appeared chie y as a sideline to astronomy. 
There were few university chairs in mathematics, and no mathematician could command 
respect from the learned world without also being a teacher, scholar, or patron of the 
Renaissance humanities. 

Regiomontanus set the pattern for combining mathematics with humanistic learning. 
At the University of Vienna, he lectured enthusiastically on the classical Latin poets 
Virgil, Juvenal, and Horace, drawing a larger audience than if his subject had been 
astronomy or mathematics. On a visit to Rome he copied the tragedies of Seneca while 
learning Greek in order to undertake a more comprehensive translation of the Almagest 
and subsequently the Conic Sections of Apollonius. To make the Greek tradition generally 
available, Regiomontanus became an ardent advocate of the new craft of printing, even 
installing a printing press in his house for publishing his and other people’s manuscripts. 
Starting with Regiomontanus, mathematicians displayed an astute appreciation of the 
power of printing. A recurrent feature of the mathematical revival was an ambitious 
printing program designed to achieve a rapid dissemination of texts and translations. 

Mathematics bene ted immensely from the humanist passion—almost missionary 
zeal—for the discovery, translation, and circulation of ancient Greek texts. Though their 
main interest was in the literary classics, the humanists took all classical learning as 
their province, and mathematical works were cherished equally with literary ones in the 
retrieval. These manuscript collectors were responsible for assembling in Italy an almost 
complete corpus of Greek mathematical writings. The medieval scholar had generally 
been limited to Euclid, Ptolemy, and sometimes Archimedes, all in translation from the 
Arabic. By the fteenth century, typical holdings encompassed not only the works of 
the aforementioned authors in both Latin and Greek, but also Diophantus, Apollonius, 
Pappus, and Proclus. The mathematician, like many of his Renaissance contemporaries, 
often tried little more than to comprehend what the ancients had done, certain that this 
was the most that could be known. Although much of this effort was wasted, the return 
to original sources made a_ rst step toward an intellectual advance. It would then be 
only a matter of time before mathematicians were stimulated to go beyond the strict 
letter of the texts to develop new concepts and results strictly unforeseen by the Greeks. 
Besides, it was far better to read an author, say Euclid, directly than to read what some 
commentator thought an Arabic paraphrase of the author meant. 

By 1500 the situation had changed radically. The newly translated works had been 
absorbed, and scholars, discontented with looking backward to antiquity, were prepared to 
go beyond the mathematical knowledge possessed by the Greeks. It came as an enormous 
and exhilarating surprise when the Italian algebraists of the early 1500s showed how to 
solve the cubic equation, something the ancient Greeks and the Arabs had missed. (The 
advance in algebra, however, that proved to be the most signi cant was the introduction of 
better symbolism.) In arithmetic, developing commercial and banking interests stimulated 
improved methods of computation, such as the use of decimal fractions and logarithms. 
Trigonometry, in connection with its increasing use in navigation, surveying, and military 
engineering, began to break away from astronomy and acquire a status as a separate 
branch of mathematics. Re ned astronomical instruments necessitated the computation 
of more extended tables of trigonometric functions. In a monument to German diligence 
and perseverance, Georg Joachim (generally called Rhaeticus, 1514-1576) worked out 
a table of sines for every 10 seconds to 15 decimal places. Only in geometry was 
the progress less pronounced. Renaissance geometers tended to accept the elementary 
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properties found in Euclid’s Elements as an exclusive model for their conduct and to 
ignore developments that could not claim Greek paternity. 

Mathematicians were eager to make known the newly discovered ways in which 
they could aid the ordinary person, from teaching the merchant how to reckon pro ts 
to showing the mapmaker the principles underlying the projection of a spherical surface 
onto a plane. Even the sixteenth century found the rules of simple arithmetic and ge- 
ometry dif cult to comprehend, and long division was truly long in the time required to 
accomplish it. Thus, the middle 1500s saw an increasing number of books of elementary 
instruction, written in plain and simple language. These practical textbooks, although 
producing nothing new, were important in diffusing mathematics to an ever-increasing 
public. The great majority were in Latin, but a good many appeared in the vernacular. 
Fair examples are the works of the English mathematician Robert Recorde (1510-1558): 
The Grounde of Artes (1542, a popular arithmetic text that ran through 29 editions), The 
Pathewaie of Knowledge (1551, a geometry containing an abridgment of the Elements), 
and The Whetstone of Witte (1557, on algebra). Books on algebra became so numerous 
in Germany that the subject was long known in Europe as the “cossic art,” after the 
German word coss for “unknown” (literally, “thing”). Through the trend of producing 
textbooks in the popular languages, mathematics assumed increasing importance in the 
education of all cultured people and not just of specialists training for an occupation. 

Although hailed as the founder of the English school of mathematics writers, Robert 
Recorde was neither the author of the rst mathematics text printed in England nor the 
rst whose works appeared in the English language. The churchman Cuthbert Tonstall 
published the Latin De Arte Supputandi (1552), based largely on Italian sources, in 
the same year that he became Bishop of London; and the anonymous vernacular text 
An Introduction for to Lerne to Recken with the Pen and with the Counters came out 
in 1537. Nevertheless, Recorde’s series of works enjoyed the widest popularity, going 
through innumerable printings in his own and the next century. 

Recorde was educated at Oxford and then received the degree doctor of medicine in 
1545 from Cambridge. He gave mathematics lessons privately in both university towns, 
before setting up a medical practice in London. It is said that he was physician to 
King Edward VI and to Queen Mary. Sometime around 1551, he was appointed to the 
position of Surveyor of [silver] Mines and Monies in Ireland. Recorde’s good fortune 
must have been temporary, because he died in prison a few years later. The reason for 
his incarceration is not known, but it is most likely connected to his conduct in political 
of ce. 

Recorde’s Castle of Knowledge (1556), a textbook on astronomy written as a dialogue 
between scholar and master, is equally noteworthy for containing the rst discussion in 
England of the Copernican hypothesis of the earth’s motion. His position was guarded 
and noncommittal, perhaps in fear of ridicule or, worse, religious persecution. When the 
young scholar describes Copernicus’s ideas as “vaine phantasies,” the master counters, 
“You are too younge... you were best to condemne no thynge that you do not well 
understand.” 

Algebraists, who had oundered under the weight of a cumbersome syncopated nota- 
tion, began to introduce a symbolism that would make algebraic writing more ef cient and 
compact, one that was also better suited to the needs of typography. These improvements 
came intermittently, and there was a lack of uniformity in symbols, even for common 
arithmetic operations (the present division sign + was often used to indicate subtraction). 
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Also, different symbols were proposed in different countries, tried, and often discarded. 
The Italian algebraists were slow in taking up new notation, preferring the initial letters 
p and m for “plus” and “minus” at a time when the Germans, less fettered by tradition, 
were adopting the familiar mathematical signs + and —. Although the development of 
symbols for operations in algebra was proceeding rapidly, the quantities described in 
equations were still represented by actual instead of general numbers. As a result, there 
could not be a complete treatment of, say, the quadratic equation. Instead, methods of 
solution were described and direct solutions were offered for many special cases, each 
illustrated by equations having appropriately chosen particular numerical coef cients. 

The liberation of algebra from the necessity of dealing only with concrete examples 
was largely the work of the great French mathematician Francois Vieta (1540-1603), who 
initiated using consonants to represent known quantities and vowels for the unknowns. 
This one step marked a decisive change, not only in convenience of notation but also 
in the abstraction of mathematical thought. In moving from varied but specic examples 
such as 3x? + 5x + 10 = 0 to the general ax? + bx +c = 0, an entire class of equations 
could be considered at once, so that a solution to the abstract equation would solve all 
the specic equations at one fell swoop. 


The Italian Algebraists: Pacioli, del Ferro, and Tartaglia 


Italian mathematics of the 1500s can be summarized in the names of del Ferro, 
Tartaglia, Cardan, Ferrari, and Bombelli. The collective achievement of the rst four was 
the solution of the cubic and biquadratic equations and implicitly a deeper understanding 
of equations in general. This feat was perhaps the greatest contribution of algebra since 
the work of the Babylonians some 3000 years earlier. Third-degree, or cubic, equations 
were in no sense peculiar to the Renaissance, attempts at their solution going back to 
classical antiquity. We have seen that the problem of duplicating the cube, the so-called 
Delian problem, attained special celebrity among the Greeks. This problem is nothing 
more than the attempt to nd two mean proportionals between a (the length of the edge 
of the given cube) and 2a; that is, to solve 

a x y 

x yp 2a’ 
which requires substantially the solution to the cubic equation x? = 2a3. Another note- 
worthy cubic equation is encountered in Diophantus’s Arithmetica in connection with 
Problem 17 of Book VI: 


Find a right triangle such that the area added to the hypotenuse gives a square, while the 
perimeter is a cube. 


The manner in which Diophantus set up the problem leads to the cubic x3 + x = 4x? + 4. 
We do not know how the solution was obtained, for he said simply that x was found to 
be 4. Perhaps he reduced the equation to the form x(x? + 1) = 4(x* + 1) and saw that 
it was satis ed by x = 4. Arab writers contributed solutions to special cubics but seem 
to have believed that many cases could not be solved. Part of the poet Omar Khayyam’s 
(circa 1100) fame as a mathematician rests on his claim of being the rst to handle 
any type of cubic having a positive root. In the thirteenth century, John of Palermo 
proposed solving the equation x? + 2x* + 10x = 20 as one of his challenge problems to 
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Fibonacci in their contests. Fibonacci showed by geometry that no rational solution was 
possible, but he gave an approximate value for a root. Over the next several hundred 
years, mathematicians searched for a “cubic formula” that could be used to solve cubic 
equations in much the same way the quadratic formula was used for quadratic equations. 
The credit for nally discovering such a formula belongs to the Italian mathematical 
school at Bologna during the 1500s. 

The most complete and detailed fteenth century mathematical treatise was the 
Summa de Arithmetica, Geometria, Proportioni, et Proportionalita (1494) of Fra Luca 
Pacioli, a work in which the author borrowed shamelessly from earlier writers. The main 
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Fra Luca Pacioli 


(circa 1445-1514) 


(The Bettmann Archive.) 


contribution of the Summa (which was, after all, a summary) was to lay out the bound- 
aries of contemporary mathematical knowledge and so to supply a program of sorts for 
the renaissance of mathematics. Pacioli ended his Summa by asserting that the solution 
of the cubic equation was as impossible as the quadrature of the circle. This put off some 
mathematicians from the attempt but only induced others to try. In the rst or second 
decade of the sixteenth century, Scipione del Ferro (1465-1526) of the University of 
Bologna shattered Pacioli’s prediction by solving the cubic equation for the special case 
x3 + px =q, where p and q are positive. Pacioli may have personally stimulated this 
rst great achievement of Renaissance algebra, for in 1501-1502 he lectured at the Uni- 
versity of Bologna, where one of his colleagues was del Ferro. (Pope Nicholas V had, 
in 1450, proclaimed a general reorganization of the university and allocated four chairs 
to the mathematical sciences. By 1500 there were as many as eight professors at a time 
teaching mathematics there.) 

It was the practice in those days to treat mathematical discoveries as personal proper- 
ties, disclosing neither method nor proof, to prevent their application by others to similar 
problems. This was because scholarly reputation was largely based on public contests. 
Not only could an immediate monetary prize be gained by proposing problems beyond 
the reach of one’s rival, but the outcomes of these challenges strongly in uenced aca- 
demic appointments; at that time, university positions were temporary and subject to 
renewal based on demonstrated achievement. (As the printing of scientic periodicals 
became commonplace, this attitude of secrecy gradually shifted to the view that publica- 
tion of results was the scholar’s best path to recognition.) At any rate, loath to surrender 
an advantage over other competitors, del Ferro never published his solution and divulged 
the secret only to a few close friends, among them his pupil and successor Antonio Maria 
Fiore. This exchange was to lead to one of the most famous of mathematical disputes, 
its origin being a problem-solving contest at Venice in 1535 in which Fiore challenged 
Nicolo Tartaglia to solve various kinds of cubics. 

One of the most important restorers of the algebraic tradition, Nicolo Tartaglia 
(1500-1557), was also one of the least in uential. Tartaglia (whose actual family name 
was Fontana) was born in Brescia, in northern Italy. When the French sacked Brescia 
in 1512, many of the inhabitants sought refuge in the local cathedral. The soldiers how- 
ever violated the cathedral’s sanctuary and massacred the townspeople. The boy Nicolo’s 
father was among those killed in the butchery, and he himself was left for dead after 
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Nicolo Tartaglia 
(circa 1500-1557) 


(Source: ORE, OYSTEIN; CARDANO. © 1953 Prince- 
ton University Press, 1981 Renewed. Reprinted by per- 
mission of Princeton University Press.) 


receiving a severe sabre cut that cleft his jaw and palate. Although his mother found 
the lad and treated the wounds as best she knew, he was left with an impediment in his 
speech that earned him the cruel nickname Tartaglia, “the stammerer.” Later in life he 
used the nickname formally in his published works; he wore a long beard to cover the 
monstrous scars, but he could never overcome the stuttering. 

Although his early years were spent in direst poverty, Tartaglia was determined to 
educate himself. His widowed mother had accumulated a small sum of money so that 
he might be tutored by a writing-master. The funds ran out after 15 days, but the boy 
stole a copybook from which he subsequently learned to read and write. It is said that 
lacking the means to buy paper, Tartaglia made use of the tombstones in the cemetery 
as slates on which to work out his exercises. Possessing a mind of extraordinary power, 
he eventually acquired such pro ciency in mathematics that he earned his livelihood by 
teaching the subject in Verona and Venice. It is ironic that Tartaglia, a man dis gured bya 
sabre, contributed to the ultimate obsolescence of the sabre by his pioneering work Nova 
Scientia (1537), on the application of mathematics to artillery re. Tartaglia’s “new 
science” was, of course, ballistics. Even though the theories he developed were often 
completely wrong, he was the rst to offer a theoretical discussion as contrasted with the 
so-called experience of gunners. Anticipating Galileo, Tartaglia taught that falling bodies 
of different weights traverse equal distances in equal times. 

Tartaglia’s unfortunate early experiences may have encouraged a suspicious char- 
acter. Self-taught, he was jealous of his prerogatives and constantly impelled to try to 
establish his intellectual credentials. Either through intent or simple ignorance of the lit- 
erature, he had a habit of claiming other people’s discoveries as his own. An instance of 
this is the “arithmetic triangle” commonly attributed to Pascal, which Tartaglia asserted 
was his invention although it had previously appeared in print. Tartaglia seems to have 
felt that his lack of a classical education placed him at a disadvantage as a humanist; and 
in his General Trattato di Numeri et Misure (1556-1560), intended to replace Pacioli’s 
Summa, he adorned the preface with quotations from both Cicero and Ptolemy. 
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In 1530, Tartaglia was sent two problems by a friend, namely: 


1. Find a number whose cube added to three times its square makes 5; that is, nd a 
value of x satisfying the equation x* + 3x? = 5. 


2. Find three numbers, the second of which exceeds the rst by 2, and the third of 
which exceeds the second by 2 also, and whose product is 1000; that is, solve the 
equation x(x + 2)(x + 4) = 1000, or equivalently, x? + 6x? + 8x = 1000. 


For some time Tartaglia was unable to solve these problems, but in 1535 he nally 
managed to do so, and he also announced that he could effect the solution of any equation 
of the type x° + px* = q. Fiore, believing Tartaglia’s claim to be a bluff, challenged him 
to a public problem-solving contest. Each contestant was to propose 30 problems, the 
victor being the one who could solve the greatest number within 50 days. Tartaglia was 
aware that his rival had inherited the solution of some form of cubic equation from a 
deceased master, and he worked frantically to nd the general procedure. Shortly before 
the appointed date, he devised a scheme for solving cubics that lacked the second-degree 
term. Thus, Tartaglia entered the competition prepared to handle two types of cubics, 
whereas his opponent was equipped for but one. Within two hours, Tartaglia had reduced 
all 30 problems posed to him to particular cases of the equation x* + px = q, for which 
he knew the answer. Of the problems he himself put to Fiore, the latter failed to master 
a single one (most of which led to equations of the form x* + px? = q). 


Cardan, A Scoundrel Mathematician 


Girolamo Cardano (1501-1576), better known as Cardan, now appears on the scene. 
Cardan’s life was deplorable even by the standards of the times. He saw one son exe- 
cuted for wife-poisoning; he personally cropped the ears of a second son who attempted 
the same offense; he was imprisoned for heresy after having published the horoscope 
of Christ; and in general he divided his time between intensive study and extensive de- 
bauchery. Yet in his range of interests as well as vices, Cardan was a true Renaissance 
man: physician, philosopher, mathematician, astrologer, dabbler in the occult, and proli c 
writer. 

After a frivolous youth devoted mainly to gambling, Cardan began his university 
studies at Pavia and completed them at Padua in 1525 with a doctorate in medicine. 
Ostensibly on the grounds of his illegitimate birth but more likely owing to his reputation 
as a gambler, Cardan’s repeated applications to the College of Physicians in Milan were 
all turned down. It is not surprising that his rst published work, De Malo Recentiorum 
Medicorum Medendi Usu Libellus (On the Bad Practices of Medicine in Common Use), 
ridiculed the practitioners in Milan. By the time he was 50 years old, Cardan stood 
second only to Vesalius among European physicians and traveled widely to treat the 
well-known. So great was his fame that the archbishop of Scotland was among his 
patients. The archbishop was believed to be suffering from consumption; and Cardan, on 
the strength of a statement—later admitted to be false—that he could cure this complaint, 
journeyed to Edinburgh to treat the archbishop. Fortunately for the patient, and also for 
Cardan’s reputation, it turned out that he was suffering from attacks of asthma. When 
Cardan passed through London on the return trip, he was received by the young King 
Edward VI, whose horoscope he obligingly cast. The comfortable predictions of a long 
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life and prosperous future proved to be a great embarrassment when the boy died shortly 
thereafter. At various times, Cardan was professor of mathematics at the universities 
of Milan, Pavia, and Bologna, resigning each position as a result of some new scandal 
connected with his name. Forbidden to lecture publicly or to write or publish books, he 
nally settled in Rome, where for some strange reason, he obtained a handsome pension 
as astrologer to the papal court. According to various accounts, having predicted that he 
would die on a certain day, Cardan felt obliged to commit suicide to authenticate the 
prediction. 

When the news of the mathematical joust between Tartaglia and Fiore eventually 
reached Cardan in Milan, Cardan begged Tartaglia for the cubic solution, offering to in- 
clude the result in his forthcoming book Practica Arithmeticae (1539) under Tartaglia’s 
name. Tartaglia refused on the grounds that in due time he intended to publish his own 
discourse on algebra. Being credited for a formula is not the same thing as having a trea- 
tise, an original work, under your own name; it is the book, not the footnote reference, that 
history will cite. Cardan, in the hope of learning the secret, invited Tartaglia to visit him. 
After many entreaties and much attery, Tartaglia revealed his method of solution on the 
promise, probably given under oath, that Cardan would keep it con dential. Rumors be- 
gan to circulate, however, that Tartaglia was not the rst discoverer of the cubic formula, 
and in 1543 Cardan journeyed to Bologna to try to verify these reports. After examining 
the posthumous papers of del Ferro, he concluded that del Ferro was the one who had 
made the breakthrough. Cardan no longer felt bound by his promise to Tartaglia, and when 
Cardan’s work Ars Magna appeared in 1545, the formula and method of proof were fully 
disclosed. Cardan candidly admitted (at three places in the text) that he had gotten the so- 
lution to the special cubic equation x? + px = q from his “friend” Tartaglia, but claimed 
to have carried out for himself the proof that the formula he had received was correct. 
Angered at this apparent breach of a solemn oath and feeling cheated out of the rewards 
of his monumental work, Tartaglia accused Cardan of lying. Thus began one of the bitter- 
est feuds in the history of science, carried on with name-calling and mudslinging of the 
lowest order. 


Cardan wrote on a wide variety of subjects, includ- 


Cardan’s Ars Magna ing mathematics, astrology, music, philosophy, and 
medicine. When he died, 131 of his works had been 

Cardan’s Solution of the published and 111 existed in manuscript form, and he 

Cubic Equation had claimed to have burned 170 others that were un- 


satisfactory. These ran the gamut from Practica Arith- 
meticae (1539), a book on numerical calculation based largely on Pacioli’s work of 1494, 
to Liber de Vita Propria (1575), an autobiography in which he did not spare the most 
shameful revelations. His passion for the games of chess, dice, and cards inspired Cardan 
to write Liber de Ludo Aleae (Book on Games of Chance). Found among his papers after 
his death and published in 1663, this work broke the ground for a theory of probabil- 
ity more than 50 years before Fermat and Pascal, to whom the rst steps are usually 
attributed. In it he even gives advice on how to cheat, no doubt gained from personal 
experience. One of the ironic twists of fate in Cardan’s life is that his excessive gambling, 
which had cost him time, money, and reputation, should have helped him earn a place 
in the history of mathematics. 


Cardan’s Ars Magna 
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In permanent signicance, the Ars Magna (The Great Art) undoubtedly stands at 
the head of the entire body of Cardan’s writings, mathematical or otherwise. This work, 
which was rst printed in 1545, today would be classi ed as a text on algebraic equations. 
It makes very clear that Cardan was no mere plagiarist but one who combined a measure 
of honest toil with his piracy. Although negative numbers had become known in Europe 
through Arabic texts, most Western algebraists did not accept them as bona de numbers 
and preferred to write their equations so that only positive terms appeared. Thus, there 
was no one cubic equation at the time, but rather thirteen of them, according to whether 
the terms of the various degrees appeared on the same side of the equality sign or 
on opposite sides. In giving Cardan the formula for x? + px = q, Tartaglia did not 
automatically provide solutions for all the other forms that the cubic might take. Cardan 
was forced to expand Tartaglia’s discovery to cover these other cases, devising and 
providing the rule separately in each instance. 

Hitherto, Western mathematicians had con ned their attention to those roots of equa- 
tions that were positive numbers. Cardan was the rst to take notice of negative roots, 
although he called them “ ctitious,” and the rst to recognize that a cubic might have 
three roots. Another notable aspect of Cardan’s discussion was the clear realization of 
the existence of what we now call complex or imaginary numbers (the ghosts of real 
numbers, as Napier was later to call them). Cardan kept these numbers out of the Ars 
Magna except in one case, when he considered the problem of dividing 10 into two parts 
whose product was 40. He obtained the roots 5+ /—15 and 5 — ./—15 as solutions of 
the quadratic equation x(10 — x) = 40, and then stated, “Putting aside the mental tortures 
involved, multiply 5+ /—15 by 5 — /—15, making 25 — (—15), whence the product is 
40.” Cardan somehow felt obliged to accept these solutions yet hastened to add that there 
was no interpretation for them, remarking, “So progresses arithmetic subtlety the end of 
which, as is said, is as re ned as it is useless.” But merely writing down the meaningless 
gave it a symbolic meaning, and Cardan deserves credit for having paid attention to the 
situation. 

Among the innovations that Cardan introduced in the Ars Magna was the trick of 
changing a cubic equation to one in which the second-degree term was absent. If one 
starts with the equation 


xe+ax?+bx+c=0, 


all that is needed is to make the substitution x = y — a/3. With this new variable, the 
given equation becomes 


-P- @-@)] 


322 


Chapter 7 The Renaissance of Mathematics: Cardan and Tartaglia 


Girolamo Cardano 


(1501-1576) 


(Source: Princeton University Press.) 


If one sets 


then the last equation can be written 
y+py=4q, 


which is the so-called reduced form of the cubic. It lacks a term in y’, but otherwise the 
coef cients are arbitrary. 

Cardan solved the cubic equation x* + 20x = 6x? + 33 by this reduction technique. 
Through the substitution x = y — (—6)/3 = y + 2, it is transformed to the equation 


(y? + 6y? + 12y + 8) + 20(y + 2) = 6(y + 4y + 4) + 33, 
or simpli ed, 
yr + 8y =9. 

This last equation has one obvious solution, namely, y = 1; hence, x = y+ 2 =3 will 
satisfy the original cubic. 

Let us examine how Cardan managed to arrive at the general solution of the reduced 
cubic. Because the Renaissance was a period of the highest veneration of Greek math- 
ematics, it is not unexpected that his proofs should be based on geometric arguments, 


emulating (as Cardan himself emphasized) the reasoning of Euclid. The technique for 
dealing with the cubic 


(1) xi + px =q, p>o%,q> 0, 
although geometric, is equivalent to using the algebraic identity 


(2) (a — by + 3ab(a — b) = a? — BD’. 
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If a and b are chosen so that 3ab = p and a? — b> = q, then identity (2) becomes 
(a — by + p(a—b) =4, 


which shows that x = a — b will furnish a solution to the cubic (1). The problem therefore 
involves solving the pair of simultaneous equations 


ab =¢, 
ab=-—, 

3 
for a and b. To do so, one squares the rst equation and cubes the second, to get 


aS — 2a3b} + bo = q?, 


4a*b? = ay 
27 
When the equations are added, it follows that 
4 3 
(+b)! =a5 +20 P +O = 9? +, 
and so 
4p 
3, 23 
b> =,/q?+—. 
a + qo + 7 
If the equations 
4p 


a—b=q and a+b =,/q2+— 


are now solved simultaneously, then a> and b> can be determined; the result is 


1 4p? q gp 
b= -{- 2 NW A fF 
5 (-1+ J+) 2° V4" 27 
But then 
te, (CaP 
a= ris real 7 
7 Ce 
peo ee, 
27V 4737 
and consequently, 
iq, |, ep 4 ¢@, i? PP 
=¢@=b=4)2 24F fio cer ae lie 
v= Bo Va AVE 
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As Tartaglia feared, this last formula has forever since been known as Cardan’s formula 
for the solution of the cubic equation. The mathematician to whom we owe the chief con- 
tribution made to algebra in the sixteenth century is largely forgotten, and the discovery 
goes by the name of a scoundrel. 

Cardan illustrated his method by solving the equation 


x? + 6x = 20. 


In this case, p = 6 and q = 20, so that p>/27 = 8 and q*/4 = 100; whence the formula 
yields 


x = y V108 + 10 — J V108 — 10. 


As remarked earlier, Cardan was forced to treat an elaborate list of equation types, 
produced largely by his failure to allow negative coef cients. In solving the equation 


x? = pxt+q, p>O9O, q>0, 
he used a geometric argument corresponding to the identity 
(a+b~ =a? +b? 4+3ab(a +5), 


to arrive at the solution 


There is one dif culty connected with this last formula, which Cardan observed but 
could not resolve. When (q/2)? < (p/3)°, the formula leads inevitably to square roots 


of negative numbers. That is, ,/qg?/4 — p3/27 involves “imaginary numbers.” 
Consider, for example, the historic equation 


x= 15x +4, 


treated by Rafael Bombelli, the last great sixteenth century Bolognese mathematician, in 
his Algebra (1572). A direct application of the Cardan-Tartaglia formula would lead to 


624 Deaf ee 2 So. 


Bombelli knew, nevertheless, that the equation had three real solutions, namely 4, —2 + 
J/3, and —2 — V3. One is left in the paradoxical situation in which the formula produces 
a result useless for most purposes, yet in other ways three perfectly good solutions can 
be found. This impasse, which arises when all three roots are real and different from 
zero, 1s known as the “irreducible case” of the cubic equation. 


Bombelli and Imaginary Roots of the Cubic 


Bombelli was the rst mathematician bold enough to accept the existence of imagi- 
nary numbers, and hence to throw some light on the puzzle of irreducible cubic equations. 
A native of Bologna, he himself had not received any formal instruction in mathemat- 
ics and did not teach at the university. He was the son of a wool merchant and by 
profession an engineer-architect. Bombelli felt that only Cardan among his predecessors 
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had explored algebra in depth, and that Cardan had not been clear in his exposition. He 
therefore decided to write a systematic treatment of algebra to be a successor to Cardan’s 
Ars Magna. Bombelli composed the rst draft of his treatise about 1560, but it remained 
in manuscript form until 1572, shortly before his death. The preparation of his Algebra 
took considerably longer than Bombelli had foreseen, for as he wrote in the work: 


A Greek manuscript in this science was found in the Vatican Library, composed by Diophan- 
tus.... We set to translating it and have already done ve of the seven (sic) extant books. 
The rest we have not been able to ni sh because of other commitments. 


Tremendously enthusiastic over the rediscovery of the Arithmetica, Bombelli took 143 
problems and their solutions from its rst four books and embodied them in his Algebra, 
interspersing them with his own contributions. Although Bombelli did not distinguish 
among the problems, he nonetheless acknowledged that he had borrowed freely from 
Diophantus. (A manuscript of the Algebra was found in 1923; the absence of the 143 
problems borrowed from the Arithmetica suggests that Bombelli had not seen the Vat- 
ican copy when he rst wrote the work.) Whereas the works of Pacioli and Cardan 
contained many problems of applied arithmetic, Bombelli’s problems were all abstract. 
He claimed that while others wrote for a practical, rather than a scientic purpose, he 
had “restored the effectiveness of arithmetic, imitating the ancient writers.” The publica- 
tion of Bombelli’s Algebra completed a movement that began in Italy about 1200, when 
Fibonacci introduced the rules of algebra in the Liber Abaci. 

Bombelli’s skill in operating with imaginary numbers enabled him to demonstrate 
the applicability of Cardan’s formula, even in the irreducible case (all roots real) of 
the cubic equation. Assuming that the complex numbers behaved like other numbers in 
calculations, he made a circuitous passage into, and out of, the complex domain and 
ended by showing that the apparently imaginary expression for the root of the equation 
x? = 15x + 4 gave a real value. Bombelli had the ingenious idea that the complex values 


of the radicals 
J245/-121. and ~2=.4/—121 


might be related much as the radicals themselves; that is, they might differ only in a 
sign. This prompted him to set 


(liVJaDiagthl ad. 422. Sml Sao hy 1. 


where a > 0 and b > 0 are to be determined. As Bombelli said: 


It was a wild thought in the judgment of many; and I too for a long time was of the same 
opinion. The whole matter seemed to rest on sophistry rather than on truth. Yet I sought so 
long, until I actually proved this to be the case. 


Now the relation 2 + /—121 =a+bJ/—1 implies that 
244/=121 SG Sb =1y 


= a + 3a7°bV-1 + 3ab?(V/—1¥ + B19 


= a(a’ — 3b”) + b(3a’ — b?)/—1. 
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This equality would hold provided that 
a(a*—3b7)=2 and  d(3a*—b*)=11. 


If solutions are sought in the integers, then the rst of these conditions tells us that a 
must be equal to 1 or 2, and the second condition asserts that b has the value | or 11; 
only the choices a = 2 and b = 1 satisfy both conditions. Therefore, 


94./-1l=Q47=1). end 2=V=-D1=@=/=1). 


Bombelli concluded that one solution to the cubic equation x? = 15x + 4 was 


¢=— 0-4/1 S )o./= 191 


= JOs/a1 24 Clay 
=(22+V7-1)+(2-V-1)=4. 


In proving the reality of the roots of the cubic x? = 15x + 4, he demonstrated the ex- 
traordinary fact that real numbers could be engendered by imaginary numbers. From this 
time on, imaginary numbers lost some of their mystical character, although their full 


acceptance as bona de numbers came only in the 1800s. 


7.3 Problems 


1. 


3. 


Find all three roots of each of the following cubic 
equations by rst reducing them to cubics that lack a 


term in x?. 


(a) x? +11x = 6x74 6. 
(b) x3+6x?4 3x =2. 
(c) x? +6x? = 20x + 56. 
(d) x3 +64 = 6x? + 24x. 


Derive Cardan’s formula 


for solving the cubic equation x7 = px + q, where 
p>Oandg>0. 


Using Cardan’s formula, obtain one root of each of 
the following cubic equations. 


(a) x3+24x = 16. 

(b) x34 15x = 6x? 4+ 18. 
(c) x3°4+27x = 6x? +58. 
(d) x3 =9x 412. 


(e) x3 =6x7+15x 48. 
(f) x3 =3x7+27x +41. 


. Solve the cubic equation x3 + 6x? +x = 14. 


Problems 5—11 appear in Cardan’s Ars Magna. 


. Chapter 5, Problem 2. There were two leaders, each 


of whom divided 48 aurei among his soldiers. One of 
these had two more soldiers than the other. The one 
who had two soldiers fewer had four aurei more for 
each soldier. Find how many soldiers each had. 


. Chapter 37, Problem 1. The dowry of Francis’s wife 


is 100 aurei more than Francis’s own property is 
worth, and the square of the dowry is 400 more than 
the square of his property’s value. Find the dowry 
and the property value. 


. Chapter 31, Problem 1. Divide 8 into two parts, the 


product of the cubes of which is 16. [Hint: If the two 
parts are 4+ x and 4 — x, it follows that 
(44x)(4—x) = 716] 


. Chapter 5, Problem 4. There is a number for which 


adding twice its square root to it and twice its square 
root to this sum gives 10. What is the number? 
[Hint: Call the number x?; if y* = x? + 2x, then 

y? +2y =10.] 
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10. 


11. 


12. 


13. 


14. 


Chapter 17, Problem 3. An oracle ordered a prince to 
build a sacred building whose space would be 400 
cubic cubits, the length being 6 cubits more than the 
width and the width 3 cubits more than the height. 
Find these quantities. [Hint: If x is the height, then 
x(x + 3)(x + 9) = 400.] 


Chapter 32, Problem 3. Divide 6 into two parts, the 
sum of the squares of which is equal to the 
difference between their cubes. [Hint: Calling the 
two parts 3+ x and 3 — x leads to the equation 
x34 27x = x? +9] 


Chapter 38, Problem 1. Find two numbers whose 
difference is 8 and for which the sum of the cube of 
one and the square of the other is 100. [Hint: Let the 
numbers be called x + 2 and x — 6, so that 

(x + 2° + (x — 6) = 100.] 


The following method of Vieta (1540-1603) is useful 
in solving the reduced cubic x? + ax = b. By 
substitution of x = a/3y — y, the given equation 
becomes y° + by? — a3/27 = 0, a quadratic in y?. 
By the quadratic formula, 


3_ 1 ied yey A 
3 V rae 


from which y and then x can be determined. Use this 
method to nd a root of the cubics x? + 81x = 702 
and x? + 6x? + 18x + 13 = 0. [Hint: 

J142,884 = 378,] 


By making the substitution x = y + 5/y, 
of the cubic equation x? = 15x + 126. 


nd a root 


Use Cardan’s formula to nd, in these examples of 
the irreducible case in cubics, a root of the given 
equations. 


(a) x3 = 63x + 162. 
Hint: 81 + 30./—3 = (-3 +2/—3)'] 


(b) x =7x4+6. 


[ 10 4 
Hint: 32 w — 7+ GY 3). 


(c) x3 4+6=2x7 45x. 


[ 28 5 1 5 : 
Hint 2 oo 3 = a | 
mee ame € ne ) 


15. 


16. 
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The great Persian poet, Omar Khayyam (circa 
1050-1130), found a geometric solution of the cubic 
equation x* + a*x = b by using a pair of intersecting 
conic sections. In modern notation, he rst 
constructed the parabola x? = ay. Then he drew a 
semicircle with diameter AC = b/a? on the x-axis, 
and let P be the point of intersection of the 
semicircle with the parabola (see the gure). A 
perpendicular is dropped from P to the x-axis to 
produce a point Q. 


A Q Cc 


Complete the details in the following proof that the 
x-coordinate of P, that is, the length of segment AQ, 
is the root of the given cubic. 


(a) AQ’ =a(PQ). 

(b) Triangles AQP and PQC are similar, so that 
AQ PQ 5 b 
Po-~oc Q Q (= 0) 

(c) Substitution gives AQ’ + a7AQ = b. 


It is also possible to use the parabola y = x? for 
duplicating a cube of edge a. Draw a circle with 
center (a/2, 1/2) that passes through the origin (0, 0). 
Then the x-coordinate of the point of intersection of 
the circle and the parabola y = x? will serve as the 
edge of a cube double in volume to the given cube. 
Prove this conclusion. 
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After the cubic had been solved, it was 


Ferrari’s Solution of the Quartic Equation only natural that mathematicians should 


attack the quartic (fourth-degree) equa- 
The Resolvant Cubic tion. The solution was discovered during 

work on a problem proposed to Cardan 
in 1540. Divide the number 10 into three proportional parts so that the product of the 
rst and second parts is 6. If the numbers are called 6/x, x, and x*/6, the conditions 
laid down are clearly ful lled. In particular, the requirement that 


x3 


6 
- —= 10 
ras 6 


is equivalent to the quartic 
x* + 6x? + 36 = 60x. 


After an unsuccessful attempt at solving this equation, Cardan turned it over to his 
disciple Ludovico Ferrari (1522—1565). Ferrari, using the rules for solving the cubic, 
eventually succeeded where his master had failed. At least, Cardan had the pleasure of 
incorporating the result in the Ars Magna, with due credit given Ferrari. 

Ferrari, the son of poor parents, was taken into Cardan’s household as a servant boy 
at the age of 14. Although he had not received any formal education, Ferrari was excep- 
tionally gifted, and Cardan undertook to instruct him in Latin, Greek, and mathematics. 
Cardan soon made him his personal secretary, and after four years of service, Ferrari 
left to become public lecturer in mathematics at the University of Milan. He became 
professor of mathematics at Bologna in 1565 and died in the same year, having been 
poisoned with white arsenic—by his own sister, as rumor had it. 

Ferrari joined the fray surrounding the solution of the cubic by swearing that he 
had been present at the fateful meeting between Cardan and Tartaglia and that there 
had been no oath of secrecy involved. Always eager to defend his old master, Ferrari 
then challenged Tartaglia to a public disputation on mathematics and related disciplines, 
writing in a widely distributed manifesto: “You have written things that falsely and un- 
worthily slander Signor Cardan, compared with whom you are hardly worth mentioning.” 
Tartaglia’s counterstatement asked Ferrari either to let Cardan_ ght his own battles or to 
admit that he was acting on Cardan’s behalf; the challenge would be accepted if Cardan 
were willing to countersign Ferrari’s letter and if (because Tartaglia feared some sort of 
trickery) topics from the Ars Magna were excluded. Another acrimonious dispute ensued 
in which 12 letters were exchanged, full of charges and insults, with each party trying 
to justify his own position. In one of these retorts, Ferrari made the mistake of calling 
himself Cardan’s creation, allowing Tartaglia the satisfaction of thereafter referring to 
him as “Cardan’s creature.” 

The contest nally took place in Ferrari’s hometown of Milan in 1548 before a 
large and distinguished gathering. Perhaps aware of his own limitations, Cardan had the 
foresight to leave Milan for several days. There is no record of the proceedings except 
for a few statements to the effect that the meeting soon deteriorated into a shouting match 
over a problem of Ferrari’s that Tartaglia had been unable to resolve. The altercation ran 
into the dinner hour, at which time everyone felt compelled to leave. Tartaglia departed 
the next morning claiming to have come off the better in the dispute, but it seems more 
likely that Ferrari was declared the winner. The best evidence of this is that Tartaglia 
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HIERONYMI CAR 


DANI, PRASTANTISSIMI MATHE 


MATICI, PHILOSOPHIE, AC MEDICI; 


ARTIS MAGN€&, 


SIVE DE REGVLIS ALGEBRAICIS, 
Lib.unus. Qui & rotius operis de Arithmetica, quod 
OPVS PERFECTVM 


| Fserndapre tet Le@or, Algebraicas (Teali, dela Cof 
( uocant) nouis adinuentionibus.acdemonftrationfbus ab Authore ita 
locupletatas,ut pro pauculis ancea uulge critis.iam fepruaginta euaferint, Nes 
g folum , ubiumus numerus alteri,aut duo unt,uerum etiam,ubi duo duobus, 
aut tres uni ¢quales fuerint,nodum explicant. — Huncafi librumideo feors 
fim edere placuit,ue hoc abftrafifsimo, & plané inexhaufto totius Arichmeri 
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Tomos edentur, tanto auidius amplectantur,ac minore faftidio perdifcants 


Title page of Cardan’s Ars Magna (1545). (Source: M.L.T. Press.) 


lost his teaching post in Brescia, and Ferrari received a host of attering offers, among 
them an invitation to lecture in Venice, Tartaglia’s stronghold. 
Ferrari’s method for solving the general quartic could, in modern notation, be sum- 


marized as follows. First, reduce the equation 
x* + ax? + bx? +cex+d=0 
to the special form 


y+ py +ay+r=0, 
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in which the term in y° is missing, by substituting x = y — a/4. Now the left-hand side 
of 


yt py =-qy-r 


contains two of the terms of the square of y” + p. Let us complete the square by adding 
py? + p* to each side to get 


(y? + py = +2py? +p? = py +p —ay-r. 


We now introduce another unknown for the purpose of converting the left member of 
this equation into (vy? + p+z)?. This is done by adding 2(y? + p)z + z* to each side, 
and leads to 


O° +ptzy =py +p —qy-rt20° + pyzt7 
= (pt2z)y —qyt(p? —rt+2pz +z’). 


The problem now reduces to nding a value of z that makes the right-hand side, a 
quadratic in y, a perfect square. This will be the case when the discriminant of the 
quadratic is zero; that is, when 


A(p + 2z\(p* —r+2pz+2)=4q’, 
which requires solving a cubic in z; namely, 
823 + 20pz’ + (16p? — 8r)z + (4p* — 4pr — gq’) = 0. 


The last equation is known as the resolvent cubic of the given quartic equation, and it 
can be solved in the usual way. There are in general three solutions of the resolvent 
cubic, and y can be determined from any one of them by extracting square roots. Once 
a value of y is known, the solution of the original quartic is readily reached. 

If the procedure sounds complicated, an example from the Ars Magna might help 
to clarify the sequence of steps. Cardan considered (Chapter 39, Problem 9) the quartic 
equation x+ + 4x + 8 = 10x?, or equivalently, 


x*— 10x? = —4x — 8. 
Completing the square on the left-hand side, one gets 
(x? — 10)? = —10x? — 4x + 92. 
By adding the quantity 2z(x* — 10) + z? to each side, this equation is changed to 
(1) (x? — 10 +z) = (2z — 10)x? — 4x + (92 — 20z +. 27), 


where z is a new unknown. Now the right-hand expression is a perfect square if z is 
chosen to satisfy the condition 


A(2z — 10)(92 — 20z +z”) = 16, 
or after simpli cation, 


2 — 2527 + 1927 = 462. 
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This is a cubic equation from which z can be found. We start by letting z = u + 2; this 
substitution reduces the equation to the form 


A solution is uw = —#, so that z = 7. It is this value of z that should give squares on 
3 g q 


both sides of equation (1). The result of substituting z = 7 is 
(x? — 3P = 4x? — 4x + 1 = (2x — 1, 
whence x” — 3 = +(2x — 1). The positive sign gives 
x*—2x-2=0, 
and the negative sign yields 
x? +2x-4=0. 


In solving these equations by the quadratic formula, it is found that the four solutions of 
the original quartic equation are 


jg ha 8 1S Gd, A 4/5, 


The Story of the Quintic Equation: Ruffini, Abel, and Galois 


Our story has a postscript. We have seen that in the case of quadratic, cubic, and 
quartic equations, explicit formulas for the roots were found that were formed from 
the coef cients of the equation by using the four operations of arithmetic (addition, 
multiplication, subtraction, and division) and by taking radicals of various sorts. The next 
natural step was to seek similar solutions of equations of higher degrees, the presumption 
being that an equation of degree n should be capable of formal solution by means of 
radicals and probably by radicals of an exponent not larger than n. For close to 300 years, 
algebraists wrestled with the general equation of fth degree (the quintic equation) and 
made almost no progress. But these repeated failures at least had the effect of suggesting 
the possibility, startling at the time, that the quintic equation might not be solvable in 
this way. 

Paolo Rufni (1765-1822), an Italian physician who taught mathematics as well 
as medicine at the University of Modena, conrmed the suspicion of the impossibility 
of nding an algebraic solution for the general fth-degree equation. Ruf ni’s proof, 
which appeared in his two-volume Teorie generale delle equazioni of 1799, was sound in 
general outline although faulty in some details. The Norwegian genius Niels Henrik Abel 
(1802-1829), when he was about 19 years old, made a study of the same problem. At 
rst he thought he had found a solution of the general quintic by radicals, but later 
he established the unsolvability of the equation, using a more rigorous argument than 
Ruf ni’s. Abel fully realized the importance of his discovery and had it published in 
1824, at his own expense, in a pamphlet that bore the title Memoire sur les equations 
algébriques ou on démontre l’impossibilité de la résolution de l’equation generale du 
cinquiéme degré. So that expenses could be kept down, the whole pamphlet had to 
be condensed to six pages of actual print, making it difcult to follow the reasoning. 
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Thus, the signi cance of Abel’s masterpiece went unnoticed by contemporary scholars. 
When Europe’s leading mathematician, Carl Friedrich Gauss, duly received his copy, 
he tossed it aside unread with the disgusted exclamation, “Here is another of those 
monstrosities!” 

Abel’s opportunity came when he had the great good fortune to make the acquain- 
tance of August Leopold Crelle, a German civil engineer and enthusiastic mathematical 
amateur. At this time, Crelle was making plans to launch a new journal, which would be 
the rst periodical devoted exclusively to mathematical research. Abel eagerly accepted 
the invitation to submit articles, and the rst three volumes of the Journal fiir die reine 
und angewandte mathematik (Journal for Pure and Applied Mathematics), or Crelle’s 
Journal as it is commonly called, contained 22 papers by Abel. In the founding volume 
(1826), he expanded his earlier research into what now is known as the Abel-Ruf ni 
theorem: it is impossible to nd a general formula for the roots of a polynomial equation 
of degree ve or higher if the formula for the solution is allowed to use only arithmetic 
operations and extraction of roots. When Abel composed his paper, he was not aware 
that he had a precursor. He was later to write, however, in a manuscript Sur la résolution 
algébraique des equations (dated 1828 but only published after his death): “The only 
one before me, if I am not mistaken, who has tried to prove the impossibility of the 
algebraic solution of the general equation is the mathematician Ruf ni, but his paper is 
so complicated that it is dif cult to judge the correctness of his arguments.” 

Abel’s theorem on the unsolvability of higher equations applied to general equations 
only. Many special equations existed that were solvable by radicals, and the character- 
ization of these remained an open question. It was reserved for another young math- 
ematician, Evariste Galois (1811-1832) to de nitively answer what specic equations 
of a given degree admit an algebraic solution. The posthumous publication of Galois’s 
manuscripts in Liouville’s Journal de Mathématiques in 1846 represented both the com- 
pletion of Abel’s research and the foundation of group theory, one of the most important 
branches of modern mathematics. Considering the signi cance of his discovery, one nat- 
urally asks why it required 14 years after Galois’s death for the essential elements of 
his work to become available in print. The reason is a combination of sheer bad luck 
and negligence. The original memoir was mislaid by the editor appointed to examine 
it, and after resubmission, it was returned by a second editor, who judged the contents 
incomprehensible. 

The sequence of events seems to be this. Galois rst submitted his results on the 
algebraic solution of equations to the Academy of Sciences in May 1829, while he 
was still only 17 years old. Augustin-Louis Cauchy (1789-1857), a member of the 
Academy and a professor at the Ecole Polytechnique, was appointed referee. Cauchy 
either forgot or lost the communication, as well as another presented a week later. Galois 
then (February 1830) submitted a new version of his investigations to the Academy, 
hoping to enter it in the competition for the Grand Prize in Mathematics, the pinnacle 
of mathematical honor. This time it was entrusted to the permanent secretary, Joseph 
Fourier (1768-1830), who died shortly thereafter, before examining the manuscript. It 
was never retrieved from among his papers. A further disappointment awaited Galois. 
In January 1831, he submitted his paper for the third time under the title “Une mémoire 
sur les conditions de résolubilité des equations par radicaux.” After a delay of some 
six months, during which Galois wrote to the president of the Academy asking what 
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had happened, it was rejected by the referee Simeon-Denis Poisson (1781-1840). At the 
conclusion of his report, Poisson remarked: 


His arguments are not suf ciently clear, nor developed enough for us to judge their 
correctness.... It is hoped that the author would publish his work in its entirety so that 
we can form a de nite opinion. 


In May 1832, Galois was provoked into a duel in unclear circumstances. (The theory has 
been advanced that the challenger was hired by the police, who arranged the confrontation 
to eliminate what they considered to be a dangerous radical.) On the eve of the duel, 
apparently certain of death, Galois wrote a letter to a friend describing the contents of 
the memoir Poisson had rejected. Its seven pages, hastily written, contain a summary of 
the discoveries he had been unable to develop. The letter ends with the plea: 


Eventually there will be, I hope, some people who will nd it pro table to decipher this 
mess. 


Galois spent the rest of the night annotating and making corrections to some of his 
papers; next to a theorem, he scrawled: 


There are a few things left to be completed in this proof. I do not have time. 


The duel took place on May 30, 1832, early in the morning. Galois was grievously 
wounded by a shot in the abdomen, and lay where he had fallen until found by a passing 
peasant, who took him to the hospital. He died the next morning of peritonitis, attended 
by his younger brother. Galois tried to console him, saying, “Do not cry. I need all 
my courage to die at twenty.” He was buried in a common ditch at the cemetery of 
Montparnasse; the exact location is unknown. 

By 1843, Galois’s manuscripts had found their way to Joseph Liouville (1809-1882), 
who after spending several months in the attempt to understand them, became convinced 
of their importance. He addressed the Academy of Sciences on July 4, 1843, opening 
with the words: 


I hope to interest the Academy in announcing that among the papers of Evariste Galois I 
have found a solution, as precise as it is profound, of this beautiful problem: whether or not 
it [the general equation of fth degree] is solvable by radicals. 


Liouville announced that he would publish Galois’s papers in the December 1843 issue 
in his recently founded periodical Journal de Mathématiques Pures et Appliquées. But 
for some reason, publication of the heavily edited version of the celebrated 1831 memoir 
did not occur until the October-November 1846 issue. 

Although no trace of Galois’s grave remains, his enduring monument lies in his 
ideas. During the late 1800s, Galois’s theory—as well as a new topic it brought to life, 
group theory—became an integral and accepted part of mathematics. Galois’s theory 
appears to have been taught in the German universities for the rst time by Richard 
Dedekind, who lectured on the topic at Géttingen in the winter of 1856-1857; it is said 
that only two students came to hear him. The rst full and clear presentation of the 
Galois theory was given by Camille Jordan in his book Traité des substitutions et des 
equations algébraiques (1870). 
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7.4 Problems 


1. Solve the following quartic equations by Ferrari’s 
method. 


(a) x*+3= 12x. 

(b) x++6x?7+8x+21=0. 
[Hint: The reduced form of the resolvent cubic 
is uw? — 24u + 32 = 0, with wu = 4 as a solution.] 

(c) x*++9x +4 = 4x?. [Hint: The resolvent cubic 
z) — 102” + 282 — 2 = 0 has z= 2 as a 
solution. ] 


2. Solve the quartic x+ + 4x3 + 8x7 + 7x+4=0. 
(Hint: First replace the given quartic by 
yt + 2y? — y +2 = 0. The resolvent cubic of this 
last equation is z3 + 5z? + 6z + 2 =0, with z = — 
as a solution.] 


3. Solve the quartic x* + 8x3 + 15x? = 8x + 16. [Hint: 
First replace the given quartic by 
yt — 9y? — 4y + 12 = 0. The reduced form of the 
resolvent cubic of this last equation is 
3_ 8 


w= Bu + +8, with uw =5 as a solution.] 


u 
2 


4. Use Ferrari’s method to show that the quartic 
equation 


x4 49 = 4x3 4+ 6x? + 12x 


has the four roots 3 + J6, 3- J/6, —1+ /—2, and 
—-l1—/-2. 


5. Find a solution to the following problem from the 
Ars Magna. 


Chapter 26, Problem 1. Four men form an 
organization. The rst deposits a given quantity of 
aurei; the second deposits the fourth power of 
one-tenth of the rst; the third, ve times the square 
of one-tenth the rst; and the fourth, 5. Let the sum 
of the rst and second equal the sum of the third and 
fourth. How much did each deposit? [Hint: If it is 
assumed that the rst deposited 10x, then the 
conditions imply that x* + 10x = 5x? + 5.] 


6. The following method of Viéta was a notable 
improvement in Ferrari’s technique for solving the 
quartic y* + py? + gy +r = 0. To both sides of the 
equation, add y?z? + 424, where z is a new 
unknown, so that 


(7? + 52°P = y's? + G24 —r —qy — py’ 


= y(2’ — p)—qyt+ (Gz -r). 
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The right-hand side is a perfect square if z is chosen 
to satisfy 


g? =e" — p)(z4 - 7) 
= 2° — pz — 4rz* + 4rp, 


which is a cubic in z* and therefore solvable. Use 
Viéta’s procedure to nd one root of the quartic 
equation 


yo-y+y-y=10. 


Bibliography 


Bidwell, James, and Lange, Bernard, “Girolamo Cardano: A 
Defense of His Character.” Mathematics Teacher 64 (1971): 
25-31. 


Boas, Marie. The Scientific Renaissance. New York: Harper, 1962. 


Cardano, Girolamo. The Great Art, or the Rules of Algebra. 
Translated by Richard Witmer. Cambridge, Mass.: M.I.T. 
Press, 1968. (Dover reprint, 1993). 


Clarke, Frances. “New Light on Robert Recorde.” Jsis 8 (1926): 
515-532. 

Crombie, A. C. Medieval and Early Modern Science. 2 vols. 
Cambridge, Mass.: Harvard University Press, 1963. 
Dales, Richard. The Scientific Achievement of the Middle Ages. 
Philadelphia: University of Pennsylvania Press, 1973. 
Easton, Joy. “The Early Editions of Robert Recorde’s Ground of 
Arts.” Isis 58 (1967): 515-532. 

Feldman, Richard, Jr. “The Cardano-Tartaglia Dispute.” 
Mathematics Teacher 54 (1961): 160-163. 

Grant, Edward, ed. A Source Book in Medieval Science. 
Cambridge, Mass.: Harvard University Press, 1974. 

Haskins, Charles. Studies in the History of Medieval Science. 
Cambridge, Mass.: Harvard University Press, 1924. 

Hay, Cynthia, ed. Mathematics from Manuscript to Print: 
1300-1600. Oxford: Clarendon Press, 1988. 

Kearney, Hugh. Science and Change 1500-1700. New York: 
World University Library, 1971. 

La Nave, Federica, and Mazur, Barry. “Reading Bombelli.” 
Mathematical Intelligencer 24, no. 1 (2002): 12-21. 

Lindberg, David. The Beginnings of Western Science: 600 B.C. to 
1450 A.D. Chicago: University of Chicago Press, 1992. 


Ore, Oystein. Nils Henrik Abel: Mathematician Extraordinary. 
Minneapolis, Minn.: University of Minnesota Press, 1957. 


. Cardano, the Gambling Scholar. Princeton, N.J.: 
Princeton University Press, 1953. (Dover reprint, 1965). 


Pesic, Peter. Abel’s Proof. Cambridge, Mass.: MIT Press, 
2003. 


Bibliography 


Randall, John. The School of Padua, and the Emergence of 
Modern Science. Padua: Editrice Antenore, 1961. 

Rigatelle, Laura. Evariste Galois (1811-1832). Translated by 
John Denton. Boston: Birkhauser, 1996. 

Rose, Paul. The Italian Renaissance of Mathematics. Geneva: 
Librairie Droz, 1975. 

Rothman, Tony. “Genius and Biographers: The Fictionalization of 
Evariste Galois.” American Mathematical Monthly 89 
(1982): 84-106. 

Sanford, Vera. “Robert Recorde’s Whetstone of Witte, 1557.” 
Mathematics Teacher 50 (1957): 258-266. 

Sarton, George. Six Wings: Men of Science in the Renaissance. 
Bloomington: Indiana University Press, 1957. 

Schrader, Dorothy. “The Arithmetic of Medieval Universities.” 
Mathematics Teacher 60 (1967): 264-275. 


335 


Smith, David E. “The First Printed Arithmetic (Treviso, 1478).” 
Isis 6 (1924): 311-313. 


Stubhaug, Arild. Nils Henrik Abel and His Times. Translated by 
R. Daly. New York: Springer-Verlag, 2000. 


Swetz, Frank. Capitalism and Arithmetic: The New Math of the 
15th Century. La Salle, Ill.: Open Court, 1987. 


Taton, René. “Evariste Galois and his Contemporaries.” Bulletin 
of the London Mathematical Society 15 (1983): 107-118. 


Thorndike, Lynn. Scientific Thought in the Fifteenth Century. 
New York: Columbia University Press, 1929. 


Wightman, W. P. D. Science and the Renaissance. Vol. 1. 
Edinburgh: Oliver and Boyd, 1962. 


Zinner, Ernst. Regiomontanus: His Life and Work. Translated by 
E. Brown. New York: North-Holland/Elsevier, 1990. 


This page intentionally left blank 


8.1 


CHAPTER § 
The Mechanical World: 


Descartes and Newton 


The discoveries of Newton have done more for England and for the race, than has been done by 
whole dynasties of British monarchs. 
THOMAS HILL 


The Renaissance, which by the sixteenth century 


The Dawn of Modern Mathematics was well under way in Italy, soon spread north 
and west, rst to Germany, then to France and 

The Seventeenth Century the Low Countries, and nally to England. By the 

Spread of Knowledge late 1600s, scienti c, technological, and economic 


leadership centered on the English Channel—in 
those countries that had been galvanized by the commerce arising from the great voyages 
of discovery. At the start, the revival was mainly literary, but gradually scholars began 
to pay less attention to what was written in ancient books and to place more reliance on 
their own observations. The age was characterized by an eagerness to experiment, and 
above all to determine how things happened. Seventeenth-century science may be said 
to have begun with the appearance of William Gilbert’s De Magnete in 1600, the rst 
treatise on physical science whose content was based entirely on experimentation; the 
culmination would have been Isaac Newton’s Opticks in 1704. 

In between the De Magnete and the Opticks came the contributions of Johannes 
Kepler, who was convinced that planetary bodies moved not in Aristotle’s “ideal circles” 
but in elliptical orbits, and he thereby formulated the laws of terrestrial motion (1619). 
Also there were the demonstrations by William Harvey (1628) of the circulatory route of 
the blood from the heart through arteries and veins by way of the lungs; the laying down 
of the principles of modern chemistry by Robert Boyle in his Sceptical Chymist (1661); 
and the publication of Robert Hooke’s Micrographia (1665), the earliest large-scale work 
on the microscopic observation of cellular structure. However, no brief summing can do 
justice to the achievements of a period that saw so many new discoveries and so many 
advances in scientic methods. 

Whereas the Renaissance marked a return to classical concepts, the seventeenth 
century set mathematics on entirely new foundations. So extensive and radical were 
the changes that historians have come to regard the half-century from 1637 to 1687 as 
the fountainhead of modern mathematics—the rst date alluding to the publication of 
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Descartes’s La Géométrie and the second to the date of publication of Newton’s Principia 
Mathematica. 

Renaissance mathematics had added little to the geometry of the ancient Greeks, 
but 1600 ushered in an unexpected revival in the subject. In 1637 the French mathe- 
matical community witnessed one of those strange coincidences, once thought rare but 
which the history of science has shown to be frequent. Two men, Pierre de Fermat and 
René Descartes, simultaneously wedded algebra to geometry, to produce a remarkable 
innovation, analytic geometry. About the time when Fermat and Descartes were laying 
the foundations of a coordinate geometry, two other equally original mathematicians, 
Pascal and Desargues, were rendering a similar service in the area of synthetic projective 
geometry. But it was not only on account of the far-reaching developments in geometry 
that the seventeenth century has become illustrious in the history of mathematics, for the 
activities of the mathematicians of the period stretched into many elds, new and old. 
Number mysticism gave way to number theory, in Fermat’s re ections on diophantine 
analysis. The mathematical theory of probability, a subject to which Cardan contributed 
in his book Liber de Ludo Aleae, took its rst full steps in an exchange of letters be- 
tween Pascal and Fermat concerning the calculation of probabilities. Leibniz’s attempt 
to reduce logical discussion to systematic form was the forerunner of modern symbolic 
logic; but it was so far in advance of its time that not until 200 years later was the 
idea realized through the work of the English mathematician George Boole. Hardly less 
important were the studies of Galileo, Descartes, Torricelli, and Newton, which were to 
turn mechanics into an exact science during the next two centuries. 

During the middle years of the Renaissance, trigonometry had become a systematic 
branch of mathematics in its own right in place of serving as handmaiden to astronomy. 
The aim of facilitating work with complicated trigonometric tables was responsible for 
one of the greatest computational improvements in arithmetic, the invention of logarithms, 
by John Napier (1550-1617). Napier worked at least 20 years on the theory, which 
he explained in his book Mirifii Logarithmorum Canonis Descriptio (A Description 
of an Admirable Table of Logarithms, 1614). Seldom has a new discovery won such 
universal acclaim and acceptance. With logarithms, the operations of multiplication and 
division can be reduced to addition and subtraction, thereby saving an immense amount 
of calculation, especially when large numbers are involved. Astronomy was notorious 
for the time-consuming computations it imposed; the French mathematician Pierre de 
Laplace was later to assert that the invention of logarithms “by shortening the labors, 
doubled the life of the astronomer.” 

Above all, for mathematics the seventeenth century was the century of the rise 
of calculus. Although we normally ascribe the invention of calculus to two brilliant 
contemporaries, Isaac Newton (1642-1727) and Gottfried Leibniz (1646-1716), great 
advances in mathematics are seldom the work of single individuals. Cavalieri, Torricelli, 
Barrow, Descartes, Fermat, and Wallis had all paved the way to the threshold but had 
hesitated when it came to crossing it. By the second half of the seventeenth century, the 
raw materials lay at hand out of which the calculus would emerge. All that remained was 
for a Leibniz or a Newton to fuse these ideas in a tremendous synthesis. Newton’s well- 
known statement to Hooke, “If I have seen farther than others, it is because I have stood 
on the shoulders of giants,” shows his appreciation of this cumulative and progressive 
growth of mathematics. 
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Galileo’s Telescopic Observations 


Probably no single gure from the 1600s is as well known as the mathematician- 
physicist-astronomer Galileo Galilei (1564-1642). His name is associated with events of 
profound signi cance: with the birth of modern science, with the Copernican revolution, 
with the dethronement of Aristotle as the supreme authority in the schools, and with the 
struggle against external restrictions on scientic inquiry. Galileo’s original intention was 
to enter the lucrative profession of medicine, and in 1581, he enrolled at the University of 
Pisa as a medical student. While a student at Pisa, Galileo is supposed to have made his 
rst independent discovery, the isochronism (equality of time) of the pendulum. Tradition 
has it that this came about through his observation that a chandelier in the cathedral, set 
in motion while being lit, performed all its swings at equal intervals of time although its 
successive swings gradually grew narrower in amplitude. 

Galileo’s formal introduction to mathematics came late. There is a story that Galileo, 
after listening at the door of a classroom in which Ostilio Ricci (a pupil of the famous 
Italian mathematician Tartaglia) was lecturing on the geometry of Euclid, became so 
fascinated with mathematics that he abandoned his medical plans. Indeed, he appears 
to have had little fondness for medicine and left the university in 1585 without a de- 
gree. Under the tutelage of Ricci, Galileo spent the next year pursuing the study of 
Euclid; he then went on to the other Greek geometers, winding up with the mechanical 
works of Archimedes. From 1585 to 1589, he earned money by giving private lessons 
in mathematics. Then, at the age of 25, Galileo succeeded in obtaining a lectureship at 
the University of Pisa. The appointment was only for three years, and the salary a mere 
pittance, but he gained academic standing. 

Galileo is alleged to have performed, during his stay at Pisa, a public demonstration 
at the Leaning Tower to show that bodies of the same material but different weights 
fall with equal speed. This was an open challenge to the prevailing Aristotelian physics, 
according to which “the downward movement of a mass of gold or lead, or any body 
endowed with weight, is quicker in proportion to its size;” that is, the heavier the body, the 
faster the fall. Aristotelians claimed that the simultaneous arrival of the two weights—if 
the demonstration was actually carried out—was the effect of sorcery and not a refutation 
of Aristotle. They managed to pack the young professor’s lectures and hiss at his every 
word. Because he had aroused antagonism in the faculty, Galileo had little hope of 
reappointment at Pisa at the end of his three-year contract. He left, in 1592, to become 
professor of mathematics at the famed University of Padua, a post he held for the next 
18 years. 

In 1609, Galileo heard rumors that Dutch spectacle-makers had invented a remarkable 
contrivance for making distant objects appear quite close. Surmising how such a device, 
called a “telescope,” might be constructed, he set to work to fashion one for himself. (It 
is noteworthy that the Dutch instrument was of a totally different type from what Galileo 
designed.) Although Galileo by no means invented the telescope, he seems to have been 
the rst to look at the sky systematically with one and to publish ndings. 

In a series of observations made in 1610, Galileo was able to distinguish the four 
satellites revolving about Jupiter—perhaps the most dramatic disproof of the Aristotelian 
view that the earth is at the center of all astronomical motions. Within a month he 
published this truly earth-shattering news in a 29-page booklet entitled the Sidereus 
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Galileo Galilei 
(1564-1642) 


(From A Short History of Astronomy by Arthur Berry, 
1961, Dover Publications, Inc., N.Y.) 


Nuncius (The Starry Messenger), “unfolding great and marvelous sights” such as the 
existence of unknown stars, the nature of the Milky Way, and the rugged surface of the 
moon. Such ideas were so disturbing that there were professors at Padua who refused 
to credit Galileo’s discoveries, refused even to look into his telescope for fear of seeing 
in it things that would discredit the infallibility of Aristotle and Ptolemy, and even the 
Church. His open publication of Copernican views made Galileo’s position as a teacher 
at Padua, a stronghold of Aristotelianism, untenable. Later in the year, he accepted an 
appointment as “First Mathematician” of the University of Pisa, and also the post of 
court mathematician to the Grand Duke of Tuscany. 

At the beginning of the sixteenth century, most people still believed in the ancient 
description of the universe. As conceived by Aristotle and elaborated by Ptolemy, this 
system placed the earth at the center; and then at increasing distances from it came nine 
crystalline and concentric spheres. The rst seven spheres carried the sun, the moon, and 
the ve known planets, and the xed stars were attached to the eighth one, often called 
the “rmament.” An elaborate theory of epicycles, deferents, equants, and eccentrics 
accounted for each planet’s motion within its own sphere. On the outside lay the ninth 
sphere, known as the “primum mobile” and representing the Prime Mover, or God; this 
was held to provide in some inexplicable fashion the motive power for all the others. 
Beyond this last sphere, there was nothing, no matter, no space, nothing at all. The 
Aristotelian universe was a_ nite one contained within the primum mobile. 

From the standpoint of Aristotle, the earth was the main body in the universe, and 
everything else existed for its sake and the sake of its inhabitants. In the new cosmology 
produced by Nicolaus Copernicus (1473-1543), the sun changed places with the earth; 
the sun became the unique central body and the earth merely one of several planets 
revolving about the stationary sun. The ancient theory, because it made the earth the 
center, is known as the “geocentric theory,” and the Copernican, because it treated the sun 
as central, is called “heliocentric.” (In Greek, the words for “earth” and “sun” are ge and 
helios.) From the theory’s inception, theologians—both Protestant and Catholic—viewed 
with extreme dislike a theory in which the earth became a comparatively insigni cant 
part of Creation. Had not God created the universe for man’s enjoyment and put the 
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earth at the center to prove this? Indeed the psalmist declared, in the ninety-third Psalm, 
“He hath made the round world so sure, that it cannot be moved.” Moving the earth was 
like displacing God’s throne. 

Underlying the issue whether the Copernican pattern of celestial motion was phys- 
ically correct was the matter of authority. Copernicanism was so incompatible with the 
traditional interpretation of various passages in the Bible that if it should prevail, it 
seemed that the Bible would lose authority, and Christianity would suffer. Besides, if 
freedom of judgment could be exercised to the extent of deciding between rival astro- 
nomical theories, it was but a short step to questioning authority itself. 

Over the next several years, Galileo found himself involved in disputes about the 
relation of his astronomical views to the Bible. A question frequently raised to confound 
the adherents of the heliocentric theory was how to explain the “Miracle of Joshua.” The 
tenth chapter of the Book of Joshua relates that God, at Joshua’s prayer, made the sun 
stand still and lengthened the day so that the Israelites could pursue their enemies; had 
the sun gone down, the victory would not have been total. In several widely circulated 
letters (1613), Galileo maintained that “this passage shows manifestly the impossibility 
of the Aristotelian and Ptolemaic world systems and on the other hand accords very 
well with the Copernican.” With a brilliant dialectic turn, Galileo pointed out that if one 
accepted the traditional cosmology of Aristotle, the account of Joshua stopping the sun 
could not be understood literally. For it was admitted that the originative source of the 
sun’s motion, as well as that of the planets and stars, was the primum mobile. Therefore, 
if the whole of the heavenly movement was not to be disarranged, Joshua must have 
stopped not the sun but the outermost celestial sphere. On the other hand, by accepting 
Copernican theory, one could take the story literally. For if it was assumed that the 
revolution of the planets was impressed on them by the sun, which is in the center of 
the universe, then by stopping the sun Joshua was able to stop the whole solar system 
without disordering the other parts. Quite simply, if one were going to interpret scriptural 
language in its strict meaning, it would be better in this case to be a Copernican. 

Galileo went on to state that the Holy Scriptures did not have as their aim the 
teaching of science, and that the words of the Bible were not to be taken literally. 
Where the sun was described as moving around the earth, this was a re ection of the 
incomplete knowledge of those times; certainly it was not meant as an endorsement of 
a given astronomical theory. He quoted “an ecclesiastic of the most eminent degree” 
who once said, “The intention of the Holy Ghost is to teach us not how the heavens go, 
but how to go to Heaven.” Farther on, he added that “before a physical proposition is 
condemned, it must be shown to be not rigorously demonstrated.” In implying that it was 
the Church that ought to give scientic proof if Galileo were to be faulted, he provided 
exactly the opportunity his enemies wanted. They proclaimed everywhere that Galileo 
had assailed the authority of the Scriptures as a privileged source of knowledge and had 
tried, as an outsider, to meddle in religious matters. Mathematics was denounced from 
the pulpit as a devilish art and all mathematicians as enemies of the true religion. 

Toward the beginning of 1616, the pope submitted the following two propositions 
to the Holy Of ce for examination: (1) “The sun is the center of the world and entirely 
motionless as regards spatial motion” and (2) “The earth is not the center of the world 
and is not motionless, but moves with regard to itself and in daily motion.” After a day’s 
deliberation, a special commission of theologians ruled that the rst of these was “foolish 
and absurd, philosophically and formally heretical, inasmuch as it expressly contradicts 


342 


Chapter 8 The Mechanical World: Descartes and Newton 


the doctrine of the Holy Scripture in many passages.” As for the second proposition, it 
could be “equally censured philosophically and was at least erroneous in faith.” 

As an avowed protagonist of a moving earth, Galileo also had to be disciplined. He 
was summoned to the palace of Cardinal Bellarmine and in the presence of witnesses, 
admonished to “abstain altogether from teaching or defending this opinion and doctrine, 
and even from discussing it.” If he did not acquiesce, he was to be imprisoned. Galileo 
declared that he submitted. Immediately afterward, the Holy Of ce proceeded against the 
writings of Copernicus. The work that had caused the upheaval in astronomical thought, 
the De Revolutionibus Orbium Coelestium (On the Revolution of the Heavenly Spheres, 
1543), and all other texts that af rmed the earth’s motion were put on the Index of 
Prohibited Books “pending correction” of various passages. 

Galileo was more or less silenced until 1623, when a new pope, a longtime admirer 
of his, was elected and took the name Urban VUI. After several friendly audiences, Urban 
granted Galileo permission to write about Copernicus’s theory provided that he represent 
it not as reality but as a convenient scientic hypothesis, and provided that arguments for 
the Ptolemaic view were given equal and impartial discussion. Galileo began work on a 
book that he believed would comply with these instructions. The writing went slowly, 
as Galileo was in ill health, but by 1630 he had nally completed the manuscript of the 
astronomical treatise that was to lead to his celebrated trial: Dialogo Sopra Due Massimi 
Sistemi del Mondo (Dialogue Concerning the Two Chief World Systems). After many 
arguments and delays in getting the necessary license for printing it, Galileo had the 
book approved by the Church authorities and it was published in Florence in 1632. 

To reach the widest possible audience, Galileo wrote the Dialogue not in Latin, the 
academic language of the universities, but in Italian. It is addressed “To the Discerning 
Reader,” whom he wished to win over to his cause. The immediate response from the 
public was enormous, with the work sold out as it came off the presses. One reason 
is that the Dialogue was the most readable of the three great masterpieces of contem- 
porary astronomical literature, of which the other two were the De Revolutionibus of 
Copernicus and the Principia of Newton. It was not a severely technical treatise like 
the others—mathematics would have been out of place in it—but a piece of brilliant 
polemic, directed at the clerical establishment. As far as the text itself was concerned, 
the Dialogue consisted of a lively conversation extending over four successive days in 
which three people discussed the arguments for and against Copernicanism, though com- 
ing to no denite conclusion. Of the three speakers, Salviati, the Copernican scholar, 
represented Galileo; Simplicio, the archetype of the bumbling Aristotelian philosopher, 
stood for authority; and Sagredo, the intelligent and cultivated layman, acted as modera- 
tor. The form of a dialogue was chosen partly for literary reasons, but still more because 
it would enable Galileo to claim that certain views expressed were not really his own 
but those of an imaginary character. Needless to say, the Aristotelian cause came out a 
miserable second best, as Simplicio was made to look foolish and forced to withdraw 
from the conversation. 

With the publication of the Dialogue, Galileo’s enemies in science as well as in 
the Church redoubled their denunciation of him. An ecclesiastical commission that 
examined the work reported that he had transgressed orders by treating Copernicanism 
not as hypothesis but as fact. Moreover, their search of the Inquisition’s records for 1616 
disclosed a notary’s unsigned statement to the effect that Galileo had been personally 
ordered not to teach Copernicanism in any manner, orally or in writing. The pope, who 
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Nicolaus Copernicus 


(1473-1543) 


(From A Short History of Astronomy by Arthur Berry, 
1961, Dover Publications, Inc., N.Y.) 


had known nothing of this injunction, felt that he had been tricked into the granting of 
permission for Galileo to write about the topic. Worst of all, Urban became convinced 
that Galileo had held him up to ridicule in the character of Simplicio, the clumsy 
defender of the geocentric theory. Urban could not have failed to recognize his own 
arguments put into Simplicio’s mouth almost verbatim. (After hearing one of Simplicio’s 
objections to the new theory, Salviati commented sarcastically, “An admirable and truly 
angelic argument.”) His vanity bitterly wounded, the pope came to support the Jesuit 
position that the teachings expounded in the Dialogue were potentially more dangerous 
to Christianity than all the heresies of Luther and Calvin. 

Although 70 years old and seriously ill, the author of the Dialogue was summoned to 
Rome to stand trial before a tribunal of the dreaded Inquisition. The line of questioning 
was designed to elicit from Galileo an admission that he had broken his promise to 
obey the injunction issued in 1616. His defense was that far from violating the decree 
by advocating Copernicanism, he had written the Dialogue to show that the arguments 
for it were “invalid and inconclusive.” At a second examination, Galileo declared that 
he had just reread his book for the rst time in three years and did admit that many 
sections presented the case for Copernicanism in an extremely favorable light. He was 
never actually put to torture during his stay in prison, but the ever-present threat of 
such questioning induced Galileo to confess his “errors” and to sign a recognition of the 
Church’s authority in astronomical matters. (There is a persistent legend to the effect that 
Galileo, on making his public abjuration, muttered to himself the words, “Nevertheless 
it does move.”) The full text of the recantation was read from every pulpit, and to all 
university students by their professors. As an additional safeguard against Copernican 
ideas, the Dialogue was placed on the Index of Prohibited Books, where it remained until 
1822. In 1744, for the rst time, the Church permitted it to be printed, and then only 
with the sentence on Galileo and his recantation included in the same volume. Galileo’s 
trial illustrates one of the grave problems that the early seventeenth-century scientists 


344 


Chapter 8 The Mechanical World: Descartes and Newton 


Frontpiece of Galileo’s Dialogo (1632): Aristotle, Ptolemy, and Copernicus (from 


left to right) debate the structure of the universe. (By courtesy Editions Culture et 
Civilisation.) 


had to confront. They were allowed to pursue their experiments provided that they kept 
the results to themselves. Because the Dialogue was written in a popular style accessible 
to a wide public, Galileo’s prosecutors charged him not so much for holding his own 
views as for openly proclaiming them. 

Apparently broken in spirit, Galileo was sentenced to permanent house arrest, under 
surveillance of of cers of the Inquisition. His devotion to scientic study remained un- 
abated, however. Before his astronomical discoveries with the newly devised telescope 
had made him world-famous, he had been about to publish a great work on mechanics. 
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Galileo turned again to this project of a quarter-century earlier, and though nearly blind, 
went on to write his most important work, Discorsi e Dimostrzioni Matematiche Intorno a 
Due Nuove Scienze (Discourses and Mathematical Demonstrations Concerning Two New 
Sciences), a treatise analyzing projectile motion and gravitational acceleration. From this 
book the science of dynamics can be said to have taken its origin. Because the Inquisi- 
tion would allow no work of Galileo’s to be published in Italy, the manuscript had to be 
smuggled out of the country, and it was printed in Holland in 1638. 

As time passed, the Church gradually softened its assessment of the Copernican 
heliocentric system as “abjured, cursed and detested.” The question of Galileo’s heresy, 
however, remained open until 1992. After three centuries, Pope John Paul II formally 
acknowledged that Galileo had been wrongfully condemned, and that his judges had 
erred in insisting upon the literal reading of Holy Scripture as the way to understand the 
physical world. 

One is tempted to say of Galileo what had once been said of Plato: he was the maker 
of mathematicians rather than the author of mathematical treatises. Although Galileo had 
exceptional mathematical ability, it showed itself episodically and not in any magnum 
opus on some branch of the subject. (His pupils Cavalieri and Torricelli went on to achieve 
a degree of fame.) His great contribution was the revolutionary idea that mathematics 
was the vehicle of scientic explanation, or as he himself put it, “Nature is written in 
mathematical language.” Thus, the emphasis was changed from the Pythagorean view 
that Nature was mathematical to the view that Nature lent itself to mathematization. The 
underlying principle of this “new Pythagoreanism” was that the universe behaves in a 
logical way that could be understood through a combination of direct observation and 
mathematical reasoning. 


The Beginning of Modern Notation: Francois Viéta 


It is dif cult to say precisely why there should have been such spectacular math- 
ematical achievements at this particular time, but the improvement in the means of 
mathematical expression was a necessity for the transition from ancient to modern con- 
ceptions. New results have often become possible only because of a different mode of 
writing. Certainly, by 1600, new mathematical notations were blossoming in Europe like 
owers in the springtime. The signs + and — rst appeared in print in Johann Wid- 
mann’s Mercantile Arithmetic (1489), where they refer, not to addition or subtraction or 
to positive or negative numbers, but to surpluses and de cits in business problems. The 
rst person to use them in writing an algebraic expression was the Dutch mathematician 
Vander Hoecke (1514). In the earliest English treatise on algebra, published under the 
alluring title The Whetstone of Witte (1557), Robert Recorde introduced the symbol =, 
but with longer lines, to denote equality. Recorde selected the sign because, so he says, 
“Noe 2 thynges can be moare equalle,” than two parallel straight lines. However, this 
notation was not immediately popular. Xylander (1575) preferred two upright parallel 
lines, and the use of the sign « for equality (possibly a corruption of the rst two letters 
of the word aequalis) continued well into the 1700s. Symbols for multiplication devel- 
oped much more slowly than the symbols for addition and subtraction. Thomas Harriot, 
in his Artis Analyticae Praxis (Practice of the Analytic Art, 1631), denoted multiplication 
by a dot, but the Clavis Mathematicae (Key to Mathematics, 1631) of another English 
algebraist, William Oughtred, used the cross sign x. Harriot was also responsible for the 
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symbols > and < for “greater” and “less,” which were an improvement over the symbols 
Land _] invented simultaneously by Oughtred for the same purpose. (The lack of sym- 
metry in Oughtred’s notation made it dif cult to remember.) In Rahn’s Teutsche Algebra, 
published in 1659, the symbol ~ for division was encountered in print for the rst time. 
The square root sign J — is traceable to Christoff Rudolff’s Die Coss (1525), where it 
had only two strokes. It is frequently said that Rudolff chose V because it resembled 
a small r, the initial letter of the word radix. Typesetters were inclined to improvise on 
these notations, turning the root J into the position for “greater than” and into the 
position \ for “less than.” 

In the 1500s, dominance in algebra passed from Italy to other continental countries, 
particularly Germany and France. Michael Stifel was Germany’s most notable writer on 
the subject. Originally an Augustinian monk, he was converted by Martin Luther and 
thereafter spent many years as an itinerant preacher. Stifel had a habit of dabbling in 
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number-mysticism, as when he determined that 666, the number of the malevolent Beast 
in the book of Revelation, represented the reigning Pope Leo X. A study of certain biblical 
passages also led him to announce the exact time that the world would come to an end: 
8 o’clock on October 3, 1533. Members of his faithful congregation abandoned their 
properties and spent their savings on frenzied revelries in anticipation of the foretold 
day. When the prophecy failed to materialize, the furious peasants had Stifel arrested 
and thrown into jail. Only through Luther’s personal intervention was he subsequently 
released. 

Putting the number nonsense behind him, Stifel turned his attention to legitimate 
mathematical studies. He secured a master’s degree from the University of Wittenberg 
and later lectured both at K6nigsberg and at Jena. His principal work, the practical 
textbook Arithmetica integra, appeared in 1544 when he was nearly 60 years old. The 
German-language Deutsche Arithmetica was published the next year, followed in 1553 
by his extensive commentary on Christoff Rudolff's Die Coss. Stifel’s improvement in 
algebraic notation was to designate integral powers of an unknown quantity by repeating 
the same letter the requisite number of times; his [AAA would mean A? to us. He allowed 
negative coef cients in equations, but refused to acknowledge the legitimacy of negative 
roots. Stifel continued the common practice of expressing equality in words: for example, 
in the Arithmetica integra he writes 


$5B — 1BB aequatur 1156. 


In France, Francois Vieta (1540-1603), councilor to the court of Henry IV and the 
leading mathematician of his day, took a decisive step in perfecting algebraic symbolism. 
Since the time of Euclid, letters had been used to represent the quantities that entered into 
an equation, but there had been no way of distinguishing quantities assumed to be known 
from those unknown quantities that were to be found. Viéta suggested using letters of the 
alphabet (capitals) as symbols for quantities, both known and unknown—vowels were to 
designate unknown quantities, what we now call variables, and consonants to represent 
numbers assumed to be given. Simple though it was, this convention had enormous 
consequences in liberating algebra from having to deal with particular examples involving 
specic numerical coef cients. Before the introduction of Viéta’s literal notation (that 
is, a notation in which letters stand in place of numbers), attention had focused on 
specic equations only. Each equation, such as 3x + 2=0 or 6x*7+5x +1=0, had 
an individuality all its own and had to be handled on its own merits. Literal notation 
made it possible to build up a general theory of equations—to study not an equation like 
6x? + 5x +1 =0 but the quadratic equation ax? + bx +c = 0. Previously the idea of 
an equation in which the coef cients might be of either sign, each independent of the 
sign preceding it, was wholly lacking. 

The vowel-consonant notation of Vieta had a short existence; for within a half- 
century of Victa’s death, Descartes’s Géométrie appeared. In this work, letters at the 
beginning of the alphabet were used for given quantities, and those near the end (es- 
pecially x) for the unknown. This rule was rapidly assimilated into seventeenth-century 
practice and has survived to modern times. 

Vieta retained the last vestige of verbal algebra by writing A quadratus, A cubus, and 
so on, for the different powers of a quantity A. The immensely convenient idea of using 
exponents to indicate the powers to which a quantity is raised was another contribution 
of Descartes and occurred for the rst time in his Discours de la Méthode, published 
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in 1637. The successive powers of x were denoted, much as is still done, by x, xx, 
x3, x*,.... For some strange reason, however, Descartes invariably wrote the expression 
xx in place of x”. This use of a repeated letter for the second power continued for many 
years, certain writers preferring xx on the unmathematical grounds that it occupied no 
more space than x? did. 

The chief difference between Diophantus’s abbreviated symbolism and Descartes’s 
notation is that Diophantus used no sign for addition, denoting the operation instead by 
simple juxtaposition. Furthermore, there was no clearly displayed relationship between 
AY and the symbol ¢ of which it is the square, as there is with x7 and x. Whereas 
Diophantus would have written 


K°4 tA‘ 61¢2M3, 
this became, according to Descartes’s symbolism, 
4x3 — 6xx x 2x +3 


(both correspond to our 4x? — 6x? = 2x +3). A century earlier, the Italian algebraist 
Bombelli might have indicated the same equation by 


4Cm. 6Q aeqtur 2R p. 3. 


Here, a syncopated algebra is met in which the letters p and m are used as abbreviations 
for addition (piu) and subtraction (meno), while each power is represented by its own 
symbol, the unknown denoted by R, its square by Q, and its cube by C. Bombelli had 
another form of expression, namely 


A 6 aeqtur 2 p. 3, 


where the power of the unknown quantity is represented as a numeral above a short 
circular arc. On the other hand, Cardan, who did not accept negative coef cients, would 
probably have written the equation in question as 


4 cubus aequantur 6 quadratus & 2 res & 3 


(modern form: 4x7 = 6x? + 2x + 3). For “unknown,” Cardan had the Latin term res. 
literally “thing”; the ligature &, or the word ef, stands for addition. 

Thus, we have seen algebra pass through three stages: the rhetorical, in which all 
statements and equations were written out in ordinary language; the syncopated, in which 
familiar terms were abbreviated; and the symbolic, in which every part of an expression 
was characterized by an ad hoc symbol. 


The Decimal Fractions of Simon Stevin 


Decimal fractions were the most important innovation in arithmetic since the gen- 
eral introduction of the Hindu-Arabic numeration system. Such fractions, along with a 
more or less convenient notation for them, were inevitable; it is only surprising that this 
“admirable invention” took so long to appear. Although the new numerals had gained 
ascendency in Western Europe by 1500, sexagesimal notation continued to be employed 
for representing fractions. Decimal and sexagesimal bases were actually used in combi- 
nation in individual numbers, with decimal notation being used for a number’s integral 
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part and sexagesimal notation for its fractional part. This dual usage is illustrated in the 
calculation of tables of square, cube, and higher roots. John of Meurs (circa 1343), for 
example, extracted /2 by the method 


1 1 414 
/2= 00 ¥ 2000000 * 1414 =1+ 7, 
which he has clearly done using a 10-based computation. He expressed the result, how- 
ever, in sexagesimal terms as 1°24’50"24’”. If it now seems odd to use mixed notations 
in this way, recall that trigonometric tables still appear in this format. 

Although the idea was maturing in the work of various mathematicians—particularly 
the Frenchman Francois Viéta—the rst person to give a systematic exposition of the rules 
of operation of decimal fractions was Simon Stevin (1548-1620), a native of Bruges. In 
his younger days Stevin worked as a bookkeeper in Antwerp, leaving the Low Countries 
during the period of unrest preceding the Dutch revolt against Spanish rule. After traveling 
in Prussia, Poland, and Norway for 10 years, he nally established himself in the northern 
part of the Netherlands, an area which by then had shaken off the domination of Spain. In 
1583 Stevin entered the newly founded University of Leiden. He was 35 years of age by 
then—at that time, rather late in life to become a student—but he did so well that within 
a few years he was teaching mathematics at the university. On the recommendation of 
a former pupil, Prince Maurice of Nassau, Stevin was appointed quartermaster general 
for the Dutch armies and inspector of dikes and canals; he held these powerful positions 
until the time of his death. 

Eager to explain his views, Stevin wrote on a wide variety of topics. His earliest 
publication, Table of Interest Rates (1582), gave the common reader rules for computing 
simple and compound interest. It also offered tables, previously kept secret by bankers, 
for speedily guring discounts and annuities. The Art of Fortifiation (1586) was put 
to practical use by military engineers during the wars of the next two centuries. In 
mathematics, Stevin brought out his Arithmetic (1585), elaborating on the best from 
older arithmetic and algebraic writings; he also prepared a French version of the rst 
four books of Diophantus’s Arithmetica; the latter work, the rst translation of Diophantus 
into any European vernacular, was probably based on the Latin text of Xylander. Stevin’s 
fame in the history of science rests mainly on the Principles of Hydrostatics of 1586, the 
rst substantial advance in the subject beyond the work of Archimedes. 

Calculation with decimal fractions was the theme of Stevin’s popular pamphlet 
De Thiende (The Tenth), published in Flemish in 1585; a French translation under the 
title La Disme was printed in the same year. The English rendition in 1608 introduced 
the word “decimal” into our language. The 29-page booklet carried the subtitle “Teach- 
ing how all computations that are met in business may be performed by integers alone 
without the aid of fractions.” That is, as soon as fractions are put into decimal form they 
may be treated as integers; the operations of ordinary arithmetic are done in the same 
manner as for integers, with the decimal point merely having to be placed correctly in 
the nal result. Stevin did not think of using a single sign to separate the two parts of a 
number, as we would use a decimal point. Instead, he suggested a cumbersome notation 
using encircled numbers to indicate powers of 10. Each succeeding digit in the fractional 
part of a number was to be followed by an encircled number noting the corresponding 
power of 10 in the denominator; the integral part of the number would be followed by an 
encircled zero. For instance, the number that we would represent by 34.567 was written 
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by Stevin as either 


© © @ ® 
34@5@6Q7@B@ o 34 #5 6+ 7, 


as the circumstances required. In multiplying, say, 0.000378 by 0.54 he arranged the 
computation as 


The La Disme gained rapid and wide circulation, but Stevin’s clumsy circle 
notation—taken from Bombelli’s Algebra—was short-lived. The rst writer to use a 
period as a separator between the integral and fractional parts of a number seems to have 
been the Vatican astronomer Christoph Clavius; this device appears in his Astrolabium 
of 1593. The main introducer of the decimal point into computational practice was the 
Scottish nobleman John Napier. Napier’s new mathematical instrument, common loga- 
rithms, became the natural vehicle for the decimal idea. In the preface to his in uential 
treatise on logarithms Napier stated, “In numbers distinguished by a period in their midst, 
whatever is written after the period is a fraction.” Within 50 years of Stevin’s little tract, 
decimal numeration was widely adopted throughout Europe. 

The collected mathematical works of Stevin were translated from Flemish into French 
and augmented with many notes by Albert Girard (1595-1632). Girard was born in the 
Duchy of Lorraine, but being a Protestant he settled in the northern Netherlands where 
those of all faiths were accepted. There he worked as a translator and a military engineer 
to support his family. Girard’s own great work, Linvention Nouvelle en lAlg* ebre, was 
published in Amsterdam in 1629. In the theory of equations he followed Stevin’s notation 
in designating roots of polynomials by fractional exponents, but he occasionally used 
modern symbolism such as ¥/ and ¥/. At rst, Girard intentionally neglected imaginary 
numbers as roots of equations, calling them “impossible solutions.” Later, however, he 
extended his notion of number to include the complex numbers. He is said to be the rst 
to describe negative solutions geometrically, as line segments oriented in the direction 
opposite to the segments corresponding to positive numbers—thus anticipating the notion 
of a number line. 


Napier’s Invention of Logarithms 


The practitioners of mathematics in the sixteenth and seventeenth centuries came 
from and followed all walks of life. John Napier (1550-1617) was the eighth Baron of 
Merchiston in Scotland. He studied at home until he was 13 years old, the normal age 
for entering St. Andrews, the oldest of the Scottish universities. Records fail to show that 
he completed a degree at St. Andrews; most likely Napier chose to study abroad, as was 
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the custom among young noblemen. We know for certain that by 1571 he was back in 
Scotland, soon to be married and take up the life of a country gentleman. Following the 
death of his father in 1603, Napier moved to Merchiston Castle, near Edinburgh, where 
he spent the remainder of his life. 

The period toward the end of the sixteenth century was one of continual religious 
strife in Scotland, engendered by the establishment of Calvinist Protestantism as the 
religion of the state and the subsequent fear of invasion by the Invincible Armada of 
Catholic Spain. A vocal defender of the Protestant faith, Napier published (1594) a 
commentary on the book of Revelation, A Plaine Discovery on the Whole Revelation of 
Saint John. In it he bitterly attacked the Catholic Church, arguing that the Pope in Rome 
was the anti-Christ. The commentary also contained the “discovery” that the Creator 
proposed to end the world in the years between 1688 and 1700. The polemical tract 
was widely read, passing through 21 editions in various languages. At least 10 editions 
were printed during the author’s lifetime. Napier considered this work to be his great 
contribution to humanity, and certainly it made him far more famous in his day than did 
his mathematics. 

Aside from his popular theological writing, Napier earned considerable reputation as 
an inventor. One of his more practical conceptions was a device with a hydraulic screw 
and revolving axle for pumping water out of coal mines. Much like Archimedes, Napier 
drew up plans of various engines of warfare for the defense of his country and faith. 
Among these destructive weapons were two types of burning mirrors that would destroy 
an enemy ship “at whatever appointed distance,” an artillery piece that could “clear a 
eld of four miles’ circumference of all living creatures exceeding a foot in height,” 
an armored chariot with “a living mouth of mettle” that could “scatter destruction on 
all sides,” and “devices for sayling under water.” It is not known whether any of these 
war machines advanced further than their designs, but Napier’s vision of the future was 
ultimately realized in the form of the machine gun, tank, and submarine. 

It is not surprising that the various legends associated with Napier emphasized his 
ingenuity and imagination. In one story, he is said to have used a jet-black rooster to 
discover which of his servants was stealing from him. Napier told each of the servants 
to go, one by one, into a darkened room with instructions to stroke the bird’s back. He 
assured them that the rooster would identify the guilty person by crowing. What the 
servants did not know was that Napier had liberally dusted the rooster with soot. The 
thief, unwilling to touch the bird, was the only one to return with his hands still clean. 

Napier’s writings in mathematics were concerned with the practicalities of compu- 
tation. The small book entitled Rabdologiae (from the Greek words rabdos and logia, 
meaning “rod” and “ collection”), written in Latin and published in the year of his death, 
introduces a form of rod by the use of which two numbers can be multiplied in a me- 
chanical way. Napier’s invention is often referred to as Napier’s “bones” because of 
the title of a 1667 English edition of the work, The Art of Numbring by Speaking-Rods: 
Vulgarly termed Nepeirs Bones (sic). The translator rendered the last part of the original 
title inaccurately, taking it to mean speech (logos). 

Napier’s rods consisted of 10 rectangular blocks of wood or bone designed to form 
a sort of multiplication table. The face of each rod was divided into nine squares. A 
particular digit was engraved in the top square, while the lower squares contained mul- 
tiples of it from 2 to 9. When the product involved two integers, they are separated by a 
diagonal, with the tens digit above and to the left. The end rod, which served as a row 
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marker, had the Roman numerals I to IX written on it. Suppose, for instance, that one 
wished to multiply 458 by 36. The rods headed by 4, 5, and 8 were placed side by side, 
along with the marker rod. Looking across the line indicated by III, the third multiples 
are noted; and the tens digit in each square is carried diagonally downward and added 
to the units in the next square on the left. Corresponding to the multiplier 3, one should 
have 4, 2+5, 1+ 2, and 1. This gives the digits 4, 7, 3, and | of the product 458 - 3 in 
reverse order; the product itself is 1374. Multiplication by 6 is performed in exactly the 
same way to get 2748. 


Il 3-458 = 1374 


VI 6-458 = 2748 


The partial products are now added together—taking into account the place value of 
each—to arrive at 


1 6 4 8 8 


By allowing partial products to be obtained rapidly, Napier’s numbering rods essentially 
reduced the process of multiplication to one of addition. The device, now a historical 
curiosity, was highly esteemed in its time. 

The new movement to facilitate numerical calculation culminated in Napier’s in- 
vention of logarithms. The term “logarithm” meaning “reckoning number” was coined 
by Napier. It has been said that his creation came on the world “as a bolt from the 
blue,” unforeshadowed by the work of others or any previous line of thought. Napier 
spent 20 years of persistent effort in compiling his logarithmic tables, or “canons,” as 
he called them. As early as 1594, the astronomer Tycho Brahe heard from a visiting 
Scot the rst news of a great simpli cation in the art of computation. This monumental 
endeavor was nally revealed in 1614 in a small Latin volume of 147 pages—90 of 
them lled with tables—which bore the title Mirifii Logarithmorum Canonis Descriptio . 
No time was lost translating it (1616) into English as A Description of the Admirable 
Table of Logarithms. Curiously, the word “logarithm” appears on the title page, but the 
phrase “arti cial number” is used through the body of the text itself. The Descriptio was 
followed three years later by a work that had probably been written before it. Mirifii 
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Logarithmorum Canonis Constructio was published posthumously and gave an account 
of the method by which the tables were constructed. 

Napier’s purpose in devising logarithms was to provide assistance with the tedious 
effort needed for calculations involving very large numbers. He was familiar with Michael 
Stifel’s Arithmetica Integra of 1544, in which the German algebraist set down side-by- 
side successive powers of 2 and the corresponding exponents, 


0123 4 5 6 7 8 
12 4 8 16 32 64 128 256 


and pointed out that the sum of two terms in the (upper) arithmetic progression had 
a connection with the corresponding product of two terms in the (lower) geometric 
progression. Perhaps this gave Napier the germ of the idea of developing a procedure 
that would substitute the operations of addition and subtraction for those of multiplication 
and division. This is what the invention of logarithms achieved. 

The “wonderful canon” of Napier was designed for trigonometric applications, giving 
the logarithms of sines of angles from 30° to 90° at intervals of one minute. In his day and 
for some time thereafter, it was customary to regard the sine of an a angle @ not as a ratio 
but as the length of the half-chord subtending a central angle 2a in a circle with suitably 
large radius R; in symbols, sina = 4 chord 2a. Napier decided that by taking R = 10’, 
he would achieve an accuracy of seven signicant gures before introducing fractions. 
In particular, sin90° = 10’, which he calls the “whole sine” being the maximum value 
the sine could attain. 

In uenced no doubt by the remarks of Stifel, Napier started by pairing the terms of 
a geometric progression with those of an arithmetic one. For the successive terms of a 
geometric progression to be close together, their ratio must be almost 1. Napier chose 
to use 0.9999999, or in modern exponential notation 1 — 10-7, as the common ratio. 
Then to avoid the nuisance of decimals he multiplied this by 10’. The next step was to 
calculate the series of values 


wt =10""7; n=0,1,2,..., 100. 


After rst using the phrase “arti cial number,” Napier was later to call the exponent n 
the logarithm of the number 107(1 — 10~’)". His choice of terminology appears to be 
this: 107(1 — 10~’)" is derived from 10’ by n successive multiplications by the ratio 
1 — 10~’. Hence n, which is the logarithm, might be called either “the number of ratios” 
or “the reckoning number” depending on one’s translation of Jogos, a Greek word with 
several related English meanings. 

If N = 10/(1 — 107)”, let us write Nap.log N =n for the Naperian logarithm of 
N. Then Nap.log 107 = 0; that is, Nap.log sin 90° = 0 or as Napier says, “nothing is the 
logarithm of the radius.” Also, 


Nap. log 107(1 — 1077) = Nap. log 9999999 = 1, 
Nap. log 107(1 — 10-7)? = Nap. log 9999998.0000001 = 2, 
and so on. 


There are several points of interest. First, the values of Nap.log N increase as 
the numbers N (sine values) decrease, contrary to the behavior of modern logarithms. 
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Because of the choice of 10’ rather than 1 as the number whose logarithm is zero, the 
familiar rules for logarithmic computation do not apply in Napier’s system. In partic- 
ular, Nap.log MN is not equal to Nap.log M + Nap. log N. In fact, the relations M = 
107(1 — 10-7)", N = 107(1 — 10-7)", and 1 = 107(1 — 10~7)* imply that 


MN = 107(1 — 107-7y"*"107 
= 107(1 = 10-7y"*"(1 = 10-7)-* = 107(1 = 1g ye 


resulting in 
Nap. log MN =m-+n-—k = Nap. log M + Nap. log N — Nap. log 1. 


But to his great credit, Napier had converted the problem of multiplying large 
numbers to one of adding their logarithms (taking into account that Nap.log 1 = 
1611809). 

Because he did not use exponents to assist him—their notation was still something 
of a novelty—Napier calculated his geometric progression in an interesting chain of 
reasoning. This can be illustrated in modern symbolism by setting N, = 107(1 — 1077)’. 
It is an easy step to show that 


Nn = Nr-1C _ 107’) = Nn-1 — igs Ae 


Now 107-7N,_1 merely involves a shift of the decimal point, so that the value of N,, is 
obtained with little labor from that of N,,-;. Napier simply subtracted from each newly 
obtained term its 10~’th part to produce the next term, as follows: 


10000000.0000000 
—1.0000000 
9999999.00000000 
—0.9999999 
9999998.0000001 
—0.9999998 
9999997 .0000003 


He continued these successive subtractions until the hundredth term in his geometric 
progression was reached, which was Nj099 = 9999900.0004950; thus 100 is the logarithm 
of this last-written number. 

With the logarithms of the principal numbers in hand, it was still much more dif cult 
to obtain those of all the other intermediate numbers. This is not the place to discuss 
the details, other than to say that Napier used a sophisticated and ingenious interpolation 
scheme. : 

There is one more point worth noticing, namely, that the numerical value (1 — 10-7)"° 
is nearly equal to 1/e = lim,_,..(1 — n~')". Thus Napier’s logarithms are essentially a 
system of logarithms to the base 1/e, although he never thought of them in this way. 

Napier’s Descriptio was greeted with great enthusiasm, particularly by Henry Briggs 
(1561-1631) who wrote, “I never saw a book that pleased me better, or made me wonder 
more.” Briggs was then a professor of geometry at the newly founded Gresham College in 
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London; later, in 1620, he was to become the rst to hold the Savilian Chair in astronomy 
endowed at Oxford by Sir Henry Savile. So new and original did logarithms seem to 
Briggs that he hastened to Edinburgh for a month’s visit at Merchiston Castle. The story 
is told that at the very moment when Napier complained to a common friend, “Mr. Briggs 
will not come,” a knock on the gate announced his arrival. The two mathematicians gazed 
at one another rapt and speechless for a full quarter of an hour. During their ensuing 
discussions, Briggs suggested that the new system would be much more convenient 
if the logarithm of 1 were 0 and that of 10 were 1—improvements that Napier had 
already considered. This would make the logarithms of all numbers larger than one 
positive. Because Napier’s health would not permit it, Briggs embarked on the tedious 
task of preparing the rst set of common, or Briggsian, logarithms using 10 as a base. 
His Arithmetica Logarithmica, printed in 1624, contains 14-place logarithms of the rst 
20,000 numbers and those from 90,000 to 100,000. The logarithms of the intervening 
numbers, from 20,000 to 90,000, were calculated to 10 decimal places by the Dutch 
bookseller and publisher Adrian Vlacq (1600-1666). These appeared in 1628, along with 
Brigg’s results, in a work which the most industrious Vlacq modestly described as the 
second edition of the Arithmetica Logarithmica. 

For those with an interest in disputes regarding priority of invention, it should be 
mentioned that the Swiss instrument maker Jobst Burgi (1552-1632), independent of 
Napier, conceived the idea of logarithms toward the end of the 1580s. However, Biirgi’s 
tables were brought out in Prague only in 1620 with the title Arithmetische und Ge- 
ometrische Progress-Tabulen, six years after Napier’s Descriptio had established priority 
of publication. The Progress-Tabulen was printed under unfavorable circumstances, at 
the same time as the rst engagement of the Thirty Years’ War (the battle of the White 
Mountain, just beyond the walls of Prague). Few copies of Birgi’s work were saved, 
and it went virtually unnoticed. 


The Astronomical Discoveries of Brahe and Kepler 


Rarely has a new procedure proved so useful to mathematicians of its day as did 
Napier’s wonderful discovery. One early consequence was William Oughtred’s 1622 
invention of a mechanical computation device based on logarithms—the slide rule. The 
instrument, not precise enough for astronomers’ use, was not employed much in the 
seventeenth century. The almost immediate acceptance of logarithms on the European 
continent was mainly due to the zeal and reputation of Johannes Kepler who, grasping 
their value in the calculation of planetary motion, set about to compile his own tables. 
In a burst of high admiration, Kepler addressed (1620) a laudatory oration to the Baron 
of Merchiston. He intended it as a public compliment and did not know that Napier had 
been dead for three years. 

During the half-century after Copernicus, few eminent scientists were bold enough 
to champion a theory that pushed the sun out of its honored place at the center of the 
universe. Then in the early 1600s, a youthful Johannes Kepler, with his intense faith in 
the mathematical simplicity of nature, set forth a chain of daring conclusions that would 
strengthen and purify Copernicus’s scheme. Although Kepler is chie y remembered for 
his discovery of three great laws concerning the motion of the planets, his achievements 
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in pure mathematics would have been enough to win him enduring recognition. He was 
the rst to enunciate clearly the principle of continuity—the continuous change of a 
mathematical entity from one state to another—treating the parabola as the limiting case 
of either an ellipse or a hyperbola in which one of the two foci moves off to in nity. 
(He introduced the word “focus.”) Kepler found the area of a circle by imagining the 
circle to be made up of an in nite number of triangles having their common vertex at the 
center and their in nitely small bases lying along the circumference. Likewise, he showed 
the volume of a sphere to be one-third the product of the radius and the surface area by 
regarding the sphere as consisting of in nitely many small cones, each with its vertex 
at the center of the sphere and its base on the surface. This identi cation of curvilinear 
areas and volumes with the sum of an in nite number of in nitesimal elements of the 
same dimension is a remarkable anticipation of results found later in integral calculus. 
Kepler applied these conceptions to the volume of some 92 solids obtained by revolving 
conic sections about their diameters, chords, or tangents. His interest in volume problems 
apparently came after noting the inaccuracy of the methods then in use in estimating, for 
purposes of taxation, the capacity of wine casks. 

Johannes Kepler (1571-1630) was born in Weil, a city near Stuttgart in southern 
Germany, to impoverished parents. His father was a drunkard who repeatedly went off 
to ght as a mercenary soldier in the Low Countries, before settling down as a tavern- 
keeper. His mother was equally erratic; an ignorant woman of violent temper, her taste 
for sorcery eventually led to imprisonment on the charge of practicing witchcraft. Kepler 
himself was a sickly child unable to attend school regularly. An attack of smallpox, which 
nearly killed him at the age of four, left him with crippled hands and impaired vision. 
Owing to the kindly patronage of the duke of Wiirttemberg, the young Kepler was able to 
enroll at the University of Tiibingen, one of the centers of Protestant theology, receiving 
the degree master of arts in 1591. He was fortunate enough to study with Michael 
Mastlin, the professor of mathematics, who taught him privately the work of Copernicus 
while hardly daring to recognize it openly in his professorial lectures. In 1593, at age 
22, Kepler prepared a disputation enthusiastically supporting the Copernican theory of 
a sun-centered universe; but it was never presented, because the professor in charge of 
those academic exercises at Tiibingen was so unalterably opposed to Copernicanism that 
he refused to permit the dissertation to be heard. Kepler, feeling that his views were 
incompatible with dogma that then prevailed, abandoned his intention of entering the 
Lutheran ministry and instead accepted the position of “provincial mathematician” in 
the Protestant seminary at Graz (the capital of the Austrian province of Styria). 

It could hardly be said that Kepler was a success in this humble teaching post. 
Lecturing beyond the capacity of his pupils, he managed to drive most of them away. 
In the rst year, his mathematics classes were attended by only 12 students and in the 
second year by none. To keep his salary from being wasted, Kepler was assigned the 
duty of preparing a yearly almanac, which was expected to contain not merely the usual 
meteorological predictions but prophecies on the remarkable events of the coming year. 
When his predictions for the year 1595—a serious cold spell, peasant uprisings, and 
invasion by the Turks—were all ful Iled, Kepler gained a considerable reputation as a 
prophet and astrologer. 

From the time of his earliest research, Kepler had an almost Pythagorean belief in 
the simplicity and harmonious unity of the universe. He was convinced that the heavenly 
bodies were arranged according to some simple geometric law. (“Geometry provided God 
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Johannes Kepler 


(1571-1630) 


(From A Short History of Astronomy by Arthur Berry, 
1961, Dover Publications, Inc., N.Y.) 


with a model for the Creation and was implanted into man, and not merely conveyed to 
his mind through his eyes,” he declared in the Harmonices Mundi.) While at Graz, he 
made his_ rst attempt to discover hidden mathematical regularities embedded in the solar 
system. There were at that time six known planets—Mercury, Venus, Earth, Mars, Jupiter, 
and Saturn—and these were known to be at successively greater distances from the sun. 
Casting around for a pattern from which the Creator might have worked, Kepler arrived 
at the idea of relating the distances between the planetary orbits to the ve geometric 
bodies known as the regular solids: cube, tetrahedron, dodecahedron, icosahedron, and 
octahedron. He theorized that the hypothetical spheres of the planets could be inscribed in 
and circumscribed about the ve regular solids properly distributed in succession between 
them. The result satis ed Kepler that he had uncovered one of the fundamental secrets 
of the universe—the cosmos was constructed so that there were six and only six planets. 
In a letter penned shortly thereafter he recorded his inordinate delight, writing “The joy 
which this discovery gave me can never be described; I regret no more the time wasted.” 
The fruits of these intellectual speculations were set forth in 1596 in a book whose title 
may be abridged to Mysterium Cosmographicum [The Mystery of the Universe]. The 
work attracted considerable attention, not only because of the novelty of the ideas, but 
because it ardently advocated the Copernican planetary system as opposed to Ptolemy’s. 
Of course, the discovery of new planets quite upset the underlying assumptions of the 
Mysterium Cosmographicum. 

As a Lutheran, Kepler was subject to all kinds of disciplinary measures in the 
Catholic city of Graz; but his reputation mostly kept him free from the persecution that 
Galileo suffered as a result of his knowledge. Then, one day in 1598, all Protestant clergy 
and teachers were abruptly ordered to leave town by sunset. Although Kepler was allowed 
to return within a month, he saw that he could not hope to be left in peace much longer. 
By a happy chance for science, the Mysterium Cosmographicum had caught the eye of 
the Danish astronomer Tycho Brahe, who needed a junior partner in his astronomical 
research and offered Kepler the position as his chief assistant. 
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Kepler’s model, showing the relations between the planetary spheres and regular 


geometric solids. From his Mysterium Cosmographicum (1596). (Extract taken from A 
History of Science, Technology and Philosophy in the 16th and 17th Centuries, by A. Wolf. 
Reproduced by kind permission of Unwin Hyman Ltd.) 


Tycho Brahe (1546-1601) was the pioneer of accurate astronomical observation. 
With the help of the king of Denmark, he had constructed in 1576 a splendid observa- 
tory on a 2000-acre island near Copenhagen. He equipped this with the most accurate 
instruments possible, including a 37-foot quadrant for measuring altitudes. None of the 
instruments had lenses, for the telescope was not invented until around 1600. With im- 
mense patience and skill, he labored for some 20 years compiling a vastly more extensive 
and incomparably more precise set of records than any of his predecessors had possessed. 
When the king died, the patronage was not extended by his successor, so Tycho moved to 
Prague in 1599, taking his most portable instruments along. There he entered the service 
of the eccentric Rudolf IH, Holy Roman Emperor and the greatest patron of astrologers 
and alchemists in Europe. Kepler accepted Tycho’s invitation to join him and arrived 
in Prague early in the following year. It was a fortunate alliance. Tycho was a splendid 
observer but a poor mathematician, whereas Kepler was a splendid mathematician but a 
poor observer. 

Toward the end of 1601, after drinking copiously at a dinner party, Tycho was 
suddenly felled by a burst bladder. Adhering to the strict etiquette of the day, he remained 
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Diagram used by Kepler to demonstrate the elliptical orbit of Mars. From his 


Astronomia Nova (1609). (From A Short History of Astronomy by Arthur Berry, 1961, Dover 
Publications, Inc., N.Y.) 


at the table through the rest of the meal, not wishing to leave before the other guests 
did. He died 11 days later, after intense suffering. On his deathbed, he turned to Kepler 
in particular and begged him to complete some of his tables on planetary motion as 
quickly as possible. It is also said that, in the delirium that preceded his death, Tycho 
repeated several times, “I hope that I will not appear to have died in vain.” Kepler did 
not gain possession of Tycho’s instruments, and they were inadvertently burned. Because 
Kepler’s poor eyesight made him an indifferent observer, the loss was of little practical 
consequence. The most important scienti c inheritance Tycho left him was a vast wealth 
of astronomical observations of unparalleled accuracy. In the hands of Kepler, this store 
of information was to produce the next great advance in mathematical astronomy. 

Shortly before Tycho’s death, Kepler received the title of Imperial Mathematician 
from the Emperor Rudolf; and now he succeeded to Tycho’s position as of cial as- 
tronomer to the emperor. But his royal master had the habit of paying his salary only 
rarely or in part, so that Kepler was forced to earn additional income by casting the 
horoscopes of eminent men. “Mother Astronomy would certainly starve if the daughter 
Astrology did not earn their bread,” he is reported to have said. 
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The riddle of the orbit of Mars engrossed Kepler’s attention for the next eight years. 
Tycho’s very accurate measures of the position of Mars relative to the sun enabled Kepler 
to test various hypotheses and cast them aside when they proved incompatible with the 
observed movement. With a Pythagorean craving for simplicity, he felt sure that the orbit 
was a circle. It was only after many failures to t the data to a circular orbit that he 
began to suspect that it must be some other closed path. For a long time, Kepler was 
inclined to believe that it was an oval, shaped like an egg. He tried various sorts of 
ovals, but none eliminated the discrepancies between his tentative theories and Tycho’s 
observations. Years of work and disappointment nally forced him to the conclusion that 
only an elliptical orbit, with the sun occupying one of the two foci, satis ed Tycho’s 
data. The same was presumably true for all other planets, because the harmony of nature 
demanded that all “have similar habits.” This was Kepler’s celebrated rst law. Another 
conclusion he extracted from the astronomical data was that the speed with which a 
planet traversed its elliptical orbit varied in a regular pattern, accelerating with approach 
to the sun and decelerating with departure from the sun. From this he was led to another 
pillar of celestial mechanics, Kepler’s second law: The line drawn from the sun to a 
planet sweeps over equal areas in equal times. 

The full history of his investigation of Mars, together with the laws just stated, 
was published in 1609 in a long book called Astronomia Nova. After 10 years’ further 
effort, Kepler arrived at a relation, his third and last great law of planetary motion, 
connecting the times of revolution of any two planets with their respective distances 
from the sun. The ground was thus prepared for the later achievements of Isaac Newton, 
who was able to prove mathematically not only that the behavior implicitly extended by 
Kepler to all the planets agreed with observation but also that no other behavior was 
possible. 

Kepler’s celebrated results can be described thus: 


The planets move in elliptical orbits with the sun at one focus. 


2. Each planet moves around its orbit, not uniformly, but in such a way that a straight 
line drawn from the sun to the planet sweeps out equal areas in equal time 
intervals. 


3. The squares of the times required for any two planets to make complete orbits 
about the sun are proportional to the cubes of their mean distances from the sun. 


Archimedes is reported to have boasted, “Give me a place to stand and I will move the 
world.” The observations Tycho Brahe had gathered became for Kepler a place to stand, 
and he did move the world. His three laws, which established the rst correct principles of 
planetary mechanics, overturned medieval cosmology and much of Aristotelian physics. 
From ancient times through the days of Copernicus and Tycho the idea that a planet’s 
orbit must be a circle had been unchallenged. Indeed, a circle was accepted as the perfect 
geometrical form, and perfection was accepted as the normal state of heavenly affairs. 
That the real orbits of planets were ellipses was a triumph for the new astronomy and 
the right of Western scientists to pursue their investigations independent of theological 
doctrines. 

One fundamental question could not yet be answered: What held the planets in their 
courses? As we shall see, the vortex theory of Descartes and Newton’s theory of universal 
gravitation were responses to this challenge. 
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8.1 Problems 


Solve the following problems from the Treviso 
Arithmetic: 


(a) Aman _ nds a purse with an unknown number 
of ducats in it. After he spends 1/4, 1/5, and 1/6 
of the amount, 9 ducats remain. It is required to 
nd out how much money was in the purse. 


(b) A hare is 150 paces ahead of a hound which 
pursues him. The hare covers 6 paces each time 
the hound covers 10. It is required to know 
how many paces the hound has made when he 
overtakes the hare. 


(c) The Holy Father sent a courier from Rome to 
Venice, commanding him that he should reach 
Venice in 7 days. The most illustrious Signoria 
of Venice also sent another courier to Rome, 
who should reach Rome in 9 days. And from 
Rome to Venice is 250 miles. It happened that 
by the order of these lords the couriers started 
their journeys at the same time. It is required to 
nd in how many days they will meet. 


. From Nicolas Chuquet’s Tripary, 1484: I am owed 
3240 orins by a debtor who pays me 1  orin the 
rst day, 2 the second day, 3 the third day, and so 
on. In how many days will the debt be paid off? 


. From Robert Recorde’s The Whetstone of Witte, 
1557: A captain marshalls his army in a square 
formation. When the square is of one size, he has 
284 men too many. But when he rearranges them in 
a square one man more on a side than before, he 
lacks 25 men. How many men does he have? 


. From Christoph Clavius’s Algebra, 1608: If I gave 7 


lire to each beggar that came to my door, I would 
have 24 lire left. But if I tried to give them 9 lire 
apiece, I would be lacking 32 lire. How many 
beggars came to my door, and how many lire did I 
have? 


. The two problems below are found in the Rechnung 
(1552) of Adam Riese. Solve them. 


(a) A son asks his father how old he is. The father 
answers him by saying: If you were already as 
much, half again as much, and a fourth again as 
much older than you are now, and one more 
year, you would be 100 years old. The question 
is, how old is the son? 


10. 


11. 
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(b) Seven orins from Padua may be exchanged for 
5 at Venice, and 10 orins at Venice are worth 
6 at Nuremberg. Also 100 orins from 
Nuremberg are worth 73 at KéIn. What is the 
value in KéIn of 100 Paduan  orins? 


Use Napier’s rods to multiply 458 by 79. 


From the de nition of Napier’s logarithm, derive the 
formulas: 


(a) Nap.log(M/N) = 

Nap. log M — Nap. log N + Nap. log 1. 
(b) Nap.log M’ = 

r Nap. log M + (1 — r)Nap. log 1. 


If N = 107(1 — 10-7)", justify the equations below 
relating the Naperian logarithm of N to its natural 
logarithm: 


Nap. log N = log,_,9-7(10-7N) 
= 107 log,,_ 49-1107 10-7) 
= 107 log, /,(10-7N) = 107 log,(107/N). 


Find an approximation to the number N such that 
Nap.log N = 6. 


The transformation 


Nap. log 1 — Nap. log x 


1 = 
i Nap. log 1 — Nap. log 10 


converts Napier’s logarithm to Briggs’s common 
logarithm. Show that: 


(a) logl =0, log10=1. 

(b) logxy = logx + logy, 
log(x/y) = logx — logy. 

(c) logx” =rlogx. 

(d) log10"x =n + logy. 


Frangois Vieta’s trigonometric skill led to his 
discovery (1593) of an in nite-product expansion for 
z in terms of square roots. Supply the missing details 
in his derivation: 


(a) 1=sinz/2 


_ a T A ees cd XT 
2 sin — cos — = 2° { sin — cos — }) cos — 
4 4 8 


8 4 
= 2? (sin td cos =) cos id cos = 
~ 16 16 4 
gipia. Ie 1 sd cd Tv 
= 2" sin — cos — cos — cos — ---cos —. 
2" 4 8 16 2 


362 


8.2 


(b) 


(c) 


(a) 
(b) 


lim 
n—-oo 


u Iu WU 
2/m = cos — cos = COS — : :- 


( 
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m/2" (c) x24 12x = 64. 
nvco \ Sinz /2" (d) 3x? + 10x = 32. [Hint: Multiply both sides by 
1 3 and let z = 3x.] 
Qn-l —E) : 


13. To solve the equation x? + ax = b* geometrically, 
René Descartes would have used the method as 


8 16 described. Draw a line segment AB of length b and 


2+ 2c0s1/2 2+ 2cosx/4 at A erect a perpendicular AC of length a/2. With C 
_ (ES) (ES) as center, construct a circle of radius a/2 and draw a 
line through B and C, intersecting the circle at points 
V2 +2cos 7/16 oe D and E. Prove that the length of the segment BE is 
2 the value of x that satis e s x7 + ax = b’. 


_ v2 V24v2 y24+v24+v2 
-~ ee eee. 


12. Viéta solved the quadratic equation x? + ax = b by 
substituting x = y — a/2. This produces a quadratic 
in y in which the rst-de gree term is missing. Use 
Vieta’s method to solve the quadratic equations: 


x? + 8x =9. 


x? + 10x = 144. 


Among the principal movers in the seven- 


Descartes: The Discours de la Méthode teenth-century scientic revolution, René 


Descartes must certainly be included.Through 
the publication of La Géométrie, which made 
analytic geometry known to his contempo- 
raries, Descartes is generally acknowledged to have laid the foundations for the growth 
of mathematics in modern times. This rst really great advance beyond the techniques 
known to the ancients changed the face of mathematics and led, within a generation, to 
the development of the calculus by Newton and Leibniz. It is not too much to say that 
Descartes’s career marks the turning point between medieval and modern mathematics. 

René Descartes (1596-1650) was born at La Haye, a small town about 200 miles 
southwest of Paris, in the province of Touraine. His father belonged to the lesser nobility. 
He was a councilor at the Parlement of Brittany—in effect, a provincial judge. Descartes 
went through the normal upbringing of a gentleman of that time. At age eight he was 
placed in the lately founded Jesuit College of La Fleche, perhaps the most illustrious 
school in which a student could enroll. There he came to know Marin Mersenne, who 
was seven or eight years older. The rst ve years of the curriculum at La Fléche 
were devoted to the traditional course in languages and the humanities. The nal three 
years embraced logic, philosophy, physics, and mathematics. Mathematics, because of 
the certainty of its demonstrations, was the only subject that really satis ed Descartes, 
even at this early age. 

Descartes’s health was delicate during infancy and childhood, and he was not ex- 
pected to live long. His teachers at La Fléche, recognizing this physical weakness, treated 
him with exceptional consideration; regular attendance at lectures was not required of 
him, and he was allowed to lie in bed each morning as late as he pleased. He never lost 
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René Descartes 


(1596-1650) 


(Smithsonian Institution.) 


this habit; throughout the rest of his life (except for one unfortunate incident that may 
well have hastened his death), Descartes preferred to rise late, spending the early hours 
in bed meditating and writing. Indeed, when he visited Pascal in 1647, Descartes stated 
that the only way he could do good work in mathematics and preserve his health was 
never to allow anyone to get him up in the morning before he felt so inclined. 

On leaving school in 1612, Descartes followed the usual path of a young man of 
wealth living in France by going to Paris to taste the pleasures of its social life. This 
phase did not last long, for in Paris, he renewed his schoolboy friendship with that most 
indefatigable of learned gossips, the good Father Mersenne. Mersenne soon rekindled 
Descartes’s interest in serious study, and in almost cloistral retirement they devoted 
two years to mathematical investigation. Although the younger Descartes had no deep 
inclination to follow his father’s profession, he then entered the University of Poitiers, 
where he earned a degree in law in 1616. 

In 1617 Descartes, then 21 years old and tired of textbooks, decided to learn more 
about the world at rsthand. He enlisted in the army as a gentleman volunteer, rst 
joining the troops of Prince Maurice of Nassau in Holland and afterward taking service 
under the Duke of Bavaria. There is no evidence of any real soldiering on Descartes’s 
part, only years of leisure, in which he had time to pursue his favorite studies. The 
night of November 10, 1619, while in winter quarters with the Bavarian army along 
the Danube, was critical in Descartes’s life. He escaped the cold by shutting himself up 
alone all day in a “poéle’”—literally a stove, actually an overheated room. Tired from 
the heat, he dreamed three feverish dreams, in which he discovered “the foundations 
of a marvelous science.” At the same time his future career as a mathematician and 
philosopher was revealed to him. (Near the close of the nal dream, as Descartes tells 
us, he saw a book opened at a passage of the Latin poet Ausonius, containing the words 
“Which way shall I follow?” As the dream continued, an unknown man handed him a 
bit of verse beginning, “Is and is not,” which he understood as representing truth and 
falsehood in human knowledge.) Descartes neglected to specify the exact nature of the 
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marvelous science whose foundations he found on this memorable day. Some authorities 
are inclined to believe that he formulated the principles of analytic geometry; others 
feel that Descartes conceived a complete reform of philosophy based on the methods of 
mathematics. As Bertrand Russell observed, “Socrates used to meditate all day in the 
snow, but Descartes’s mind only worked when he was warm.” 

By 1628, having grown weary from years of aimless wandering through Holland, 
Germany, Hungary, and Italy, Descartes settled down to what might be called the produc- 
tive period of his life. Holland, which had recently won independence after a protracted 
struggle with Spain, seemed the country best tted to offer the tolerance and tranquility 
Descartes needed to pursue his researches. There, in great seclusion (barring three brief 
visits to France to look after family affairs), he meditated and wrote for 20 years. Until 
then he had published nothing. Descartes conceived therefore of writing an almost en- 
cyclopedic treatise on physics, which he chose to call Le Monde (The World). The time 
from 1629 to 1633 was occupied with building up a cosmological theory of vortices to 
explain all natural phenomena. On the eve of the completion of Le Monde, he learned 
that Galileo’s Dialogue on the Two Chief Systems of the World, published the previous 
year, had earned the censure of the Church. It was clear that the earth was not to be 
summarily dismissed from its position as the immovable center of the solar system. His 
own work, af rming as it did the heliocentric hypothesis, would have made him equally 
guilty with Galileo, so Descartes prudently abandoned the project. The publication of Le 
Monde had to wait until 1664, well after his death. 

It was not moral weakness that forced Descartes to suspend publication of Le Monde, 
but rather that he never ceased to regard himself as a sincere and devoted Roman Catholic. 
He wrote sadly to Mersenne, “This has so strongly affected me that I have almost 
resolved to burn all my manuscript, or at least show it to no one. But on no account 
will I publish anything that contains a word that might displease the Church.” Not that 
the fruits of his labor were withheld from the world, for Descartes did not destroy 
his papers as he rst threatened to do. The ideas contained therein, modied but not 
abandoned, had their presentation to the public in his rst principal published work, the 
Discours de la Méthode (1637). Although the Discours included a summary of Le Monde, 
Descartes so sidestepped the controversy over Copernicanism that one could glean little 
from it concerning his cosmology; in particular, any mention of vortices was studiously 
avoided. Finally, in 1644, the Principia Philosophiae was issued, in which he explained 
at some length the formation of the physical world, by “gradual and natural means” 
out of matter and motion. Descartes’s new “mechanical philosophy” quickly became the 
rage, a dominant feature of discussion in intellectual circles. 

By 1649, Descartes’s reputation had been established throughout Europe, and he was 
invited by Queen Christina of Sweden, the daughter of Gustavus Adolphus, to visit her 
court to tutor her in philosophy. She also suggested that he might help her in planning an 
academy of sciences that would rival the best in Europe. When Descartes had misgivings 
about living in “the land of bears amongst rocks and ice,” the young queen (she was 
then but 22 years old) dispatched an admiral to coax him and then a Swedish warship 
to fetch him. Accepting the invitation was a fatal decision on Descartes’s part. It is even 
said that a presentiment of death came over him as he prepared for the journey. 

Descartes was received with every honor and had no cause for complaint until the 
time drew near for his personal instruction of the queen to begin. From childhood on, 
Christina had slept no more than ve hours a night, and she was indifferent to heat or 
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cold. She proposed to Descartes that they meet three times a week, always at ve o’clock 
in the morning, when her mind was unfatigued and she felt the most energetic. For two 
months Descartes conformed to his royal pupil’s schedule, walking in the winter dawn 
from his rooms to the ice-cold library. His own lifelong routine was radically changed; 
as a foreign Catholic at a Lutheran court he felt isolated and homesick. “It seems to me,” 
he wrote to his friend the Comte de Brégy, “that men’s thoughts freeze here in winter 
just like water.” The rigors of one of the bitterest winters in memory proved too much for 
Descartes’s constitution, which had never been robust. On February 1, 1650, he caught 
a cold that rapidly developed into pneumonia, and he died after 10 days of suffering and 
delirtum. He was buried where Catholics were usually interred, in a cemetery set aside 
for infants who died before baptism. Fifteen years later his remains (except for the right 
hand, which was kept as a memento by the of cial who arranged the transaction) were 
conveyed back to France, where a magni cent monument was erected to his memory in 
the Church of Saint Genevieve. As Descartes’s doctrines were by then under the ban of 
both the Church and the universities, the funeral oration was prohibited by a court order, 
which arrived during the funeral service. 

The year 1637 saw the publication of the work that is considered the most signi cant 
of Descartes’s writings: Discours de la Méthode pour bien conduire sa Raison et chercher 
la Vérité dans les Sciences (Discourse on the Method of Rightly Conducting the Reason 
in the Search for Truth in the Sciences), with its scientic appendages La Dioptrique, 
Les Météores, and La Géométrie. The Discours is not, as commonly supposed, a formal 
philosophical treatise but a short autobiographical résumé of Descartes’s progress in 
arriving at his method. Its rst edition had 78 pages, roughly a sixth of the entire work. 
It was written in his native tongue, though traditionally Latin was used for learned 
subjects, and it showed at once the power and precision of the vernacular as a vehicle 
for expressing highly complicated philosophical and scientic thoughts. (In Principia 
Philosophiae, Descartes reverted to Latin to make the work more acceptable to the Church 
and the universities.) Descartes’s use of the French language speeded the diffusion of his 
ideas. The work was widely read; but though the Discours brought fame to its author, 
the fortune went to the book’s printer. The printer paid a small price indeed for one 
of the landmarks in Western thought. Descartes had asked only to be given, instead of 
royalties, 200 free copies of the new book for distribution to his friends. 

The whole of Descartes’s philosophy of “systematic doubt” as expounded in the 
Discours is dominated by his pursuit of certainty. The certainty of mathematics, he 
delighted to repeat, consists of this—it starts with the simplest elements whose truth is 
recognized, and then proceeds by the process of deduction from one evident proposition 
to another. Mathematics should therefore be a model for other branches of study. To let 
Descartes speak for himself: 


The long chains of simple and easy reasonings by means of which geometers are accustomed 
to reach the conclusions of their most dif cul t demonstrations led me to imagine that all 
things, to the knowledge of which man is competent, are mutually connected in the same 
way, and that there is nothing so far removed from us as to be beyond our reach, or so 
hidden that we cannot discover it, provided only we abstain from accepting the false for the 
true, and always preserve in our thoughts the order necessary for the deduction of one truth 
from another. 


The character of the reasoning of mathematics rather than the results was what so im- 
pressed Descartes. And he was anxious to see whether, by arguing in a mathematical 
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fashion from the most universal principles, it would be possible to deduce everything 
rationally knowable. 

The starting point for Descartes’s thought was to discover the simplest ideas or 
principles, those of which there could be no doubt. Because he had lost all con dence 
in traditional teachings. Descartes began by breaking away from authority altogether 
in matters of science and philosophy, deliberately rejecting all entrenched dogmas and 
doctrines. In his own words from the Discours: 


I thought that I ought to reject as absolutely false all opinions in regard to which I could 
suppose the least ground of doubt, in order to ascertain whether after that there remained 
anything in my belief that was wholly indubitable. 
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Descartes was thus led to one proposition so sound that it could not be doubted, the 
certainty of his own existence; for doubt itself is an act of thought and thought does 
not take place without a thinker. He enunciated this in the most famous sentence in 
philosophy, one that has been the subject of numerous commentaries: “Je pense, donc je 
suis.” (I think, therefore I am.) Having satis ed himself of the existence of a thinking 
being, Descartes passed on to a search for other propositions that appeared equally 
self-evident and irrefutable. For him, there was no greater guarantee of the truth of a 
proposition than that it should survive the most careful scrutiny of his own independent 
criticism. ““We ought never to allow ourselves to be persuaded of the truth of anything 
unless on the evidence of our reason,” Descartes wrote. This unbounded con dence in 
the capacity of human reason helped launch the Great Debate between faith and reason 
that was to preoccupy most Western Europeans in the century to come. 


Inventing Cartesian Geometry 


Three appendixes to the Discours were actual illustrations of Descartes’s new method 
of discovering scientic truths. Although the Discours was intended to be a preface to 
La Dioptrique, Les Météores, and La Géométrie, history has completely reversed the 
sequence; and today the Discours is studied by students of modern philosophy, while 
these works on science are virtually ignored. La Dioptrique (Dioptrics) deals with the 
nature and properties of light, including an account of the law of refraction, the anatomy 
of the human eye, and the shape of lenses best adapted for the newly invented telescope. 
Les Météores (Meteorology) aims at a scientic explanation of atmospheric phenomena; 
it is concerned with such topics as how snow crystals are formed, the size of raindrops, 
the cause of thunder and lightning, and the formation of the rainbow. Of the three essays 
accompanying the Discours, the third, La Géométrie (Geometry), is the one in which 
Descartes made his great and lasting contribution to pure mathematics. In the Géométrie, 
he combined the methods of algebra and geometry to produce the new eld of analytic 
geometry. The English philosopher John Stuart Mill called this “the greatest single step 
ever made in the progress of the exact sciences.” Tradition holds that the idea of analytic 
geometry came to Descartes while he watched a y crawl along the ceiling of his room 
near a corner; his immediate problem became expressing the path of the y in terms of 
its distance from the adjacent walls. The story is more agreeable fable than fact. 

Of the three parts of La Géométrie, the rst two are devoted mainly to applying 
algebra to geometry, while the third treats the theory of equations. Book I bears the title 
Problems Which Can Be Constructed by Means of Circles and Straight Lines Only. At the 
threshold of the work, Descartes introduced the algebraic notation still in use today. The 
last letters of the alphabet, x, y, and z, designate unknown quantities, and the rst letters 
of the alphabet stand for constants. Descartes was perhaps the rst to use the same letter 
for both positive and negative quantities. Our modern exponential notation for powers is 
also found here. In a more radical step, Descartes broke with Greek tradition by divorcing 
numbers from reference to physical quantity. Instead of interpreting a? (or aa as he wrote 
it) and a, for example, as an area and a volume, he considered them nothing more than 
lines. For Descartes, a” was simply the fourth term in the proportion 1 : a =a : a* and 
as such could be represented by a line once a was given. To devise a construction that 
corresponded to the proportion 1: a =a: a’, he arbitrarily chose a unit length 1, to 
which all other lengths were referred. Then a triangle with sides 1 and a was drawn; in a 
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similar triangle, in which the side corresponding to 1 was a, the other side would be a’. 


The problems that formed the central theme in La Géométrie were generalizations 
of the three- and four-line locus problems Pappus had propounded in his commentary 
on the Conics of Apollonius. In their original form the Pappus problems run: given four 
lines in a plane, nd the locus of a point that moves so that the product of the distances 
from two xed lines (along specied directions) is proportional to the square of the 
distance from the third line [three-line locus problem], or proportional to the product of 
the distances from the other two lines [four-line locus problem]. Whereas Pappus had 
stated without proof that the required locus was one of the conic sections, Descartes 
showed this algebraically. Subsequently, Newton solved the problem geometrically in 
his Principia (1687). 

Descartes began his attack on the problem by choosing one of the given lines, say, 
AB, anda _ xed point on it, say, A (he selected what would later be called an axis of 
coordinates and an origin). From an arbitrary point C of the locus sought, a straight line 
CB was drawn to AB, meeting it at a given angle. The lines AB and BC were then the 
quantities that would determine the position of C, and he called them x and y: 


I would simplify matters by considering one of the given lines and one of those to be drawn 
(for example, AB and BC) as the principal lines to which I shall try to refer all others. Call 
the segment of the line AB between A and B, x and call BC, y. 


The lengths of the other lines were expressed in terms of x and y; and by the conditions 
of the problem. Descartes combined these, to arrive at an equation of the curve upon 
which C would have to lie. 


A 


What we have here is the germinal idea of a coordinate system in which the position 
of a point in the plane is de ned_ by its distances, x and y, from two xed axes. Descartes 
was choosing what in current language is an oblique coordinate system, although he did 
not formally introduce a second axis, the y-axis. Nowhere in La Géométrie does the 
modern rectangular coordinate system appear. Descartes’s presentation differed from that 
now current also in his use of only positive values of x and y, that is, by his restriction 
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of curves to the rst quadrant. The name “coordinate” does not appear in the work of 
Descartes. This term is due to Leibniz, and so are “abscissa” and “ordinate” (1692). 

In the second book of La Géométrie, called On the Nature of Curved Lines, Descartes 
distinguished between two kinds of curves, geometrical and mechanical (or as Leibniz 
later preferred to call them, algebraic and transcendental). He insisted that curves more 
complex than lines, circles, and conic sections were proper objects of geometric investi- 
gation provided that they were securely de ned. Descartes’s criterion for the acceptability 
of curves was that their points should be determined by the intersection of two lines, 
each moving parallel to one coordinate axis with “commensurable” velocities. These are 
the geometric curves for which there is an algebraic equation in two variables de ning 
all their points. As Descartes said, 


370 


Chapter 8 The Mechanical World: Descartes and Newton 


All points of those curves which we may call “geometric,” that is, those which admit of 
precise and exact measurement, must bear a de ni te relation to all points of a straight line, 
and this relation must be expressed by means of a single equation. 


On the other hand, curves like the quadratrix and spiral, which arise from “two 
simultaneous motions whose relation does not admit of precise determination,” were 
rejected from geometry and stigmatized as mechanical curves. Such curves by their 
nature allow direct construction of only certain of their points. In this way Descartes 
widened the scope of elementary geometry, which until then had been restricted to 
constructions by straightedge and compass, by giving full geometric status to many 
curves formerly excluded. 

Descartes, in Book II, devised a purely algebraic method for nding the normal at 
any given point on a curve whose equation was known; the tangent line could be taken 
as the perpendicular through the point to the normal. Apart from the isolated attempt 
of Archimedes to draw the tangent to his spiral, constructing tangents to curves had not 
been seriously studied until the early 1600s, when it exercised the minds of some of the 
ablest mathematicians. Descartes, who was led to the “tangent problem” by his optical 
studies, called attention to it in Book II with the statement, “I dare say that this is not 
only the most useful and the most general problem in geometry that I know, but even 
that I have ever desired to know.” 

The approach Descartes used in constructing the normal—and so indirectly, the 
tangent—at any specied point P on a curve took as unknown the point of intersection 
of the normal and the x-axis. In other words, according to Descartes, let f(x, y) = 0 be 
the equation of the curve and (x9, yo) the coordinates of the point P. Suppose further 
that the normal to P is already drawn and that Q, with coordinates (x), 0), is the point 
at which the normal meets the x-axis. Then the equation of the circle with center Q 
passing through P is 


(«—m) +y? =@o— m1) + 9%. 


The result of eliminating one of the variables, say, y, between this equation and the 
equation f(x, y) = 0 of the curve is a new relation of the form g(x, x;) = 0. In general, 
a circle described with center Q and radius PQ will cut the curve in two points; but 
if PQ is the desired normal, the two points will coincide and the circle will be tangent 
to the curve at P. Therefore, in the equation g(x, x;) = 0 leading to the x-coordinates 
of the points at which the circle intersects the curve, we impose the condition that will 
render a pair of equal roots. This condition will give the correct value of x; for Q 
to be the point at which the normal cuts the x-axis. Thus, the problem is reduced to 
determining a double root of an algebraic equation. 
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Descartes used his method of tangents on many different curves in La Géométrie, 
each selected to show the process to best advantage. For an illustration, consider the 
construction of the tangent to the parabola y* = ax at the point (a, a). The equation of 
the circle having its center on the x-axis and going through (a, a) is 


@@-myP ty =@-myt+a 
The elimination of y by substituting ax for y? gives 
(x —x tax =(a—mY +a’, 
or 
x? + (a — 2x))x + 2a(x, — a) = 0. 


Because this is to be an equation with equal roots, Descartes would have compared it 
with (x — r)? = 0. Equating corresponding coef cients in 


x 4 (a — 2x1)x + 2a(x; — a) = x? —Irxtr’ 
gives a — 2x; = —2r and 2a(x; —a) = r?; whence 
(a — 2x1)? = 4r? = 8a(x; — a), 
or x; = 3a/2. Thus, the point on the x-axis through which the normal to the parabola at 


(a, a) passes is (3a/2, 0). The required tangent is then the line through (a, a) perpendic- 
ular to this normal. 


Missing in Descartes’s exposition was a general scheme for determining conditions 
under which a polynomial equation 


ox” +x" ~ he ba, = ie a, =0 


would have a double root r. In the examples worked out in La Géométrie, he usually found 
the condition by equating the polynomial as shown term by term with the polynomial 


(x — rox” — 2 + byx” — 3 4+ +e + Dy — 1x + Dp). 
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This was frequently tedious, because a lot of time was spent calculating the coef cients 
by in the equated polynomial, which were later themselves to be eliminated from the 
calculation. Thus, Descartes’s tangent method was soon superseded by Fermat’s. 


The Algebraic Aspect of La Géométrie 


The third and last book of the Géométrie belongs more properly to algebra than to 
geometry, concerning as it does the nature of equations and the principles underlying their 
solution. In the summary of the main properties of equations with which the book opens, 
Descartes recommended that all terms should be taken to one side and equated with 
zero. Though he was not the rst to suggest this, he was the earliest writer to realize the 
advantage to be gained. He pointed out that a polynomial f(x) was divisible by (x — a) 
if and only if a was a root of f(x). We nd too an intuitive proof of the theorem that 
any equation of degree n has n roots. Descartes’s words are: “Every equation can have 
as many distinct roots (values of the unknown quantity) as the number of dimensions of 
the unknown quantity in the equation.” His proof relied on the principle that every root 
a must appear in the corresponding linear factor (x — a) of f(x), and that it requires n 
such factors to achieve x” as the highest power of x in f(x). 

Apart from a reform of notation, which gives this part of the work a modern look, 
the lasting contributions are a systematic use of negative and imaginary roots (Descartes 
himself was among the rst to use this latter designation) and a result still known as 
Descartes’s rule of signs. This remarkable rule—implicit in the work of several earlier 
writers, including Cardan and Harriot, but r st used explicitly by Descartes—enables one 
merely by looking at a given equation to assign an upper bound to the number of its 
positive roots. 

Two consecutive terms of an equation are said to present a variation in sign if their 
coef cients have opposite signs; thus, in the equation x? + x? — x + 2 = 0, there are two 
variations. With this idea, Descartes’s rule of signs may be stated as follows: 


The number of positive roots (each root counted as often as its multiplicity) of 
an equation 


(1) f(x) = aox" +ayx""! + +++ + ay,-1X + a, = 0, ay > 0 


with real coef cients is equal either to the number of variations in the signs of its 
coef cients or to this number decreased by a positive even integer. 


For example, the equation x? + x? — x +2 =0 has either two positive roots or none, 
the exact number not being found by Descartes’s rule. The two roots may coincide to 
give a repeated root, but if they are distinct then neither is repeated. 

As originally formulated by Descartes, the rule stated that an equation could have 
as many “true” (real, positive) roots as its successive terms showed changes in sign. 
John Wallis claimed in his Algebra (1685) that Descartes failed to notice that the rule 
breaks down in the case of imaginary roots, observing, “This rule is either a mistake or 
an inadvertence, for it must be taken with this caution, that is, that the roots are real, 
not imaginary.” But Descartes said not that the equation always had to have so many 
roots, only that it might have. There is ample evidence that he was aware of the failure 
of his rule if the equation contained imaginary roots, and it is unfortunate that he did not 
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express the fact more clearly. Newton formulated it more precisely in his Arithmetica 
Universalis (1707). 

Because the negative roots of f(x) = 0 are simply the positive roots of f(—x) = 0, 
Descartes also gave a law of signs for the number of negative roots: No equation can 
have more negative roots than there are variations of sign in the coef cients of the 
polynomial f(—x). Thus, x° — 10x? + x + 1 = 0 has either two negative roots or none, 
since x° — 10x? — x + 1 = 0 has two sign changes. 

If it should happen that the largest possible number of positive roots added to the 
largest possible number of negative roots gives a sum less than the degree of the equation, 
then Descartes was prepared to recognize the existence of imaginary roots to Il out the 
number. Take, for instance, the equation x° + 3x7 + x — 1 =0. This equation, having 
only one sign variation, cannot have more than one positive root. Changing x to —x, 
we get x° — 3x? — x — 1 = 0, and because there is one variation, the original equation 
cannot have more than one negative root; hence, it cannot have more than two real roots. 
The proposed equation is of degree six, so it has at least four imaginary roots. 

In this book, Descartes carefully considered the problem of solving equations of the 
third degree and higher, and he succeeded in effecting a simple solution of the quartic. 
Starting with a general quartic x+ + ax? + bx? + cx +d =0, he replaced x by z—a/4 
to obtain a reduced quartic equation 


z+ pe +qz+r=0 


lacking a term in z>. The left-hand member is expressed as the product of two quadratic 
factors 


(2 +ke+m\(2 —ketn)= 24+ (m+n—k)22 + k(n — m)z t+ mn. 
Comparing the coef cients in the two forms of the equation leads to the relations 
p=m+n-Fk, g=k(n—™m), r=mn. 
Ifk #0, the rst two give 
2n= p+k4+q/k, 2m = p+k? —q/k. 
Substituting these values in mn = r, one gets the equation 
k8 + 2pk* + (p* — 4r)k? — q? =0. 


The latter, a cubic in k?, can be solved by methods already obtained. Any root k? 4 0 
produces a pair of quadratic equations 
1 q 
2 2 
ztkz+ 5 \P + 


and 


1 q 
2, 2 


whose four roots are the roots of the reduced quartic. 
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Descartes considered as an example of the technique the equation 
x* — 17x? — 20x —6 = 0. 
This led him to the cubic 
k® — 34k* + 313k? — 400 = 0. 


Because k? = 16 is a root, the solution of the given quartic is reduced to solving two 
quadratics, namely, 


2+42+43(16-17+ %)=0 
and 
2 — 42+ 3(16-17- %)=0, 
which is to say, 
24+4z24+2=0 and 2°-47-3=0. 
Solving these equations, Descartes got the roots 
—24+V72, -2-V2, 24+V77,  2-Vv7; 


and these are the roots of the original quartic. 

Descartes’s exposition in La Géométrie is so far from clear that the work has the 
appearance of an early draft of itself. He did not arrange the material in an orderly and 
systematic manner, and as a rule, he gave only indications of proofs, gladly leaving 
their detailed execution to the reader. Such statements as “I did not undertake to say 
everything,” or “It already wearies me to write so much about it,” occur frequently in 
the text. In concluding the work, Descartes justied the omissions and obscurities he 
affected with the remark that much was deliberately omitted “in order to give others the 
pleasure of discovering [it] for themselves.” 

This mathematical sloth was remedied when La Géométrie was translated into Latin 
and published with an explanatory commentary by Frans van Schooten, a professor of 
mathematics at Leiden. The Latin version underwent a total of four editions, appearing 
rst in 1649, then in 1659-1661, 1683, and 1693. It is safe to say that these editions 
established the place of coordinate geometry in the university mathematical courses in 
western Europe. For they not only made Descartes’s work available in the scholarly 
language of the time but included a large amount of supplementary material, clarifying 
the original account with which readers were having dif culty. In the greatly expanded 
two-volume version of 1659-1661, the commentary of van Schooten was more than twice 
the length of La Géométrie itself. The next generation of mathematicians all studied La 
Géométrie in one version or another, Wallis in the 1649 edition, and Newton in both the 
1649 and 1659-1661 editions, and Huygens used Descartes in the original French. 

The 1659-1661 edition of La Géométrie contained a convenient means of determin- 
ing the double roots that Descartes’s tangent method called for. This was supplied by 
the Dutch mathematician Johann Hudde, one of van Schooten’s most capable students 
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and the burgomaster of Amsterdam for some 30 years. Hudde’s rule, as it came to be 
known, asserted that if r was a double root of the polynomial equation 


ox" +ayx" | +--+ + ay_1x + an = 0, 
then r was also a root of the equation 
agbox”" + ayb\x"—! tes + Gy—1bn—-1X + anby = 0, 


where bo, bi,...,b, were any n numbers in arithmetic progression. The rule can be 
illustrated by the equation x? — 5x? + 8x — 4 = 0. Taking the arithmetic progression 3, 
2, 1, 0 (where the largest term is equal to the degree of the equation) and multiplying 
each coef cient of the equation by the corresponding term in the progression, one gets 
3x3 — 10x? + 8x = 0, or 3x? — 10x + 8 = 0. This equation has 2 as a root, and since 2 
is also a root of the original equation, it is a double root. Had there been no common 
roots, the original equation would not have possessed a double root. 


Descartes’s Principia Philosophiae 


In addition to the attempt to transfer the methods of mathematical thought to the 
physical sciences, Descartes’s great endeavor was to replace the medieval world-picture— 
with its blend of Aristotelian physics and Ptolemaic astronomy—by a scientic system 
in which all physical explanations could be sought. Descartes believed that the principal 
phenomena of nature could be framed in terms of mathematical and mechanical laws 
alone, laws of which one could be absolutely certain. He told us that “the rules of 
mechanics ... are the same as those of nature.” His conception of a mechanical universe, 
in which physical objects or events in one part exerted an in uence on all others, was to 
attain such popularity that references to the solar system as a gigantic piece of clockwork 
machinery became commonplace. 

In the Principia Philosophiae, which appeared in Amsterdam in 1644, Descartes 
used the notion of vortices as a creative mechanism to account for the origin and the 
current state of the universe. Rejecting the notion of a void or vacuum in nature, he 
postulated that the space occupied by the solar system was_ lled with vaguely de ned 
primordial matter, the so-called plenum or ether. This all-pervading material, having 
been endowed by God’s hand at the beginning with a xed and nite quantity of motion, 
evolved thereafter by the laws of mechanics, without interference. The motion of the 
stellar mass set up an immense whirlpool or vortex, in the middle of which was the 
sun. Within this vast ethereal vortex there were lesser vortices tting together like a 
mass of soap bubbles; these exerted local in uences much as eddies arose in streams 
and were carried along by the current. To escape censure by the Church on the question 
of the earth’s motion, Descartes asserted that the earth was at rest within its vortex and 
therefore stationary relative to it. At the same time, it was borne by its surrounding 
celestial matter around the sun, just as a boat at repose in the middle of a sea could be 
carried imperceptibly by the tides. Thus, by a stretch of the imagination, Descartes was 
able formally to declare that the earth could be considered not to move. One critic has 
claimed that this theory of a motionless earth carried along by an enveloping ether was 
like a worm in a Dutch cheese sent from Amsterdam to Batavia; the worm has traveled 
6000 miles without changing its location. 
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By comparing heavenly bodies in motion with terrestrial objects caught up in 
whirlpools of water, Descartes was able very plausibly to explain the workings of the so- 
lar system in readily understandable terms. Everyone had seen eddies form in the course 
of a river, trapping and carrying along a passing leaf, and the mental picture carried 
conviction. It could not be said of Descartes’s ideas on the creation, as was written of 
the cosmology of Robert Fludd (1574-1637), “The obscurity of the style is only equalled 
by the absurdity of the matter.” 

Though the theory of vortices was an ingenious scheme, it was unsupported by 
any experimental evidence and was irreconcilable with a multitude of known celestial 
phenomena, including Kepler’s law that the motions of the planets were not circular 
but elliptical. Despite the inherent defects of Descartes’s new doctrine, there was 
something in it that satis ed the philosophical hunger of the times. It attracted a host 
of enthusiastic adherents, who found it wholly intelligible, and it withstood for half 
a century all efforts to dislodge it (probably because it had no real competition). But 
then came Isaac Newton, with his mathematical proofs involving gravitation, which 
could not be explained by vortical motion. By curious coincidence, the publication of 
Descartes’s Principia Philosophiae coincided almost exactly with the birth of Newton, 
who before many years was to demolish the Cartesian concept of ethereal vortices to 
make way for his greater theory of universal gravitation. Not that Descartes’s views 
were easily displaced by the celebrated alternative, particularly in France where national 
pride led to an unwillingness to accept Newtonian physics. For a generation or so after 
the publication of Newton’s Principia Mathematica, the spiritual heirs of Descartes tried 
to remove some of the glaring blemishes in the original theory. Not before the second 
half of the eighteenth century did the Cartesian scientic ideas altogether lose their 
command. Descartes, for all his fantasies, made the rst serious effort to emancipate 
natural knowledge from theology, that is, to explain the architecture of the universe by 
mechanical principles without recourse to divine intervention or occult forces. 

It was the general policy of the Church to allow new scientic positions to be 
stated hypothetically, provided that they were formally declared to be conditional on the 
Church’s interpretation of divine revealed truth. (Galileo was condemned mainly because 
the acid mockery of his dialogue too obviously belied his formal declaration.) To avoid 
ecclesiastical censure, Descartes explicitly put forward the vortex theory as speculative. 
He stated in the Principia Philosophiae itself, “I want what I have written to be taken 
simply as an hypothesis which is perhaps far removed from the truth.” Descartes did not 
hesitate to say that though his theory might be wrong in detail, it had, nevertheless, the 
great merit of providing a plausible explanation of gravity, light, magnetism, and other 
physical properties hitherto regarded as “innate” to the world. This wily maneuver could 
hardly have deceived the trained censors of the Holy Of ce, and Descartes’s works were 
placed on the Index of Prohibited Books in 1663. This, however, did not detract from 
their growing popularity—although it was enough to prevent a funeral oration for him. 
By 1740, his works were removed from the Paris Index in order to provide an alternative 
to the Newtonian synthesis, which was then gaining gradual acceptance in France. 

Descartes’s achievements in mathematics, philosophy, optics, meteorology, and sci- 
ence leave no doubt that he was the dominant thinker of the 1600s. Descartes was only 
incidentally a mathematician, and his analytic geometry—or as Voltaire described it, “the 
method of giving algebraic equations to curves”—was just an episode in a career devoted 
to many innovations. It is curious that this man who brought such luster to France lived 
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Diagram of Descartes’s system of celestial vortices. From his Principia 


Philosophiae (1644). (Extract taken from A History of Science, Technology and Philosophy in 
the 16th and 17th Centuries, by A. Wolf. Reproduced by kind permission of Unwin Hyman, Ltd.) 


nearly all his productive years beyond her borders, taught in none of her schools, and 
even as a soldier fought in none of her foreign wars. 


Perspective Geometry: Desargues and Poncelet 


About the same time that Descartes was attempting to algebraize geometry, Girard 
Desargues (1593-1662), a self-taught architect and engineer of Lyons, was opening new 
grounds with a purely geometric study of perspective. His program was grounded in 
the efforts of the Renaissance painters to represent, as accurately as possible, the three- 
dimensional world on a_ at canvas. Little is known of Desargues’s life other than that 
he gave a series of public lectures in Paris between 1626 and 1630. It is said that he 
saw service as an engineer in 1628, at the famous siege of the Huguenot bastion of La 
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Rochelle and that he there made the acquaintance of Descartes. Most of Desargues’s work 
on the projective geometry—the study of properties of gures that remain unaltered under 
projection—is embodied in a brief treatise that appeared, as an edition of 50 copies, in 
1639. Its title, “Brouillon project dine atteinte aux ‘evénemens des rencontres diin cone 
avec un plan,” might be translated as “Rough Draft of an Investigation into the Results 
of the Intersection of a Cone and Plane.” 

Desargues’s great break with the past called for extending the Euclidean plane by 
adding certain “in nitely distant” points. He considered two or more parallel straight 
lines as meeting in one and only one point—a natural, direct development of painters’ 
new techniques of perspective. The addition of this point, called the point at in nity, 
represented no contradiction to Euclid’s geometry, but rather an extension of it. Because 
there are many sets of parallel lines, each having its own direction, there are many such 
points at in nity. These new points are all considered to lie on one new line—later called 
the line at infiity —-which would contain no ordinary points. The plane, together with 
the line at in nity, is called the projective plane. 

Adopting the convention of the line at in nity had the effect of eliminating both- 
ersome special cases caused by the concept of parallelism: For instance he could assert 
that any two distinct lines meet in one and only one point. 

Desargues treated asymptotes of the hyperbola as its tangents, with the points of 
contact lying at an in nite distance from the hyperbola’s center. He also viewed lines as 
parts of circles of in nite radius. 

The “Rough Draft” contains many important results, including one which still 
bears Desargues’s name. In short, Desargues’s Theorem states that if the lines joining 
corresponding vertices of two triangles pass through a point, then the intersection 
points of corresponding sides lie on a line. With reference to the gure, suppose that 
triangles ABC and A’B’C’ are so situated that the three lines AA’, BB’, and CC’ 
joining corresponding vertices pass through the same point P; the conclusion is that the 
three pairs of sides BC with B’C’, CA with C’A’, and AB with A’B’, when suitably 
extended, meet in points R, S, and T which lie on a line. There are no parallel lines 
in the new geometry of Desargues, whence R, S$, and 7 will always exist; points at 
in nity have the same status as ordinary points. 


The Rough Draft was not well received by Desargues’s contemporaries or his imme- 
diate successors, with the important exception of the youthful Blaise Pascal. He wrote, 
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in his Essay pour les Coniques, “I wish to acknowledge that I owe the little that I have 
discussed in this subject to his writings.” The general dismissal of Desargues’s short trea- 
tise is not surprising. It was no doubt overshadowed by the rapid success of Descartes’s 
Geomeétrie introduced two years earlier. It had the added disadvantage of being written 
in such a strange style and vocabulary as to make it almost unreadable. Not being a 
professional mathematician, Desargues was obliged to invent a number of new technical 
terms. He borrowed many of them from botany: His readers found it hard to guess that 
“brins” (twigs) referred to line segments and “noeuds” (knots) were their end points. 
Conics were introduced as “coupes de rouleau,” that is, slices with a cone. Of some 70 
eccentric terms used by Desargues, only the word “involution” has survived. 

Descartes was quick to point out to his friend that “your terms will only have 
the effect of making your proofs more dif cult, and discourage people from reading 
them.” Indeed, most mathematical scholars were either baf ed or repelled by the obscure 
nomenclature and the absence of symbolism. The publisher eventually destroyed the 
unsold copies of the Rough Draft; all the copies were lost by the end of the century. 
Desargues’s ideas on projective methods sank into oblivion for nearly two centuries. 
Fortunately, a handwritten copy of the Rough Draft, discovered by chance in a Paris 
bookstore in 1845, allowed his contributions to the new kind of geometry to survive. 

For the next century and a half after Desargues’s lifetime there was no signi cant 
progress in projective geometry. The subject was dramatically rescued from neglect 
by a young French military engineer, Jean Victor Poncelet (1788-1867). Before being 
commissioned an of cer in Napoleon’s Grande Armée, he had been a student of the 
geometer Gaspard Monge at the Ecole Polytechnique. Poncelet was abandoned as dead 
on the battle eld during the disastrous retreat from Moscow and spent the year 1813-1814 
in a Russian prison. To keep up his spirits he reconstructed, without the aid of books, 
what he had learned from Monge of plane and analytic geometry; and he presented 
this material, along with his own results on projective geometry, to his fellow prisoners 
who had also attended the Ecole Polytechnique. On his return to France late in 1814, 
Poncelet began to put the work of his prison year in a form suitable for publication. The 
monumental Traité des Propriétés Projectives des Figures, published in 1822, gave the 
new geometry its rst full presentation as a separate mathematical discipline. 

Poncelet later served the French government in various capacities, which often de- 
prived him of time for mathematics. In 1837 he became professor of applied mechanics 
at the University of Paris; and thereafter (1848) was appointed to the post of rector at 
the Ecole Polytechnique. 

In the Traité, Poncelet made extensive use of a controversial principle of geometric 
continuity. The principle asserted the permanence of geometric properties as one gure is 
transformed, by projection or any gradual distortion, into another gure. Although it led 
him to far-reaching results, the principle was dangerously vague and had to be handled 
with great care; for, as Cauchy indicated, it was “capable of leading to manifest errors.” 
Poncelet also exploited the law of duality in projective planes, which may be expressed 
as follows: Any valid statement about a relationship between points and lines gives 
rise to a second valid statement when the words “point” and “line” are interchanged. 
This remarkable property means that we need only prove one of two dual statements, 
as the other is then necessarily true. The dual of Desargues’s theorem, for instance, 
would say that if two triangles are so situated that the three points of intersection of 
pairs of (extended) corresponding sides lie on a straight line, then the lines joining 
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the corresponding vertices all pass through the same point. As the principal creator of 
projective geometry as a separate mathematical discipline, Poncelet can rightly be called 


the “father of modern geometry.” 


8.2 Problems 


1. 


2. 


To multiply two numbers geometrically, Descartes 
said: 


Let AB be taken as unity, and let it be required 
to multiply BD by BC. I have only to join the 
points A and C, and draw DE parallel to CA; 

then BE is the product of BD and BC. 


Show that the length of BE is the product of the 
lengths of BD and BC. 


In La Géométrie, Descartes constructed the positive 
solutions to the quadratic equation x7 = ax — b?, 
where b < a/2. Given a circle of radius NL = a/2, 
draw a tangent to L and lay off from the point of 
contact a length LM = b. Then, through M, draw a 
line parallel to NL, 


cutting the circle in the points Q and R. Prove that 
the lengths MQ and MR represent the two positive 
solutions to x? = ax — b’. [Hint: If the parallel to 
LM through Q cuts the diameter in segments of 
length y and z, then y+ z =a and yz=)?.] 


Assume that in the ve-line Pappus problem, four of 
the lines /, Jy, 15, and J, are parallel and an equal 
distance apart, and that the ft h line /; is 
perpendicular to the others. Prove that if /; and /, are 


taken as the x-axis and y-axis, 


respectively, and if p, denotes the distance of a point 
P = (x, y) from the line /,, then the locus of all 
points satisfying p; p3p4 = ap2ps is given by 


(a+x)(a —x)(2a — x) =axy. 
This locus, which occurs in La Géométrie, was later 


called the Cartesian parabola, or trident, by Newton. 


Show that the equation x3 — x7 + 2x + 1=0 has no 
positive roots. [Hint: Multiply by x + 1, which does 
not change the number of positive roots. ] 


. Find the number of positive roots of the equation 


x 4+2x3-—x?4+x-1=0. 


. From Descartes’s rule of signs, conclude that the 


2n 


equation x-” — 1 = 0 has 2n — 2 imaginary roots. 


. Without actually obtaining these roots, show that 


(a) x°+3x+7=0 and 
(b) x°—5x° — 7x7 +8x+20=0 


both possess imaginary roots. 


. Verify the following assertions. 


(a) Ifall the coef cients of an equation are positive 
and the equation involves no odd powers of x, 
then all its roots are imaginary. 

(b) Ifall the coef cients of an equation are positive 
and all terms involve odd powers of x, then 
zero is the only real root of the equation. 

(c) An equation with only positive coef cients 
cannot have a positive root. 


. Prove that 


(a) The equation x? + a?x + b* = 0 has one 
negative and two imaginary roots if b ¥ 0. 
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10. 


11. 


8.3 


(b) The equation x? — a*x + b* = 0 has one 
negative root, while the other two roots are 
either imaginary or both positive. 

(c) The equation x* + a*x? + b*x — c? = 0 has just 
two imaginary roots if c # 0. 


Use Descartes’s method to nd the solutions of the 
following quartic equations: 


(a) x*—3x?+ 6x —2=0. [Hint: The sextic 
k® — 6k* + 17k? — 36 = 0 has k®> =4 asa 
solution. ] 

(b) x*—2x?-— 8x -—3=0. 


The logarithmic spiral, whose equation in polar 
coordinates is r = ae“, where —oo < @ < 00 

(a > 0, c > 0), was invented by Descartes in 1638. 
Prove that if P = (r;, 6,) is any point on the spiral 
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other than the origin, then the part of the spiral from 
the origin to P is an unending curve of nite length. 


Ps(r). 0) 


(Hint: The length of the arc in question is given by 
a1 dr \” 
i r+ (=) do. 
oe dé 


For the English, the seventeenth century was an 
age of great cultural achievement—the time of 
Shakespeare and then Milton—and grave domes- 
tic upheaval. In England, as elsewhere in Eu- 
rope, kings who claimed to rule by the will of 
God clashed with their representative bodies who 


claimed to rule by the will of the people. The smoldering antagonism between Charles I 
and Parliament broke into a con ict of the worst kind, the Great Civil War of 1642-1646 
(often called the Puritan Revolution). By 1650, England had beheaded its monarch as 
“tyrant, traitor, and enemy of the Commonwealth” and was experimenting with a repub- 
lican form of government under the leadership of Oliver Cromwell. Cromwell’s death in 
1658 weakened the Commonwealth suf ciently to allow the restoration of Charles II to 
the English throne in 1660. After that, England settled down: at the close of the century 
the country was able to provide to Europe a model of limited monarchy, constitutional 
government, and a measured religious toleration. 

The period was an extremely creative one for mathematics and modern science. The 
growing intellectual curiosity in scienti c matters may be judged by the proliferation 
of academies or societies devoted to the pursuit of experimental knowledge. These in- 
formal collaborative groups were often only incidentally connected to the government, 
remaining all the while hopeful of royal patronage. Their emergence was partly a reaction 
to the indifference of the universities which, still formally medieval, lagged behind the 
mainstream of scientic development. Most scholars of the period drew little distinction 
between the physical and mathematical sciences; the best scientic accomplishments and 
the most profound mathematical ones were frequently realized by the same individuals. 
England’s rich soil nourished an array of brilliant thinkers, among them Robert Hooke, 
James Gregory, Edmund Halley, and Isaac Newton. Newton’s supreme achievement—the 
differential calculus—not only reshaped the unsound conceptions of planetary motion, 
but provided a rational basis for physical science as a whole. Robert Boyle’s declaration, 


“Nature plays the mathematician, 


af rmed his belief that the power of mathematical 


reasoning was the most effective instrument for the attainment of scientic truth. 
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By 1660 England had become, arguably, the major center for organized scienti c 
activity in Europe. And yet during the early part of the century the English contribution 
to mathematics was quite modest, often coming from individuals who were not mathe- 
maticians by profession. The church rector William Oughtred (1574-1660) typi es these 
practitioners. His Clavis Mathematicae (1631), a treatise of only 88 cramped pages serv- 
ing as an introduction to arithmetic and algebra, was one of the few textbooks published 
in England in those days. Its in uence was felt through ve Latin editions and two aug- 
mented English translations, the rst in 1647. When Viéta’s symbolic form of reasoning 
was still little known, the Clavis carried it to extreme by employing over 150 signs 
and abbreviations, some curious and ill-chosen. Oughtred explained that some symbols 
eliminated “verbose expressions” and permitted one to grasp a mathematical situation 
more clearly. The only notations of his that are still widely used today are the cross x 
denoting multiplication, and the plus-or-minus symbol +. Oughtred was sometimes crit- 
icized for neglecting his clerical calling by spending time giving free private lessons in 
his rectory to pupils inclined toward mathematics; on the other hand, among these pupils 
was Christopher Wren, the future architect of St. Paul’s Cathedral. A staunch Royalist, 
Oughtred is said to have died in a “transport of joy’”—at the age of 86—upon hearing 
that the House of Commons had voted for the restoration of Charles II. 

The year 1631 was the date of publication, not only of Oughtred’s Clavis Mathemat- 
icae, but also of Thomas Harriot’s Artis Analyticae Praxis. The very title was modeled 
after that used by Viéta 40 years earlier. Probably written in 1610, this larger though 
inferior treatment of algebra appeared in print a full decade after its author’s death. At 
the time of his death in 1621, Harriot was famous throughout Europe for the publication 
of only one book: not a mathematical work but A briefe and true report of the new found 
land of Virginia (1588). 

Employed to teach the science of navigation to the sea captains of his patron, Sir 
Walter Raleigh, Harriot had been sent on a colonizing expedition to Virginia in 1585. He 
returned from the New World a year later, having mapped what is now North Carolina 
and with an addiction to tobacco smoke. His last years were plagued by cancer of the 
nose. Although Harriot’s will directed his executors to review and publish as much of 
his mathematical writings as might prove feasible, only the Praxis was ever brought out. 

In its enthusiastic embrace of the new “analytic art,” Harriot’s Praxis is perhaps 
more symbolic than any other algebra written; page after page is so symbol-laden as to be 
almost devoid of prose. Its style caused some, who could not see the growing capabilities 
of the new language, to describe algebra as “covered over with a scab of symbols.” 

The Praxis is the work in which the signs > and < for “greater than” and “less than” 
are rst introduced. Following his predecessor Vieta, Harriot used vowels for unknowns 
and consonants for known quantities; but for Viéta’s capital letters he substituted low- 
ercase ones. His iteration of symbols, as with aaaa for a+, was superior to Oughtred’s 
aqq used for the same purpose. In connection with numbers, Harriot used the words “af- 
rmative” and “negative,” using the minus sign to denote the latter. On the other hand, 
negative roots of equations were rejected as being useless. Another notable innovation 
was the use of a long horizontal bar to cover all the terms affected by a radical sign. 
Unfortunately for Harriot’s reputation, the appearance of the Praxis initiated two con- 
troversies fueled by nationalistic pride: rst, how much and how directly did his work 
derive from Viéta, and second, to what extent Descartes’s algebra was borrowed from 
Harriot. 
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The main focus for mathematical interest in those years was neither at Oxford nor 
Cambridge, but at London’s little Gresham College. The college had been founded in 
1596 under the terms of the will of the wealthy merchant Thomas Gresham, and continued 
to exist for 160 years. It offered a practical curriculum of geometry, trigonometry, and 
astronomy, subjects that an audience of navigators and surveyors now required. On the 
whole, the Gresham professorate was far abler and more up-to-date than those to be 
found in the nation’s two universities. Both old institutions still preserved the traditional 
pattern of education preparatory to the learned professions of medicine, law, and divinity. 
Mathematical studies occupied a very peripheral position in the curriculum, unrecognized 
as a formal academic subject. The stipulated texts were still those of ancient authors: 
Euclid, Ptolemy, Apollonius, and Aristotle. There was little opportunity to delve into 
more modern works, although by midcentury Henry Billingsley’s 1570 translation of the 
Elements, Peter Ramus’s Algebra Libri Duo (1586), and Oughtred’s Clavis Mathematicae 
were occasionally assigned. (Billingsley, a merchant, had attended Oxford and Cambridge 
but never took a degree.) The more mathematically adventurous students acquired a 
knowledge of the subject beyond the university walls, through their own effort and 
talent. What is remarkable is that Wallis, Barrow, and Newton would emerge out of this 
mathematical desert. 


Wallis’s Arithmetica Infinitorum 


Probably the most able mathematician to arise in the interval between Descartes 
and Newton was England’s John Wallis (1616-1703). He entered Cambridge University 
in 1632, studied theology and received the master’s degree in 1640, the same year in 
which he took Holy Orders. Wallis held a Fellowship (the local term for a junior faculty 
appointment) at Cambridge for about a year, but vacated the coveted position when he 
decided to marry. While living in London, he attended the weekly meetings of a group 
of “learned men” interested in scientic discussion and experiments; in a few years this 
informal gathering grew into the Royal Society of London, with Wallis as a founding 
member. When 32 years of age, he was appointed Savilian Professor of Geometry at 
Oxford, a position he kept until his death over half a century later. Wallis’s installation 
came as a surprise to many, for there was little evidence as yet of his mathematical 
promise. 

Wallis is often called the father of English cryptography, a title resulting from his 
activities during the Civil War. Early in the conict Wallis was shown a royal letter 
written in cipher, which he managed to decode in the space of two hours. This feat 
began his long career as an expert in cryptographics. At the behest of Parliament, he 
exercised his special talent by deciphering secret dispatches of Charles I that had fallen 
into Puritan hands. Wallis subsequently secured the living of a parish in London and 
the highly prized Savilian professorship as rewards for his service. At the restoration of 
the monarchy, Wallis’s cryptographic skill was deemed so valuable that Charles II was 
not averse to employing the man who had so recently worked against the royal cause; 
Charles conrmed the position at Oxford and made Wallis the king’s chaplain. Wallis 
had managed to stay on fairly good terms with the monarchy by possessing the courage 
to sign a petition, known as the Remonstrance, against the execution of Charles I. 

For someone who gured so prominently in the annals of English mathematics, 
Wallis was surprisingly late in making it a serious occupation. Around the age of 30, 
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Wallis happened upon a copy of Oughtred’s Clavis Mathematicae, which seemed to cast a 
“symbolic spell” upon him. Quickly mastering it, he charted a path through the literature, 
reading the works of Torricelli, Cavalieri, and Harriot. The statutes of the Savilian chair 
required that Wallis offer introductory courses in practical and theoretical mathematics. 
This elementary material became the basis for his rst contributions, the Tractatus de 
Sectionibus Conicis (1656) and Mathesis Universalis (1657). The Tractatus is notewor- 
thy for its use of Descartes’s coordinate geometry to identify the ellipse, parabola, and 
hyperbola with certain equations of the second degree, rather than by treating them as 
curves generated by plane sections of a cone. In 1655, Wallis published the Arithmetica 
Infiitorum , the main work on which his mathematical renown is grounded. It, more 
than any other volume, served as a springboard to Newton’s results on the differential 
calculus a decade later. Here the familiar knot symbol oo for “in nity” makes its initial 
appearance in print. 

Wallis’s last and most widely read book, entitled A Treatise of Algebra: Both Histor- 
ical and Practical, was nished in 1676 but not published until 1685 when he was in his 
seventieth year. Written in his native language instead of the still universally used Latin, 
it marked the rst serious attempt in England to produce a history of mathematics. A 
number of pages in the Algebra are devoted to the historical development of the subject. 
Wallis comments on the work of the Italian algebraists Pacioli, Cardan, and Bombelli, 
contrasting their efforts with those of Vieta, Descartes, and Harriot. The Algebra differed 
from the introductions of Oughtred and Harriot in advocating the acceptance of negative 
and imaginary numbers. For, Wallis argued, even if these quantities de ed a meaning- 
ful de nition, they were mathematically useful and could be interpreted in the physical 
world. To reinforce this point he gave the example of positive integers, representing land 
gained from the sea, with negative ones representing land lost to it. 

Although Wallis never explicitly used negative and fractional exponents, he laid the 
groundwork for their use by writing “5, whose index is —2,” and “./2, whose index is 4.” 

An expanded Latin version of the Algebra, the De Algebra Tractatus, was issued in 
1693. Among the additions in this work was a brief exposition of Newton’s method of 
uxions, which Wallis hoped “Mr. Newton himself will publish in due time.” Newton’s 
groundbreaking manuscript Methodus Fluxionum did eventually appear, posthumously 
in an English translation, as Method of Fluxions (1736). 

Wallis dedicated the Arithmetica Infiitorum to Oughtred for having rst awakened 
his interest in mathematics. In it, he arithmetizes the geometrical methods of Bonaventura 
Cavalieri (1598-1647), a pupil of Galileo, for determining the area beneath curves of the 
form y = x*, k, a positive integer. The approach involves investigating the limit-sum of 
the kth powers of the rst 7 positive integers. As an illustration, we may consider the 
parabola y = x”. Wallis starts by observing that 


074+ 1? 1 1 1 
——- = -=-+- 
17+12 2 3 6 


0?4+1274+27 5 1 1 


242242? 12 — 37 12 


0? + 1242? + 3? 7 1 1 


434824 183 18 
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OP +174+274+37+4 3 1 1 


24+424+9442442 8 3° 24 
Reasoning by analogy, or “modus inductus” as he liked to call it, Wallis assumed that 


024+ 124274 324.--4+7n? i 1 


m+nmtnttn te tr 3° bn 


(Wallis might very well have employed the ancient Pythagorean relation for the sum of 


squares of integers: 17 + 27+ 3?+---+n? = 2@+UC*t)) The greater the number of 
terms, the closer the ratio on the left side approaches i which was taken as its limiting 
value. 


To nd the area under the parabola y = x? from x = 0 to x =a, Wallis imagined 
the region to be made up of n in nitely narrow vertical rectangles, each of width *. 


D4 


The ratio of the sum of the areas of these rectangles to the area of the circumscribing 
rectangle behaves as 


(2)° + (2) + (yt t (Bl PEP HP Ete? 


i n 


@+e4+a+---+a ~ ptnrtn2+..-.+n2 
Thus, Wallis concluded that 


area OaB — 412422? 4.--4n?2 1 


DST Or OC CU = 5 
area OaBC noon? +n2+n2+---+n? 3 
a result which, in modern notation, is equivalent to the expression 
a2 
do x-dx a 


a 37 


After contenting himself with several particular cases, Wallis was led to infer the general 


formula 
i ‘ qh} 
x“dx = , 
0 k+1 
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where k is a positive integer. Reliance on the questionable principle of continuity, or 
“permanence of form,” allowed him to assert further that the same result held even when 
k is negative (k ~ —1) or fractional. 

By shunning the almost purely geometric methods of his predecessors for more 
arithmetic forms, Wallis was led to results that are nowadays reached by the integral 
calculus. He was apparently unaware that many of his contributions were not new at all, 
only approached from a different point of view. In any case, the Arithmetica Infiitorum 
was one of the most stimulating mathematical works produced up to that time in England: 
well known and widely read, not least by Isaac Newton. Its awkward inductive proofs 
were reshaped, within a few decades, into the more ef cient formalism of the Newton- 
Leibniz calculus. 


The Lucasian Professorship: Barrow and Newton 


Descartes’s vortex theory, which offered a general explanation of the motions of 
the planets in mechanical terms, envisioned physical nature as a machine and man as 
the reasoning mind capable of comprehending it. Descartes began with a program of 
scientic rationalism—‘“There is nothing in my physics that is not in geometry,” he 
wrote to Mersenne—but ended by building what Huygens called a philosophical romance. 
The inconsistency of vortex motion with Kepler’s laws eventually led to a search for a 
different kind of scientic explanation for the fabric of the heavens. It remained for a still 
greater mind, Isaac Newton, to give the scholarly world the synthesis for which it yearned. 
Newton’s Philosophiae Naturalis Principia Mathematica (1687) was the climax of the 
soaring intellectual thought that marked the seventeenth century, the Century of Genius. 
Probably the most momentous scientic treatise ever printed, it aimed, in Newton’s 
words, “to subject the phenomena of Nature to the laws of mathematics.” In realizing 
this ideal, the Principia laid the foundations of modern celestial mechanics, the principles 
of which were to dominate the mathematical physics of the eighteenth and nineteenth 
centuries. 

In the year that Galileo died, on Christmas Day of 1642, Isaac Newton was born in the 
village of Woolsthorpe, in Lincolnshire near Cambridge. His father, a farmer of moderate 
means, had died a few months before Newton was born, so that the responsibility for 
the boy’s upbringing devolved on his mother. Newton was born prematurely and was 
so small and frail at birth that no one thought he would survive beyond a few days. 
But the infant, so undersized that his mother said he could have been tted into a quart 
pot, must soon have thrown off these “symptoms of early death,” for he lived to be 
85 years old. 

Newton’s life can be conveniently divided into three parts, each of which was largely 
con ned to a particular locality. The rst (1643-1669) covers his boyhood in Lincolnshire 
and his undergraduate days; the second (1669-1687), his life as Lucasian professor at 
Cambridge, during which he produced most of his work in mathematics; and the third 
(1687-1727), lasting nearly as long as the other two periods combined, his career in 
London as a highly paid government of cial. 

The rst 18 years of Newton’s life were times of turmoil and torment for his native 
land. In the year of his birth, the Great Civil War broke out between Charles I and 
Parliament. Newton used to tell of the great storm that swept over England on the day 
Cromwell passed away. The force of the gale was so strong that he tried to measure 
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it by rst jumping with the wind, then against it, and comparing the difference in the 
measurements of the longest jumps. In later years, he liked to call this the rst of his 
experiments. 

For one who became so great, Newton showed little sign of the massive talent that 
was to change the current of scientic thought. At age 12, he was sent to a public school in 
the neighboring town of Grantham. He seems to have been relatively inconspicuous in his 
academic work, being reported as “idle” and “inattentive.” The young Newton was apt to 
neglect the prescribed studies to spend most of his time creating mechanical contrivances 
such as sundials, waterwheels, and homemade clocks. After two years at Grantham, he 
was called home at the death of his stepfather to help with the farm, but he showed little 
aptitude or interest in the agricultural pursuits of his forefathers. Fortunately an uncle 
recognized Newton’s quickness, sent him back to school at Grantham, and arranged for 
him to go to the college the uncle had attended, which happened to be Trinity College, 
Cambridge. Newton arrived at Cambridge in 1661, as a subsizar, a student who had some 
of his fees remitted in return for performing various menial duties. 

Cambridge was an excellent place for someone wanting to pursue a law degree, 
and it appears that Newton enrolled with that intention. The seventeenth century was 
close enough to the Middle Ages that medieval modes of thought persisted in education. 
The curriculum was still cast in the old humanistic tradition, with its exclusive diet of 
reading the ancients and their commentators. The main courses for undergraduates were 
logic, rhetoric (the art of literary composition), and moral philosophy. The Latin and 
Greek languages were studied less for their own sake than as ancillary to other subjects. 
Although English had become the prevailing language for teaching and examining, the 
educated person nonetheless required Latin for writing and formal oration; academic 
occasions were adorned with ornate Latin speeches. A knowledge of Greek was essential 
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among scholars, not only because of its use in the New Testament but because it was the 
language of the ancient authorities in philosophy. As for science and mathematics, the 
most signi cant advances took place outside of the universities and barely affected their 
curriculum. Not until the establishment (1663) of the Lucasian professorship, endowed 
by a Mr. Henry Lucas in his will, were undergraduates at Cambridge provided any formal 
instruction in mathematics by the university. King Charles II granted a dispensation that 
would allow the chair to be held by someone not in Holy Orders, and he furthermore 
ordered all undergraduates past the second year to attend its holder’s lectures. 

Newton’s mathematical genius blossomed suddenly and unexpectedly. He probably 
entered Cambridge more backward than most undergraduates, knowing little more than 
the bare rudiments of computation, which he picked up from the elementary arithmetics 
of his day. It is not clear when or how he was rst introduced to advanced mathematics. 
There is a dubious story that Newton’s mathematical awakening came in 1663 when he 
purchased a book on astronomy at a country fair and found that he could not under- 
stand the diagrams in it without knowing trigonometry. With a view to throwing some 
light on the trigonometry, he then secured an English edition of Euclid’s Elements, but 
reportedly abandoned it as “a triing book” and turned to more advanced learning. He 
thereupon managed to master by himself van Schooten’s richly annotated Latin edition 
of Descartes’s Géométrie—which was about eight times the size of the original—and 
passed on to a careful study of John Wallis’s Arithmetica Infiitorum (published in 1656). 
Whatever Newton’s mathematical taste as a young man, there is ample documentation 
that he pored over the texts of Euclid and Descartes, deriving continuous inspiration from 
the latter’s Géométrie. We are told that he always regretted that “he had applied him- 
self to the works of Descartes and other algebraic writers before he had considered the 
Elements of Euclid with that attention which so excellent a writer deserves.” Although 
Newton was a “late bloomer” in mathematics, by 1664 his reading had taken him to the 
frontiers of contemporary mathematical knowledge. The absence of precocity was more 
than counterbalanced by the rapidity with which his formidable talent matured, once 
stimulated. 

The arrival in 1663 of Isaac Barrow (1630-1677) as the rst occupant of the Lucasian 
chair seems both to have inspired Newton’s developing mathematical power and led him 
to adopt an academic career. Barrow, then only 33 years old and an early member of 
the Royal Society, was already regarded as one of the foremost mathematicians of the 
period. His researches on drawing tangents to curves and on determining areas bounded 
by curves had very nearly led him to the invention of the calculus. Barrow had also come 
to mathematics by a circuitous route. As a boy he so plagued his teachers, and was so 
rebellious at home, that his father was heard to pray that if it pleased God to take any 
of his children, he could best spare Isaac. He entered Trinity College in the year 1644, 
received his bachelor’s degree in 1648, and then stayed on as a fellow of the college. 
An excellent scholar in Greek, theology, physics, and astronomy, Barrow edited various 
works of Euclid, Archimedes, and Apollonius. He was nominated by a former teacher for 
the Greek professorship at Cambridge, but because of his staunch loyalist sentiments was 
denied the position. Driven out of Cambridge in 1655 in the Puritan purge, he sold his 
personal library and set out on an adventurous four-year tour of eastern Europe, which 
included ghting off a pirate attack during a sea voyage from Italy to Turkey. 

While living in Constantinople, Barrow developed an interest in the writings 
of the early church fathers. On his return to England in 1660 (which happened to 
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coincide with the restoration of Charles II to the throne), he took Holy Orders and 
was rewarded with the Greek professorship previously denied him. The stipend for this 
post was small, and two years later he augmented it by accepting the appointment as 
professor of geometry at Gresham College, London, a position he held only a short 
time before transferring to the Lucasian chair. As he explained, there were many schol- 
ars who could worthily undertake the duties of the Greek professorship, so he saw 
no reason why he should not retire from the “grammatical mill” and follow his own 
inclination. 

The new chair in mathematics had few obligations connected with it; the occupant 
need merely work during one term of the academic year, lecturing once a week and 
conferring with students twice a week. Barrow, however, instituted a regular series of 
general introductory lectures during his tenure as Lucasian professor. One of those attend- 
ing was Isaac Newton, a student destined to become the greatest mathematician England 
has ever produced. Barrow’s lectures for the years 1664-1666 were published posthu- 
mously in 1683 as Lectiones Mathematicae. However, his Lectiones Opticae (1669) and 
Lectiones Geometricae (1670) were printed almost immediately, with Newton assisting 
in the preparation. 

Barrow’s Lectiones Geometricae presented, in 13 lectures, a collection of theorems 
concerned with drawing tangents to curves and nding lengths of curves and the areas 
bounded by them. His method for determining the tangent to a point P on a curve 
given by the polynomial equation f(x, y) = 0 closely resembles that used in our modern 
calculus textbooks. He observed that the tangent could be obtained if some other point 
on it were known, for instance the point T at which the desired tangent should meet the 
x-axis. For this, Barrow took a point Q on the curve, close to the rst point P, and by 
drawing parallels to the coordinate axes constructed a small right triangle POR (which 
he called the differential triangle). The closer the point Q to the point P, the more nearly 
similar the triangles POR and PTM. Barrow thought it reasonable to take them as similar 
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on the grounds of the “inde nite smallness” of that part of the curve, whence 
TM OR 
MP RP’ 
On setting OR = e and RP = a, it follows that if (x, y) are the coordinates of the point P, 


then Q has coordinates (x — e, y — a). Now substitute these coordinates into the equation 
f(x, y) = 0, and in Barrow’s own words, 


reject all terms in which there is no a or e (for they destroy each other by the nature of the 
curve); reject all terms in which a and e are above the rst power, or are multiplied together 
(for they are no value with the rest, as being in nite ly small). 


The ratio a/e can then be found. Finally, T can be determined by using the length of 
the line segment TM: 


or= om ~ T™ = om - mp () =x-y(-). 


y 


Barrow gave ve examples of this method of the differential triangle, including 
@) 24+y=r, (ii) x+y =rxy, (iii) y=(r —~x)tan o- 
r 


To see the process clearly, it will suf ce to consider the case x* + y> = r>. Since Q is 
taken to be on the curve, its coordinates satisfy the equation, so that 


«-eP+Q-aP=r’, 
or 
x — 3x°e 4 3xe? -—e + y> —3y’at 3ya? -—a@ =r’. 
Because x? + y? =r’, the last-written equation becomes 
—3x7e + 3xe* — 8 — 3y’a+3ya* — a? = 0. 
Discarding the terms that contain powers of a and e beyond the rst, we get 


3x7e + 3y-a = 0, 
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and from this, 


The Lectiones Geometricae was the culmination of the seventeenth-century investi- 
gations leading toward the calculus. Although the tangent method of Barrow resembled 
the process of differentiation, he apparently did not perceive the deeper signi cance. Nor 
was he able to justify why the higher powers of a and e should be neglected in his 
calculations, which for a rigorous foundation can be explained only in terms of limits. 
Thus, we can better describe Barrow as a signal precursor of the differential calculus, 
and Newton as its rst inventor. 

All through the Middle Ages, and right on until the second half of the seventeenth 
century, outbreaks of bubonic plague were a recurrent feature of English life. The years 
1665-1666 are best remembered for the Great Plague, which swept through closely 
packed London, leaving no fewer than 68,500 dead—one in four of the estimated popu- 
lation. In June, 1665, the diarist Samuel Pepys (1633-1703) wrote: 


This day, much against my will, I did in Drury Lane see two or three houses marked with 
a red cross upon the doors, and “Lord have mercy upon us” writ there; which was a sad 
sight to me, being the rst of the kind that, to my remembrance, I ever saw. 


Then, in September, Pepys recorded in his Diary, “I have stayed in the city till about 
7400 died in one week and of them about 6000 of the plague, and little noise heard 
day or night but tolling of bells.” As a precaution, Cambridge University was closed 
for the better part of two years and Newton took refuge at the isolated family farm in 
Woolsthorpe. In 1666 plague-ridden England was shaken by another disaster, the Great 
Fire of London; the smoldering ruins covered 436 acres or approximately half of the 
city. The re, although it destroyed some 13,200 homes and 90 parish churches, did at 
least mitigate the plague. 

While Newton was forced to live in seclusion at home, he began to lay the 
foundations for his future accomplishments in those elds with which his name is 
associated—pure mathematics, optics, and astronomy. During these two “golden years” 
at Woolsthorpe, Newton made three discoveries, each of which by itself would have made 
him an outstanding gure in the history of modern science. The rst was the invention of 
the mathematical method he called uxions, but which today is known as the differential 
calculus; the second was the analysis of white light (sunlight) into lights of different 
colors, separated in the visible spectrum according to their different refrangibilities; the 
third discovery was the conception of the law of universal gravitation. These three dis- 
coveries were made before he was 25 years old. Referring to this period of leisure and 
quiet, Newton later wrote, “All this was in the two plague years of 1665 and 1666, for 
in those days I was in the prime of my age for invention and minded Mathematics and 
Philosophy [physics] more than at any time since.” 

Newton returned to Cambridge early in 1667, the virulence of the plague having 
subsided. In 1668, he was elected a fellow of Trinity College, and a few months there- 
after, was created a master of arts. One of the highest honors in the scientic world, the 
Lucasian professorship, became his a year later, when Barrow was called to London as 
chaplain to Charles II and resigned the chair of mathematics. Barrow proposed his incom- 
parable pupil, Isaac Newton, as his successor. Whether Barrow made way for Newton in 
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recognition of the latter’s superior powers or simply out of the desire to make theology 
his full-time study remains a question, but the effect was the same. Newton accepted 
the appointment and held the chair actively until 1696. (Newton was denied further 
promotions at the university because he was a Unitarian and did not accept the doctrine 
of the Trinity.) As for Barrow, he did no further mathematical work. After Barrow 
became a doctor of divinity, the king appointed him (1672) master, then vice-chancellor, 
of Trinity College, saying that he was giving the position to the best scholar in England. 


Newton’s Golden Years 


On assuming the Lucasian chair, Newton chose optics for his inaugural course of 
lectures, conscious that he would eclipse Barrow’s Lectiones Opticae both in subject 
matter and methodology. Owing to the novelty of the subject and his rigorous treatment 
of it, Newton’s lectures did not attract a large audience. Often nding no one present— 
despite an of cial letter from Charles II con rming the statutory attendance requirement— 
he would leave the lecture hall and return to his private research. By the end of 1669, he 
had worked out the details of his discovery of the heterogeneous character of white light, 
though these were not published until many years later under the title Opticks (1704). 

Newton was led to the study of optics by the imperfections of the contemporary lens 
telescopes, in which chromatic aberration produced colored edges around the images seen. 
Accordingly, in 1668, he designed and constructed the rst re ecting telescope, that is, a 
telescope in which the rays of light from the object viewed are concentrated by means of 
a concave mirror instead of the convex lens of Galileo’s refracting telescope. Knowledge 
of Newton’s telescope was con ned to a limited circle of Cambridge friends. Eventually 
word reached the Royal Society whose fellows, their curiosity aroused, asked Newton to 
send the instrument for inspection. Its appearance at the meeting of January 1672 caused a 
sensation, and Newton suddenly became a celebrated gure. Among those who examined 
the telescope were King Charles II, Robert Hooke, and Christopher Wren (the architect of 
St. Paul’s in London, and at this time Savilian professor of astronomy at Oxford). At the 
same meeting, Newton was elected a fellow of the society. He immediately responded 
by sending the society an article embodying the results of his lectures on optics. Printed 
in the Philosophical Transactions of the Royal Society under the title New Theory about 
Light and Color, this was his_ rst scientic paper to be published. It contained the rst 
formal announcement, beyond the bounds of his Cambridge lecture hall, of the ingenious 
experiment of 1666 in which he demonstrated that white light is composed of the various 
colors of the spectrum. Light from a tiny hole in a shutter had been passed through a 
triangular glass prism. Newton wrote that as he observed the refracted colors on the wall, 
“TI was surprised to see them in oblong form; which according to the received laws of 
refraction, | expected would have been circular.” The band of light varied in color from 
red at the bottom, through orange, yellow, green, and blue, to violet at the top—the order 
of colors in the rainbow. Newton concluded that sunlight, and white light generally, is 
composed of rays of every color and that some colors are more sharply bent, or refracted, 
than others. This was in direct contrast to the current theory that white light was basic, 
whereas colors were due to various mixtures of light and “darkness.” 

Newton’s paper on the nature of light provoked much criticism from those with 
established reputations, particularly from Robert Hooke, a genius second to Newton but 
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to few others. Senior to Newton in age by seven years, and one of the original fellows 
of the Royal Society, Hooke viewed himself as the society’s authoritative spokesman 
on experimental physics. Hooke had recorded in his great work Micrographia (1665) an 
experiment similar to Newton’s with the prism. He felt slighted by Newton’s ignoring 
his priority and joined other members of the society in condemning Newton for asserting 
conclusions that they felt did not necessarily follow from the experiments described. 
Hooke maintained that Newton did not prove that all colors were actually in every ray 
of light before it suffered a refraction. 

Another point of contention concerned the medium through which light is transmit- 
ted. Hooke, recognizing the analogy between light and sound, favored the hypothesis that 
light traveled on the waves of a weightless invisible agent called “lumeniferous ether” 
and argued that this doctrine explained most of the optical phenomena then known. New- 
ton, for his part, claimed that such a theory could not be reconciled with the fact that 
light traveled in a straight line. He proposed that light was composed of a stream of tiny 
particles, or corpuscles, of different sizes (the size corresponding with the color) and 
moving with different velocities. The increasingly acrimonious dispute was carried on 
for four years. The friction with Hooke only conrmed Newton’s tendency to secrecy 
and isolation and dimmed his early enthusiasm for the Royal Society. “I am so perse- 
cuted,” he wrote, “with discussions arising out of my theory of light that I blame my 
own imprudence for parting with so substantial a blessing to run after a shadow.” Having 
neither the time nor the desire to engage in controversy, Newton retired from the public 
world of science. The decision led him to refrain from publishing a general account of 
his optics until after the death of Hooke in 1703. There was even a greater delay in the 
appearance of Newton’s mathematical writings, few of which were voluntarily given to 
the world by himself. 

During the next few years, Newton spent most of his time at optics and mathematics. 
As a consequence of his study of Wallis’s Arithmetica Infiitorum in the winter of 1664— 
1665, he had discovered the general binomial theorem, or expansion of (a + b)", where 
n may be a fractional or a negative exponent. (Except when n is a positive integer, the 
resulting expansion is an innite series.) He rst enunciated the formula, and tried to 
recapture his original train of thought leading to it, 12 years later (1676) in two letters 
written to Henry Oldenburg, the multilingual secretary of the Royal Society. These letters 
were to be translated into Latin and forwarded to Leibniz, who in his early struggles with 
his version of the calculus had asked for information about Newton’s work on in nite 
series. As given in the rst letter to Oldenburg (the Epistola Prior of June 1676), the 
formula, or rule, as Newton called it, was written in the form 


m — 3n 
4n 


m— 


2n 


n= 
3n 


where each of A, B, C, and D denotes the term immediately preceding it; that is, A 
represents P’””/”, B represents (m/n)AQ, and so on. Newton’s letter, and his calculations 
in general, employed negative and fractional exponents, which after this time became a 
universally recognized practice. He wrote, “For as analysts, instead of aa, aaa, etc., are 
accustomed to write a”, a3, etc., so instead of a, Va3, Vc : a5, etc., I write a!/?, a3/?, 
a*/3, and instead of 1/a, 1/aa, 1/a*, I write a~!, a~*, a~>.” 

Wallis had earlier constructed a table of values of what today would be written 
fra —x’)"dx for certain positive integers n, but the evaluation of fa — x?)'/2dx had 
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eluded him. Nonetheless, by a highly elaborate and dif cult analysis, he arrived at a 
remarkable expression for 4/z in the form of an in nite product: 

Aye 1 ee seSa eT 7 

m fla —x2)W2dx  2-4-4-6 


Newton had the insight to change Wallis’s xed upper bound in the integral to a free 
variable x (he had no symbol for the integral, but de ned the integral as a limit of a 
sequence of sums), and then to look for a general pattern that seemed to run through a 
set of particular instances. Considering the expansions whose modern equivalents are 
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Newton observed that the rst term of each expression is x, that x increases in odd 
powers, that the algebraic signs of the terms alternate, and that the second terms 5x3, 


2x3, 3x3, and 4x3 are in arithmetical progression. Reasoning by analogy, he assumed 
that the rst two terms of {} (1 — 17)!/?dr should be 
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The successive powers of x revealed for the rst time the binomial character of the 
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& ) and ) for the particular case n = ; in the general formula 
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In other words, Newton found that the binomial coef cient form held for nonintegral 
values, and in particular, he used the coef cient 


(?)- 2-3-3 
k Leo 


From this, Newton went on to deduce by differentiation the expansion 
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and checked its correctness by multiplying the preceding series by itself, term by term, 
to get 1 — x?. 

It is odd that Newton derived the binomial expansion in an integral form, namely, 
a — t’)!/?dt, before he realized that the same form is preserved in (1 — x?)!/? if from 
the expression for a —t*)'/*dt the denominators 1, 3, 5, 7, 9, --- are omitted and 
each exponent lowered by 1. 

Newton was aware that deriving the binomial theorem by looking for structural 
patterns in tabulated cases was not a rigorous method of reasoning. He therefore checked 
that the particular expansion of (1 — x*)~! arising from his formula was equivalent term 
by term with the expansion obtained by “brute-force” long division. Newton never did 
publish his binomial theorem, nor did he prove it in generality. It became widely known 
through private circulation of his tract De Analysi (1669), but no account appeared in a 
printed text until 1685, when Wallis’s Treatise of Algebra quoted extracts of Newton’s 
letters to Oldenburg. 

The well-known anecdote that the problems of gravitation were brought home to 
Newton by the fall of a ripened apple in the orchard at Woolsthorpe seems to rest on 
good authority. A friend of Newton, William Stukeley, a fellow of the Royal Society, 
related: 


Amidst other discourse, he told me, he was just in the same situation [under the shade of 
some apple trees], as when formerly, the notion of gravitation came into his mind. It was 
occasioned by the fall of an apple, as he sat in a contemplative mood. Why should the apple 
always descend perpendicularly to the ground, thought he to himself. Why should it not go 
sideways or upwards, but constantly to the earth’s center? Assuredly, the reason is that the 
earth draws it. 


Newton began to wonder how high gravity extended. Skyward to the moon perhaps? 
If so, could the same force that pulled the apple to the ground be the force that retained 
the moon in its curvilinear orbit? To settle the question, he needed to know the ratio by 
which the gravitational pull decreased with distance from earth. And for this, Kepler’s 
celebrated third law of planetary motion—which stated that the squares of the times of 
revolution of any two planets (including the earth) about the sun are proportional to the 
cubes of their mean distances from the sun—provided valuable information. 

Newton and several other physicists of the day (notably Huygens, Halley, and Wren) 
had all shown independently that if Kepler’s third law were true—and they were not quite 
certain that it was—then the attractive force of gravity would diminish with the square 
of the distance. The argument probably went as follows. For a planet of mass m moving 
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with velocity v in a circular orbit of radius r, the centrifugal force is 


m wv 


z 
But if T is the time of one revolution, then 
2nr 
v= —, 
T 
On substituting this value of v, one gets 
4n?mr 
= 


as an expression for the constant force required to hold the planet in its circular orbit. 
By Kepler’s third law, T?/r? = c, where c is a constant; whence 


(=*) 1 
F= =. 
Cc r2 


Thus, if the earth’s gravity provided the force maintaining the moon in its orbit, this 
force would be inversely proportional to the square of the separating distance. 

During the plague years, Newton carried out a calculation to see whether a force of 
attraction that varied with the inverse square of the distance between two bodies would 
account for the motion of the moon around the earth. Unfortunately, Fate played a trick on 
Newton in this enterprise, and his test was at rst a disappointment. Through Galileo’s 
experiments with falling bodies, afterward repeated more accurately by others, it was 
generally known that the rate of fall at the surface of the earth was 16 - 60° feet in one 
minute. The accepted value for the distance of the moon from the earth was 60r, where r 
was the earth’s radius. Hence, if the inverse-square law held, the gravitational attraction 
the earth exerted on the moon would be 1/607 of the attraction the earth exerted on an 
object on its own surface. The moon would therefore descend a distance of 16 feet in 
one minute toward the center of the earth. 

The next stage in Newton’s calculation, determining the distance over which the 
moon actually fell in one minute toward the earth’s center, required an accurate value of 
the earth’s radius and the mean time of the moon’s revolution around the earth. The value 
of the latter was very nearly 27 days, 7 hours, and 43 minutes, or 39,343 minutes. “Being 
away from books,” Newton took for his calculations the standard local estimate, used by 
seamen and old geographers, that there were 60 miles to a degree of latitude along the 
earth’s equator. This led him to infer that the circumference of the earth was 60 - 360 
miles, so that its radius would be 60 - 360/27, or 3438 miles. The moon’s distance from 
the center of the earth was known to be 60 times the earth’s radius, so the moon’s orbit, 
taken to be circular, would be 607 - 360 miles long. If this were assumed to be the usual 
statute mile, whose length had been de ned in 1593 to be 5280 feet, then the orbit would 
be 607 - 360 - 5280 feet long. (Some historians argue that Newton was more likely to 
have set a mile equal to 5000 feet.) Hence the moon’s velocity in feet per minute at any 
point such as P would be 


607 - 360 - 5280 feet 


: = 173, 930 feet/minute. 
39, 343 minutes 
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Constrained to follow its curved path, the moon would have traveled from P to Q, an arc 
of length 173,930 feet, in one minute. Moreover, in that time it had “fallen” the distance 
PS = RQ toward the earth. Because triangles PSQ and QSP’ are similar, it follows that 


PS. SP’ = (SOY. 


Moon’s orbit 


If PP’ is used as an approximation to SP’, and arc PQ as an approximation to SQ, this 
last relation becomes 


_ (are POY (173, 930)? 
~ PP’ «60. - 60 - 360 - 5280 


or PS = RQ = 13.89 feet (or 14.67 feet if one takes a mile to be 5000 feet). Thus there 
was a serious discrepancy between the two values for the fall of the moon—the 16 feet 
per minute as determined from the inverse-square law of gravity, and on the other hand, 
the 13.89 feet per minute as deduced from the moon’s mean period and the size of the 
orbit. Although Newton said that he found his calculations to “answer pretty nearly,” 
they did not match well enough to be convincing. Somewhat discouraged that the results 
did not answer expectation, he abandoned all work on the gravitational problem, without 
bothering to publish any account of it. 

During the dozen years from 1667 to 1679, when Newton pushed the idea of gravi- 
tation to the back of his mind, others began to duplicate much of his rst work. In 1673 
the great Dutch scientist Christiaan Huygens published his mathematical analysis of the 
motion of the pendulum, Horologium Oscillatorium sive de Motu Pendulorum, a work in 
which he derived the law of centrifugal force for uniform circular motions. As a result, 
the inverse-square law for gravitational attraction was formulated independently by that 
versatile but jealous physicist Robert Hooke, by the astronomer Edmund Halley, by the 
architect and astronomer Christopher Wren, and by Huygens himself. Hooke, Halley and 
Wren—all brilliant young members of the newly founded Royal Society—were greatly 
interested in the problem of gravitation for noncircular orbits but were unable to han- 
dle the mathematics involved. Hooke, acting formally in his new capacity as secretary 
of the society, wrote (1679) a conciliatory letter to Newton, begging him to renew his 
correspondence with its members on scientic matters. Stimulated by Hooke’s opinions 
on the dynamics of planetary motions, Newton’s attention was drawn to his calculations 
on the moon that had lain neglected for twelve years. 


PS feet, 


398 


Chapter 8 The Mechanical World: Descartes and Newton 


The error in Newton’s original computation arose from taking the wrong value for 
the length of a degree of terrestrial latitude, which then yielded an incorrect value for 
the radius of the earth. The reason commonly given for Newton’s careless adoption 
of an inaccurate value is that because he was in the country at the time, he used the 
gure given in the only book at hand, a sailor’s manual. Several years later, Jean 
Picard obtained a very accurate value for the length of a degree on the earth’s meridian; 
his gure of 69.1 miles varied greatly from the 60 miles Newton had used. Picard’s 
determination of the size of the earth was carried out in 1669 and published in his 
Measure de la Terre in 1671. The gures were communicated to the Royal Society in 
1672, but Newton apparently paid no attention to them. 

When Newton nally became acquainted with Picard’s results (it is possible, even 
probable, that this took place in 1684), he began again to calculate how far the moon 
would fall toward the earth in one minute according to the new data. His biographers 
tell us that as he made the nal calculation his emotions overcame him, and his hand 
froze when he foresaw the result. He cried out to his assistant, “Work it out for me, I 
cannot complete it; what do you get?” “Sixteen feet per minute,” was the answer. The 
eagerly expected agreement in the gures conrmed Newton’s conjecture that the earth’s 
gravitation attraction does indeed provide the force maintaining the moon in its orbit. 

At this point, according to his own account, Newton was led to the discovery that if 
a planet were acted on by an attractive force varying according to the inverse-square law, 
then it would have to describe an elliptical orbit, with the attractive force residing in one 
of the ellipse’s foci. Despite the magni cence of this accomplishment, he was silent about 
his discovery, “being upon other matters.” No doubt Newton, with his morbid sensitivity 
to any kind of criticism or questioning, however small, was leery of publicity. But the 
real sticking point seems to have been his trouble in proving that the gravitational pull 
of a spherical body is the same as if the sphere’s whole mass were located at its center. 
(This is true for precisely those systems showing an inverse-square law of attraction.) An 
essential link in his line of reasoning was missing, and it apparently was not produced 
by Newton until 1685. 


The Laws of Motion 


The last act of the drama began several years later when Christopher Wren, in a 
sporting humor, made a gentlemanly wager with Hooke and Halley. In January 1684, 
Wren offered the prize of a book, up to the cost of 40 shillings, if either of his two friends 
could deduce within two months’ time the orbit of a planet acting under an inverse- 
square law of force. Halley failed to do so. Hooke claimed “that upon that principle [the 
inverse-square law] all the laws of the celestial motions were to be demonstrated,” and 
that he himself had done so; but he made an excuse for not putting forward his alleged 
explanation just then—and it was never forthcoming. In August of that year, the young 
and energetic Halley set out for Cambridge (or as he says, did himself “the honour to 
visit Newton’’) to see what suggestions Newton had to offer. Halley put the question to 
him: What path would the planets describe about the sun if they are attracted by a force 
that varies inversely as the square of the distance? Newton answered immediately, “An 
ellipse.” The amazed and overjoyed Halley asked him how he knew. “Why,” replied 
Newton, “I have calculated it.”” Somewhat characteristically, Newton was unable to nd 
the notes of the demonstration that he had effected some years earlier but promised to 
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reconstruct the solution and send it to Halley. This was accordingly communicated to 
Halley in November 1684. 

The reworking of the old problem seems to have aroused Newton’s interest in the 
whole question of planetary motion. He realized that his calculation could hardly stand by 
itself and therefore worked out enough new material to serve as a course of nine lectures 
on celestial mechanics. These were delivered, under the title De Motu Corporum (On the 
Motion of Bodies), when Cambridge opened in the autumn. Halley, on receiving New- 
ton’s promised demonstration, was excited enough to pay a second visit to Cambridge 
to persuade him to make his work public. On this trip, he studied Newton’s manuscript 
lecture notes and realized their immense importance. Halley begged that the results might 
be published, but had to be content with a pledge that they would be forwarded to the 
Royal Society to secure priority. Halley then reported to the body that he had lately seen 
Newton at Cambridge, and that there Newton had shown him a curious treatise, De Motu, 
promising to send it to the society to be entered on their register. The Royal Society not 
only published its Philosophical Transactions but from time to time undertook to bring 
out meritorious scientic works. Thus the society authorized the printing of De Motu 
Corporum and appointed Halley “to put Newton in mind of his promise.” 

Newton threw himself into the task of devising a rigorous mathematical formulation 
of the whole system of planetary motion, a formulation dependent only on his law of 
universal gravitation. Early in 1685, he was nally able to prove that the gravitational 
attraction exerted by a solid sphere, whose density varies only with distance to the center, 
acts as though the sphere’s mass were concentrated at its center. With the essential link 
in the argument then complete, it became possible for Newton to elucidate a whole series 
of cosmic movements that had heretofore been enigmatic. Under the personal encourage- 
ment of Halley, who kept in constant touch with the progress of the work, De Motu Corpo- 
rum broadened in scope to the bulky Philosophiae Naturalis Principia Mathematica [The 
Mathematical Principles of Natural Philosophy]. At the time the Principia was published, 
natural philosophy meant science in general and physics and astronomy in particular. 

Newton worked at its composition with a speed that was little short of fanatical; 
by his own testimony, “The Book of Principles was writ in about 17 or 18 months.” 
In March 1686, he duly presented the manuscript of Book I, De Motu Corporum [The 
Motion of Bodies] of the Principia to the Royal Society. Hooke at once came forth 
with the claim that he had been rst in discovering the inverse-square law of gravitation 
and had been the prime mover in the whole series of results in the Principia. Hooke 
had no doubt surmised that the motions of the planets implied an inverse-square law, but 
because he had been unable to verify that such a law accounted for elliptical orbits he had 
little claim to a scientic discovery. Halley acted as peacemaker and convinced Newton 
that he should soothe Hooke by inserting in a scholium a suitable acknowledgment, in 
which he should state that the inverse-square law “obtains in the celestial bodies, as Sir 
Christopher Wren, Dr. Hooke, and Dr. Halley have severally observed.” 

Newton was so annoyed by Hooke’s galling priority claims that when he announced 
to Halley (June 1687) that Book II was completed, he added that he was ready to eliminate 
the proposed third book: 


The third I now design to suppress. Philosophy is such an impertinently litigious lady that 
a man had as good be engaged in lawsuits as have to do with her. I found her so formerly, 
and now I can no sooner come near her again, but she gives me warning. 
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Edmund Halley 
(1656-1742) 


(Extract taken from A History of Science, Technology 
and Philosophy in the 16th and 17th Centuries, by A. 
Wolf. Reproduced by kind permission of Unwin Hyman 
Ltd.) 


Fortunately for the scientic world, Halley’s persuasive powers convinced the irritated 
author that he should continue. 

The Royal Society went virtually bankrupt when one of its books, Willoughby’s 
History of Fishes, failed to sell as expected. And although the society wanted to pay 
for the publication of the Principia, it could not enter on any fresh printing expenses. 
Rather than let the work remain unprinted, Halley agreed to defray the costs entirely out 
of his own pocket, no slight burden for a man of meager means. (In return, the society 
voted to give Halley 50 copies of the unsalable History of Fishes.) In September 1687, 
the complete Principia was published, in Latin, under the imprint of the Royal Society 
and its then president, Samuel Pepys, although it is to be doubted that the pompous and 
prosy diarist could have understood as much as a single sentence of it. 

If it were not for Halley, there might never have been a Principia. Not only did he fur- 
nish the funds for its prompt issue and act as a conciliator between Newton and the Royal 
Society, but he set aside his own research to keep an editorial eye on what is regarded 
as the greatest scientic work of all time. Halley gathered the necessary astronomical 
data, corrected obscurities in the text, and superintended the illustration and printing. 
The mathematician Augustus De Morgan (1806-1871) wrote in his Gallery of British 
Worthies of Halley’s efforts, “But for him, in all human probability, the work would 
never have been thought of, nor when thought of written, nor when written printed.” 
Newton generously acknowledges in the preface of the Principia that it was Halley who 
had persuaded him to deal with the subject and communicate his results to the Royal 
Society. Halley on his part pre xed a 48-line ode to the Principia eulogizing the dis- 
coveries therein. Certain verses were constantly quoted for their appropriate praise of 
Newton: 


Then ye who now on heavenly nectar fare, 
Come celebrate with me in song the name 
Of Newton, to the Muses dear; for he 
Unlocked the hidden treasuries of the Truth: 
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So richly through his mind had Phoebus cast 
The radiance of his own divinity. 
Nearer the gods no mortal may approach. 


Newton’s great work shows by its very title that it was intended to be a rebuttal to 
Descartes’s theory of vortices. It is the Mathematical Principles of Natural Philosophy 
as opposed to Descartes’s Principles of Philosophy—principles that Newton held to be 
thoroughly unreliable. Legend has it that Newton repeatedly wrote the pencil note “error” 
in the margin of his copy of Descartes’s book; and tired of having to write the same note 
again and again, he threw the book away never to reread it. 

Newton’s Principia tried to explain all the motions of the heavens according to the 
law of universal gravitation. The concept of gravitational attraction was introduced long 
before Newton’s time. For those who took the geocentric view, wherein the earth was 
thought to occupy a special position at the center of the universe, it was natural to believe 
that gravity was associated with the earth alone. When this ceased to make sense in a 
Copernican universe, gravity came to be regarded as an attribute of any large material 
body, and was spoken of as the tendency by which various celestial bodies tend to unite 
and come together. Newton’s master stroke lay in generalizing this to a law of universal 
gravitation, wherein every particle of matter, however small, attracts every other particle. 
In its modern form, the principle states: 


Any two material particles attract each other with a force varying directly with the product 
of their masses and inversely with the square of the distance between them. 


Nowhere in the Principia is the law found in these words, but different parts of it exist 
in different passages. In one place, Newton wrote “There is a power of gravity tending 
to all bodies, proportional to the several quantities of matter which they contain.” And in 
another, he stated that “gravity --- operates --- according to the quantity of solid matter 
which they contain --- decreasing always with the inverse square of the distances.” All 
this is perhaps more understandable in terms of symbols. What Newton had conjectured 
is that two particles attract each other with a force given by the formula 


Mm 


heer a 


where M and m are the masses of the particles, d is the distance between them, and G 
is a constant called the constant of gravitation. 

Although the results were rst found by means of the new uxional calculus, Newton 
was careful in the Principia to recast all his demonstrations in the form of classical 
Greek geometry with an almost complete lack of analytical calculations. He probably 
felt that the time-honored approach provided convincing arguments in a language other 
mathematicians and astronomers would understand. The uxional calculus was then un- 
published, and had Newton used it to arrive at results that were opposed to many of 
the theories prevalent at the time (such as Descartes’s vortex theory of the universe), 
the controversy about the truth of his ndings would have been complicated by disputes 
over the validity of the methods. Because the printed pages of the Principia have the 
appearance of Greek geometry, it is to be expected that there are few references to the 
“new analysis” through which Newton claimed to have discovered the propositions; in 
fact, the word fixion does not appear at any place in the work except in one lemma in 
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which Newton appears to have forgotten to change over from the initial analysis to the 
geometric style of presentation. 

The Principia consists of three books (containing 53, 42, and 48 propositions, re- 
spectively) as well as 25 pages of introductory matter. In the prefatory section, Newton 
de ned such concepts as mass, inertia, momentum, and centripetal force; and he laid 
down the three famous “Axioms or Laws of Motion,” which had to precede the mathe- 
matical propositions. These laws are, in his own words: 


1. Every body continues in its state of rest, or of uniform motion in a straight line, 
unless it is compelled to change that state by impressed forces. 


2. The change of motion [rate of change of momentum] is proportional to the 
impressed force and takes place in the direction of the straight line in which the 
force is impressed. 


3. To every action there is always an opposed and equal reaction. 


The rst two laws are deductions from the historical experiments of Galileo, but the 
principle expressed in the third law is one Newton himself introduced. 

Book I, containing the rst of two parts of De Motu Corporum, is a mathematical 
treatment of the laws of motion under the in uence of impressed forces, there being no 
resisting medium; it consists of what we should today call theoretical mechanics. The 
book is written in generality, and in the earlier propositions a geometrical point-mass 
plays the role of a physical body. Newton’s apparent aim was to develop the subject in 
such a way that he could apply it to the motion of comets, or planets and their satellites, 
in Book III. The outstanding result of the book was the proof that if a body describes an 
elliptical path (or for that matter, any path that is a conic section) under the in uence of 
an attractive force at one focus, then the force must vary inversely with the square of the 
body’s distance from the focus; conversely, if the attractive force varies inversely with the 
square of distance, then the orbit describes a conic section, with the center of attraction at a 
focus of the conic. Turning to results on attraction of spheres, Newton showed that if every 
particle attracts every other particle according to an inverse-square law, then the whole 
mass of a uniformly dense spherical body can be considered concentrated at its center. 

The second book of the Principia, De Motu Corporum Liber Secundus, deals with 
the motion of bodies—particularly of pendulums—in resisting media such as air and 
water. The effect of the resistance was regarded as proportional to the body’s velocity, 
to the square of its velocity, or to both combined. The book contains the mathematics of 
uid dynamics, in a wide sense of the term: calculation of the density and compression of 
gases and liquids, the rst printed analysis of wave motion ina uid, and an examination 
of the ow of liquids through ori ces. One of Newton’s motives in the book was to show 
mathematically that Descartes’s theory of vortices, with its set of whirlpools in which 
each planet was supposedly held, was dynamically unsound and hence not a possible 
explanation of celestial motion. He demonstrated that a vortex would impart to a planet 
a motion that could not be reconciled with Kepler’s laws (“it is manifest that the planets 
are not carried around in corporeal vortices’’). 

Book III, which bears the special title De Systemate Mundi, was the crowning 
achievement of Newton’s work. It contains the application of the general theory de- 
veloped in Book I to the solar system; many important phenomena of motion, terrestrial 
as well as celestial, are explained by the law of universal gravitation. By reference to 
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gravitational principles, Newton could compare the ratios of the masses and mean densi- 
ties of the sun, the earth, and any planet that had a satellite. He then made the remarkable 
surmise that the earth had a mean density between 5 and 6 times that of water (current 
calculations put the gure at 5.517). He showed that the earth is not exactly spherical 
as had been supposed since ancient times, and calculated the oblateness, or “ attening,” 
at the poles; his numerical estimate of the ellipticity of the earth as an oblate spheroid 
was — (today’s gure is about sa). Newton ascribed the tides to the solar and lunar 
attraction for the seas, demonstrating that very high tides would occur at new and full 
moon when the gravitational pull of the sun and moon act together, and low tides at 
quarters when the pulls tend to neutralize each other. Finally, he investigated at some 
length the behavior of comets. Whereas Kepler had supposed that comets run through 
space along straight lines, Newton, by treating them as planets with highly eccentric 
orbits, deduced that comets must describe conic sections with the sun in one focus. This 
suggests that some comets, instead of being seen only once, travel on long elliptical 
paths, a suggestion Newton’s friend Halley seized on. 

The unpredictability of comets had always made them objects of wonder. Ancient 
chroniclers saw the ery rush of comets as harbingers of evil—war, plague, earthquake— 
a natural view, because some calamitous event was bound to be taking place in some 
country where the comet was visible. A nal blow to the superstitious fear of comets was 
given by Halley, who showed that their sudden appearances and disappearances were in 
accord with gravitational law. Using Newton’s methods, Halley calculated the orbit of 
the Great Comet of 1682 and found it nearly identical with the orbits of a comet that had 
attracted Kepler’s attention in 1607 and of a bright comet observed by Peter Apian in 
1531. He rightly concluded that they were all appearances of a single comet describing 
ahighly attened elliptical orbit about the sun, within a period of some 75.5 years. This 
led Halley to make the daring prediction, which could not be ful lled until after his 
death, that the comet would again be seen in 1759. Its arrival, exactly as forecast, was an 
independent con rmation of Newton’s mathematical astronomy. “Halley’s Comet” has 
made subsequent returns to our sky, at the same intervals, in 1835, 1910 and 1986, and 
its previous visits have been traced back to dates before the Christian era. One famous 
appearance was before the Battle of Hastings in 1066, and it was interpreted by King 
Harold as indicating his defeat by William the Conqueror. In the Bayeux Tapestry, the 
comet appears on one of the corners with the inscription: “Here they marvel at a star.” 

The rst edition of the Principia was very small, probably numbering no more than 
300 copies, which were fairly soon dispersed by sale or gift. By 1691, it was impossible 
to get a copy of the work, and there was already talk of a new edition, possibly to 
be supervised by some younger mathematician. But not until 1709 was Newton able to 
enlist the services of someone. This was Roger Cotes (1682-1716), a brilliant Cambridge 
mathematician who was able to criticize and correct the Principia with con dence. Of 
this second edition, 750 copies appeared from the press in 1713. No fundamental work 
in the physical sciences met with such a demand as the Principia did. It is reported 
that one reader, unable to secure a hardbound copy for himself, transcribed the entire 
work longhand. This should not be interpreted as an indication of general enthusiasm, 
or even that there were many people in England capable of understanding the contents. 
Newton told a friend that “to avoid being baited by little smatterers in mathematics,” he 
purposely made the Principia abstruse. He was con dent that competent mathematicians 
would comprehend his concise reasoning and accept his conclusions. Still, there was a 
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wide public interest in the Principia that led to a ood of popularizations of it in the 
vernacular. Within the next century, there had appeared 40 of these in English, 17 in 
French, 11 in Latin, 3 in German, and | each in Italian and Portuguese. Eighteen editions 
of the Principia itself were published by 1789. 

The immediate continental reception to Newton’s principles of natural philosophy 
did not measure up to the admiration of the English. The scarcity of rst-edition copies of 
the Principia naturally did not favor wide dissemination of Newton’s ideas. Moreover, 
in France, where the Cartesian idea of vortices for explaining planetary motion was 
uncritically accepted, Newton’s constitution of the universe was dismissed on the ground 
that it rested on assuming an “action at a distance” without intervention of material 
substance; that is, on some unintelligible, even occult, quantity of the sort natural science 
had so recently rejected. Even so able a man as Huygens regarded the idea of gravity 
as “absurd,” and Leibniz wrote, “I do not see how he [Newton] conceived gravity or 
attraction.” Newton would have dearly liked to give a mechanical explanation, but the 
remote force of gravity was a mystery to him also. In the famous General Scholium 
appended to the second edition of the Principia to refute criticism that he had introduced 
occult quantities, Newton was still reluctant to commit himself: “I have not yet been able 
to deduce the reason of these properties of gravity from phenomena, and I do not invent 
hypotheses.” (This was not meant as a condemnation of all hypotheses in science, but only 
of those of speculative character that could not be proved or disproved by mathematically 
treated experiment.) “It is enough,” he added with a certain arrogance, “to say that gravity 
really does exist and act according to the laws which we have explained.” 

Newton had shown that the motions of the planetary bodies were not capricious but 
subject to precise calculation. The phenomena of the universe were under “natural law,” 
to be interpreted not by the Church, but by reason. These feelings were summed up in 
Alexander Pope’s lines: 


Nature and Nature’s law lay hid in night, 
God said, “Let Newton be,” and all was light. 


God had let others be also, but Newton reaped the fame, for the coming age of science 
was called the Age of Newton. 


Later Years: Appointment to the Mint 


Whereas Newton’s physical laws led some to view the universe as a self-suf cient 
mechanism, a gigantic piece of clockwork machinery, Newton himself insisted that the 
solar system was not a godless creation. This admirable ordering of the universe was 
precisely what conrmed in Newton his belief in a divine controller. The feeling that he 
was the man destined to unveil the ultimate truth about God’s creation led him to try his 
hand at theology and biblical studies. In his later years, he wrote at length on prophecies 
and predictions of future events. Newton’s main manuscript on religion, Observations 
Upon the Prophecies of Daniel, and the Apocalypse of John, was published (1733) after 
his death. What he called his mystic reveries probably consumed as much time and effort 
as the Principia did. 

Although by Newton’s death in 1727 Newtonian attraction was still regarded as 
obscure and inadmissible, there was a growing challenge to Cartesian attempts to explain 
the details of planetary motion. Not until the late 1730s, when Voltaire took up his 
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worshipful defense of Newton in his Letters Concerning the English Nation, did Newton’s 
gravitational view of the cosmic scheme begin to have advocates in France. Voltaire 
convinced his mistress Emilie Breteuil, Marquise du Chatelet, a competent scientist and 
mathematician, that she should prepare a French translation of the Principia, enriched 
by a commentary on the work. This account, which did not appear in print until 1759, 
remains the sole French translation of Newton’s masterpiece. Half a century elapsed 
before opposition to Newton’s views began to crumble and the leading mathematicians 
of Europe took up the task of perfecting the structure of the Newtonian synthesis. 
When Newton published the Principia, he reached the zenith of his genius. Whatever 
else he might do would pale in comparison with what had been achieved in that period of 
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unparalleled concentration in the middle of the 1680s. As it was, Newton lived another 
40 years, but a continually increasing portion of his time was devoted to public business 
of one sort or another and almost none to scientic work. The severe mental exertion of 
composing the Principia took its toll. Newton began to suffer from insomnia and lack 
of appetite, and by 1692 his mental health had deteriorated to the point where he was 
aficted with some sort of nervous illness. 

There have been few cases of such a dramatic rechanneling of the energies of a 
scientist as befell Newton on his recovery. A royal appointment (1696) as warden, and 
subsequently master, of the British mint made it possible for Newton effectively to sever 
his academic ties with Cambridge and pass the remaining years of his life in London. The 
post at the mint was not a sinecure, although it had been the king’s intention to provide 
one. At the time Newton became master, the general debasement of the currency had 
become a national calamity. Counterfeiting and adulteration of coins were so prevalent 
that pieces of full value were rare. Newton supervised the minting of new coinage, using 
the knowledge obtained through his numerous experiments with alloys to establish new 
standards of purity and weight. An important part of his duties involved organizing a 
campaign against counterfeiters and clippers, setting up a network of informers to track 
them down, and interrogating the chief offenders. Counterfeiting was stopped by raising 
the relief designs so high that coins could be struck by only the mightiest of presses, 
whereas grooving the rims of coins prevented undetected clipping. 

The news of Newton’s mental crisis led some to suppose that his intellectual powers 
had vanished forever. The effort spent on the Principia may have overstrained his mind 
and diminished his passion for scientic research, but his genius for solving mathematical 
problems with incredible speed was by no means impaired, as the Continental mathemati- 
cians learned when they dared to test him. In 1697, for example, John Bernoulli proposed 
the problem of nding the curve along which an object would slide from one point to 
another not directly beneath it in the shortest time. Newton gave the solution within a 
day, and when he permitted it to be published anonymously, Bernoulli recognized the 
author from the sheer power and originality of the work, exclaiming, “The lion is known 
by its paw.” On another occasion (1716), Leibniz, who was stung by the accusation of 
plagiarism from friends of Newton, sent a problem “for the purpose of feeling the pulse 
of the English analysts.” The challenge, which involved nding the orthogonal trajecto- 
ries of the family of hyperbolas having the same vertices, was presumably meant to be 
a severe test for the most experienced mathematicians. It is reported that Newton, aged 
64, received the problem around ve o’clock after returning home weary with the day’s 
business at the mint, and solved it before going to bed. 

During the latter years of his life, Newton began to take more interest in the Royal 
Society. Without opposition, he was elected its president in 1703, an ofce to which he 
was reelected annually until the close of his life more than 24 years later. Newton became 
something of a national gure, one of the principal sights of London for all visiting 
foreign intellectuals. Queen Anne knighted him, a farmer’s son—the rst scientist so 
honored—in a ceremony at Cambridge in 1705. 

Newton was well past 80 before he began to suffer seriously from the complaints 
of old age. Taken ill while presiding at a meeting of the Royal Society, he died less than 
three weeks later, on March 20, 1727, in his eighty- fth year. Newton’s passing seemed 
to arouse in the British people a national consciousness that so great a scientist had 
been one of their countrymen. His body was interred in Westminster Abbey, where the 
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Extract from Newton’s Principia Mathematica (1687). (By courtesy Editions Culture et 
Civilisation. ) 


greatest of England’s departed lay. Dukes and earls, all fellows of the Royal Society, 
carried his cofn. It was a great occasion. Voltaire, who attended the funeral, was 
moved to tell, “I have seen a professor of mathematics, only because he was great in 
his vocation, buried like a king who had done good to his subjects.” Inscribed in Latin 
on Newton’s magni cent monument are the words: “Let mortals rejoice that there has 
existed such and so great an ornament of the human race.” 

The exalted esteem in which Newton’s genius was held reached almost absurd 
heights in later times. When the queen of Prussia asked Leibniz what he thought of 
Newton, he replied: “Taking mathematicians from the beginning of the world to the time 
when Newton lived, what he had done was much the better half.” Competent critics 
agreed that the Principia, on which the fame of Newton chie y rests, surpassed all other 
scientic works in power and originality. A century later, Lagrange called the Principia 
the greatest production of the human mind, and his contemporary Laplace felt that it 
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was assured a preeminence above all other productions of human genius. With more 
than a touch of envy, Lagrange remarked that because there was only one solar system 
it could be granted to only one man to discover its fundamental laws: “Newton was the 
greatest genius that ever existed, and the most fortunate, for we cannot nd more than 
once a system of the world to establish.” 

Beside these magni cent eulogies, it is pleasant to consider Newton’s evaluation of 
his own work. Shortly before his death, he told some friends: 


I do not know what I may appear to the world; but to myself I seem to have been only like 
a boy playing on the seashore, and diverting myself in now and then nding a smoother 
pebble or prettier shell than ordinary, while the great ocean of truth lay all undiscovered 
before me. 


Newton was aware that he had traveled along a broad highway prepared for him by others. 
His well-known statement, “If I have seen farther than others, it is because I have stood 
on the shoulders of giants,” shows his appreciation of the cumulative and progressive 
growth of science. Yet it took the genius of Newton to nd the key and show, once and 
for all, just how Nature is regulated by mathematical law. With the Principia, all the 
pieces of the puzzle suddenly fell into place, all obscurities were made plain. 
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The invention of the calculus was one of the great 


Gottfried Leibniz: The intellectual achievements of the 1600s. By one of 
Calculus Controversy those curious coincidences of mathematical history 


not one, but two men devised the idea—and al- 
The Early Work of Leibniz most simultaneously. The methods of the calculus 

of Newton in England and Leibniz on the Continent 
were so similar that the question whether Leibniz borrowed the crucial concepts from 
Newton or discovered them independently gave rise to a long and bitter controversy. 
The tactics of the principal protagonists were so unworthy of these two titans, and the 
violence of the accusations and counteraccusations so injurious, that neither escaped 
with his reputation untarnished. When inferences of plagiarism became public charges, 
a committee of the Royal Society, called to adjudicate this most notorious of scienti c 
disputes, found—not surprisingly—in favor of the society’s own president against one 
of its oldest foreign members. 

Gottfried Wilhelm Leibniz (1646-1716) was born in the university town of Leipzig 
some two years before the Peace of Westphalia put an end to the Thirty Years’ War. 
His father, a jurist and professor of moral philosophy at the university, died when the 
boy was 6 years old. As a result, the young Leibniz was left almost without direction 
in his studies. The boy’s world was the world of books. A precocious child, he taught 
himself Latin from an illustrated copy of Livy’s history of Rome when he was about 8, 
and had begun the study of Greek by the time he was 12. This led to his being given 
unhampered access to his father’s library, which had previously been kept under lock 
and key. Here, according to his own testimony, he became acquainted with a wide range 
of classical writers. Leibniz wrote in later life: “I began to think when I was very young; 
and before I was_ fteen I used to go for long walks by myself in the woods, comparing 
and contrasting the principles of Aristotle with those of Democritus.” 

In the fall of 1661, the same date that Newton entered Cambridge, Leibniz became a 
student at the university of his native city, Leipzig. Only 15 at the time, he was regarded 
as something of a prodigy and soon outstripped all his contemporaries. The education 
Leibniz received at Leipzig followed traditional, conservative lines, with its emphasis 
on religion (orthodox Lutheran doctrine) and philosophy. Arithmetic was taught at an 
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Gottfried Wilhelm Leibniz 
(1646-1716) 


(Smithsonian Institution.) 


elementary level, prescribed by the textbooks of the German Jesuit Christoph Clavius 
(1537-1612). And although lectures were given on Euclid’s Elements, Leibniz did not 
pay them sufcient attention. Van Schooten’s edition of Descartes’s Géométrie, which 
he tried to read on his own, seemed much too complicated for him. Hardly 17 years old, 
Leibniz graduated from Leipzig in 1663 after defending a thesis on a point of philosophy. 
He passed the summer term at the University of Jena, where he attended mathematics 
lectures, then returned to Leipzig to concentrate on legal studies; he earned his master’s 
degree the following year. Leibniz was given a teaching position in the philosophical 
faculty at Leipzig, for which he qualied by writing Disputatio Arithmetica de Com- 
plexionibus; this work, which was expanded into Ars Combinatoria (1666), extensively 
develops the theory of permutations and combinations for the purpose of making logical 
deductions. Leibniz was ignorant of the mathematical literature, so the Ars Combinatoria 
contained little that was new. Leibniz later called it the work of a young man just out of 
school. Nonetheless, it is a work of great interest in that it discusses the establishment of 
a new mathematics-like language of reasoning (characteristica universalis) in which all 
scientic concepts could be formed by combinations from a “basic alphabet of human 
thoughts.” Moreover, Leibniz suggested that a calculus of reasoning could be devised that 
would provide an automatic method of solution for all problems that could be expressed 
in his scientic language. 

In 1666, Leibniz applied for the degree of doctor of law but was refused by the 
Leipzig faculty on the threadbare grounds that he was too young; a more likely explana- 
tion is that they were jealous of his ability. Disgusted, Leibniz quit his native city, never 
to return except in passing, in order to enroll at the University of Altdorf (Nuremberg). 
The following year, he received a doctorate from Altdorf, probably using a thesis he had 
already completed in Leipzig. His dissertation made such a favorable impression that he 
was offered a professorship on the strength of it; but Leibniz declined, having, as he said, 
“very different things in view.” He chose instead to be near the center of political power. 
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An essay that he wrote on the study of law gained him a post (1667) in the service of the 
archbishop-elector of Mainz, where he was charged with reforming the current statutes. 
Except for a four-year sojourn in Paris, the rest of Leibniz’s life was spent in residence 
at the courts of Mainz and Hanover—at Mainz until 1672 and at Hanover from 1676 
until his death in 1716. 

Leibniz’s_ rst problem was to acquaint himself with the entangled legal and political 
position of the German states. Throughout the seventeenth century, Europe had been in- 
volved in a political revolution no less signi cant than the scientic one. While England 
was undergoing the Puritan Revolution and Civil War, the Continent was shattered by the 
terrible Thirty Years’ War. What began in 1618 as a rebellion by a group of Bohemian 
leaders against the House of Hapsburg was followed by a bitter war of reprisal that spread 
rapidly to involve all of Germany, then at various times Scandinavia, the Netherlands, 
France, Spain, Hungary, Italy, Poland, Russia, and England. For 30 consecutive years 
Germany became the battle eld of Europe, until the Peace of Westphalia (1648) nally 
ended the Continent’s most destructive war until the twentieth century. From 1648 on- 
ward, the medieval Holy Roman Empire was little more than a polite anachronism. There 
remained only the titles and trappings and a loosely knit union of 360 states (or nearly 
2000 enclaves if all the imperial knights and their estates were counted) nominally held 
together by a mere shadow of an emperor in Vienna. The Peace of Westphalia granted 
the various German states the mischievous privilege of maintaining their own armies 
and entering into independent alliances for their own welfare and protection. It was this 
bargaining power that gave to the art of diplomacy its current importance. 

With the disintegration of the empire, France, now unied, became the center of 
European power and the symbol of its culture. French now became the language of polite 
society and of diplomacy everywhere. Louis XIV, the Sun King, who came into his royal 
inheritance in 1661, was the true “arbiter of Europe.” At the time Leibniz was appointed 
legal counsel at Mainz, both Germany and Holland were threatened by the aggressive 
policy of France. The young lawyer conceived (1670) the idea of diverting Louis’s 
attention from Germany by proposing to him—in a famous memorandum, Consilium 
Aegyptiacum—that a worthier objective for a Christian king would be to seize Egypt 
from the Turks, who were then massing their forces at the eastern gate of Europe. The 
elector agreed to send the author himself to Paris to explain the details of the plan. 
(This bears a curious resemblance to a plan Napolean devised 128 years later, and it 
is sometimes supposed that he read Leibniz’s memorandum.) In March 1672, Leibniz 
left Mainz for Paris but arrived there too late. Louis had already decided to invade the 
“nation of shwives and tradesmen,” as he called Holland. One of the king’s ministers 
observed that crusades had gone out of style. 

Leibniz’s journey to Paris turned into a long stay, lasting from 1672 to 1676. Al- 
though the political ends for which it was undertaken were not realized, it was perhaps 
the most important event in the future mathematician’s life. Paris was the intellectual 
capital of Europe, and Leibniz came into contact with many scholars. Most important was 
the great Dutch scientist Christiaan Huygens, who lived in Paris from 1666 until 1681. 
When Newton went to Cambridge, he studied under Isaac Barrow, one of the foremost 
mathematicians of the period; but the mathematical instruction Leibniz had received at 
Leipzig was far from adequate. Leibniz himself said that before going to Paris he was 
“not even a novice in mathematics.” So far as Leibniz can be called the mathematics pupil 
of anyone, he was a pupil of Huygens. Huygens recognized the brilliance of the studious 
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young German, and realizing that he was lacking in mathematical training, undertook to 
guide his_ rst investigations. 

During one of his early meetings with Huygens, Leibniz asserted that he could nd 
the sum of any in nite series whose terms were formed by some rule (provided only that 
the series should converge). Huygens, wishing to make an immediate test of the young 
man, suggested that he try to determine the sum of the reciprocals of the triangular 
numbers: 

1 1 1 1 1 2 
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By the clever device of writing each term as the sum of two others, using 
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Leibniz was able to obtain the sum demanded by Huygens. For 
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Also at this time, Leibniz constructed a working model of a new calculating machine 
that was an improvement on the machine Pascal had already invented. Whereas Pascal’s 
device performed only addition and subtraction, Leibniz’s machine accomplished multi- 
plication and division by repeated addition and subtraction. “It is unworthy of excellent 
men,” wrote Leibniz, “to lose hours like slaves in the labor of calculation.” 

It was impossible for Leibniz to nd much time for mathematical studies just then. 
In 1673, he crossed the channel to England on a diplomatic mission for the elector of 
Mainz to encourage peace negotiations between France and Holland. While in London, 
he made the acquaintance of his fellow countryman Henry Oldenburg, the permanent sec- 
retary of the Royal Society, who introduced him to various mathematicians and scientists 
connected with the society—Pell, Collins, Boyle, and Hooke. Leibniz was unanimously 
elected a member of that illustrious body, owing in part to an exhibition of his calculating 
machine at a meeting of the society and in part to the friendly of ces of Oldenburg. 

The diplomatic mission from Mainz met little success; and after two months’ stay in 
London, Leibniz returned to Paris, where he was able to nd time to pursue his studies 
without hindrance. Huygens was in the midst of publishing his great work, Horologium 
Oscillatorium, which dealt with numerous mechanical problems arising out of pendular 
motion. He gave his young friend a copy as a present, but Leibniz’s mathematical knowl- 
edge was insuf cient to enable him to understand the contents of the book. Looking on 
it as something of a disgrace to be ignorant of such matters, Leibniz proceeded with 
the greatest fervor to work his way through the standard mathematical works of the pe- 
riod. He devoted himself to van Schooten’s two-volume edition of Descartes’s Géométrie 
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(which heretofore had been beyond him), thus, as he said, “entering the house of ge- 
ometry truly as it were by the back door.” Guided by the friendly advice of Huygens, 
he also made himself acquainted with the manuscripts of Pascal. On several occasions, 
Leibniz was to declare that he was led to the invention of calculus more by studying 
Pascal’s writings than anything else. These few years in Paris were the most intensely 
creative period of Leibniz’s life, during which time he grew from a beginner to a mature 
mathematician. 

As the rst fruits of these studies, Leibniz was able to obtain the celebrated alternating 
series that bears his name, 
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Leibniz seems to have found this series in 1673, but the formula was known to the 
young Scottish mathematician James Gregory (1638-1675) in 1671. It is an elegant 
formula for z but the series converges too slowly for computational purposes. Newton 
pointed this out when in a neat counterblast, he sent Leibniz (through Oldenburg) the 
variant expression 


In the midst of the London negotiations, Leibniz’s patron, the elector of Mainz, died 
unexpectedly. Leibniz, hoping to gain a permanent foothold in Paris, sought a seat in 
the Académie des Sciences with a royal pension, but all his efforts failed (not until 1700 
was Leibniz admitted to the Académie, in the same year as Newton). With reluctance, 
Leibniz looked around for a new political of ce. He nally settled on the position of 
counsel and librarian to the Duke of Hanover, a post he was to Il for the remainder of 
his life. 


Leibniz’s Creation of the Calculus 


In the years from 1672 to 1676, spent in Paris, Leibniz’s slowly owering mathemat- 
ical genius matured. (This concentrated period of creativity is reminiscent of Newton’s 
“golden years” 1664-1666 at Woolsthorpe.) During this time, he developed the principal 
features and notation of his version of the calculus. Various methods had been invented 
for determining the tangent lines to certain classes of curves, but as yet nobody had made 
known similar procedures for solving the inverse problem, that is, deriving the equation 
of the curve itself from the properties of its tangents. Leibniz stated the inverse tangent 
problem thus: “To nd the locus of the function, provided the locus which determines 
the subtangent is known.” By the middle of 1673, he had settled down to an exploration 
of this problem, fully recognizing that “almost the whole of the theory of the inverse 
method of tangents is reducible to quadratures [integrations].” 

Because Leibniz was still struggling with the notation for his calculus, it is not 
surprising that these early calculations were clumsy. Either he expressed his results in 
rhetorical form or else used abbreviations, such as “omn.” for the Latin omnia (“all’’) 
to mean “sum.” The letter / was used to symbolize what we should write as dy, the 
“difference” of two neighboring ordinates. In a notable manuscript dated October 29, 
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1675, but never published, Leibniz made his symbolic connection of the direct and 
inverse tangent problems. In this essay Leibniz wrote of a theorem he had obtained by 
a geometrical argument: 


We have a theorem that to me seems admirable, and one that will be of great service to this 
new calculus, namely, 


omn./? 


—_! 
=omn. omn./—, whatever / may be. 
a 


The horizontal overbars were used in place of our parentheses, and the constant a (which 
would be dx) was taken equal to 1. Leibniz remarked, “This is a very ne theorem, and 
one that is not at all obvious.” Immediately afterward, he stated another theorem of the 
same kind, 


omn. x/ = x omn. / — omn. omn. /. 


These equations are historically important, because it was here that Leibniz rst 
introduced the symbol /, an elongated form of the letter S for “sum.” In the middle of 
the paper, he said: 


It will be useful to write f for omn., so that fl = omn. /, or the sum of the /’s. Thus, 


Fajita faesfi-[fi 


Thus, in our notation of the calculus, he has shown that 


s(faynf(fe)er om fesrnnr fom 


Leibniz was not yet using the differential under the integral sign; there exists another 
manuscript, written several weeks thereafter (November 21), in which he improved his 
notation by writing f f(x)dx, the direct ancestor of the modern form. 

Later on, in the same manuscript of October 29, Leibniz explored the contrary 
calculus, perceiving the dual nature of the integration and differentiation processes: 


Given /, and its relation to x, to nd f 1. This is to be obtained from the contrary calculus, 
that is to say, suppose that [J = ya. Let | = ya/d; then just as [ will increase, so d will 
diminish dimensions. But { means a sum, and d a difference. From the given y, we can 
always nd y/d or /, that is, the difference of the y’s. Hence one equation can be transformed 
into another. 


The symbol d was at rst placed by Leibniz in the denominator, probably by analogy 
with the division process; in a paper written on November 1, three days later, he replaced 
y/d by the familiar dy, which then seemed more appropriate to him and which he kept 
in all his future work. 

Leibniz next investigated the basic algorithms of calculus, especially the question 
whether d(xy) was equal to dx dy and whether d(x/y) was equal to dx/dy. In the 
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manuscript of November 11, he concluded that the expressions were not the same, though 
he could not give the true value of each. Ten days later, he correctly determined the prod- 
uct rule, and in July 1677 gave a statement of the quotient rule. In the case of the product, 
Leibniz subtracted xy from (x + dx)(y + dy) and discarded the term dx dy with the re- 
mark that “the omission of the quantity dx dy, which is in nitely small in comparison 
with the rest, for it is supposed that dx and dy are innitely small, will leave x dy + 
ydx.” All this is, of course, without sound justi cation. To nd the differential of the 
quotient z = x/y, Leibniz set x = zy and used the product rule: dx = zdy + y dz leads to 


_dx—zdy  dx—(x/y)dy — ydx —xdy 


dz 
y y y? 


By November 1676, he was also able to state the rule d(x") =nx"~' dx for integral 
and fractional values of n. 

Leibniz’s investigation into the calculus was based on what he called the “char- 
acteristic triangle.” Isaac Barrow had used the characteristic triangle in England, but 
according to Leibniz’s own account, the inspiration for its use came from reading the 
work of Pascal. Referring to himself in the third person, Leibniz wrote that on running 
across a gure in Pascal “a light suddenly burst upon him.” For a curve y = f(x), the 
characteristic triangle is the right triangle whose sides consist of PO(= dx), OR(= dy), 
and PR, part of the tangent to the curve at a typical point P. Leibniz noted that the 
characteristic triangle was similar to the triangle PVW formed by the normal n, the 
subnormal o, and the ordinate y at the point of contact, and also similar to the triangle 
PVU formed by the tangent ft, the subtangent s, and the ordinate y. From the similarity 
of the characteristic triangle and the triangle PVW, he got 


d d. 
i ate or o dx = ydy. 


Regarding dx and dy as in nitely small and summing up, Leibniz came to the result 


Joa fray 


To solve a de nite problem (November 11), he supposed the subnormal to be inversely 
proportional to the ordinate—that is, o = a*/y—and found that y?/3 = a?x; and hence 
the curve with the given property was the cubic parabola. 

For another application, Leibniz used the fact that when the characteristic triangle 
was very small the chord PR could be considered of the same length as the length ds of 
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the curve. Because the characteristic triangle was similar to triangle PVW, 
—_=— or yds =ndx. 


Summation then gave Leibniz 


[vam [na 


a formula for the surface of revolution obtained by rotating the original curve about the 
X-axis. 

In this memorable series of manuscripts, the symbols dx (rst as x/d) and [ f(x)dx 
of Leibniz’s calculus came into being. There were few if any new discoveries here— 
Newton’s disparaging judgment was that “not a single previously unsolved problem 
was solved”—but a formalism was developed that helped systematize and generalize 
the diverse geometric results of old. His newly contrived symbolism freed calculus of 
its bondage to geometry and allowed Leibniz to achieve many results virtually without 
effort. 

Leibniz gradually elaborated his differential-integral calculus but never actually 
founded it on the limit concept; his differential ratio dy/dx was always thought of as 
a quotient of “differences” and his integral simply as a sum. The rst prominent math- 
ematician to suggest that the theory of limits was fundamental in calculus was Jean 
d’Alembert (1717-1783). D’Alembert wrote most of the mathematical articles in that 
cardinal document of the Enlightenment, the Encyclopédie (28 volumes, 1751-1772), 
and in an article entitled “Différentiel” (volume 4, 1754) said “the differentiation of 
equations consists simply in nding the limits of the ratio of nite differences of two 
variables in the equation.” In other words, he came to the expression of the derivative 
as the limit of a quotient of increments, or as we should write it, 


Unfortunately, d’Alembert’s elaboration of the limit concept itself lacked precision. 
Therefore, a conscientious mathematician of the 1700s would have been no more satis ed 
with this de nition than with currently available interpretations of the derivative. 


Newton’s Fluxional Calculus 


Before Leibniz left Paris in the autumn of 1676, he found himself in possession of the 
rules and notation of his calculus. He suspected, but could not be sure, that Newton had 
developed an equivalent approach, one far more geometrically slanted. This was precisely 
the case. That development had taken place as early as 1665—1666, when Newton was in 
his twenties, during the same period in which he had discovered the binomial theorem. 
The bulk of Newton’s early work on calculus was condensed into a small treatise of 
some 30 crowded pages, covering such things as tangency, curvature, centers of gravity, 
and area. The work, which Newton seems never to have given a de nite title, is known 
in the learned literature as the October 1666 Tract. By seeking a pattern from tabulated 
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values of fod +t)! dt, he was able to show at that time that the area of the rectangular 
hyperbola (x + l)y = | was 


ee oe 
cee CR er Ve ia 
which is the series expansion for the natural logarithm of 1 + x. With boyish enthusiasm, 
Newton demonstrated his newly found numerical facility by calculating this expression, 
for particular values of x, to impractically large numbers of decimal places (as many as 
68 decimal places, using the series through terms involving x”>). 

In mid-1669, Newton came across Nicholas Mercator’s Logarithmotechnia (1668). 
The rst two parts of that book were devoted to a table of common logarithms; the 
third part contained various approximation formulas for logarithms, one of which was 
Newton’s own reduction of log (1 + x) to an in nite series, 


og oF se 


log(i+x)=x 7 +3 q05 ee, 

Feeling crestfallen that Mercator had anticipated him in publication, Newton, spurred by 
a desire to protect his priority in individual topics, hurriedly set to work to write up his 
earlier research in series expansions. The resulting compendium was submitted to Isaac 
Barrow, then Lucasian professor of mathematics, for his approval. By the summer of 
that year, Barrow con ded enthusiastically to the mathematician John Collins, “A friend 
of mine here that hath a very excellent genius to those things, brought me the other day 
some papers, wherein he set down methods of calculating the dimension of magnitudes 
like that of Mr. Mercator concerning the hyperbola, but very general.” These “papers” 
turned out to be the short tract De Analysi per Aequationes Numero Terminorum Infiitas 
(On the Analysis by Equations Unlimited in the Number of Their Terms). The receipt of 
this work is supposed to have persuaded Barrow to recommend Newton, “an unparalleled 
genius,” as his successor in the Lucasian professorship. 

The De Analysi opens with a rule, stated without proof, for computing the area under 
the curve y = ax”/": 


To the base AB of some curve AD, let the ordinate BD be perpendicular and let AB be 
called x and BD, y. Let again a, b, c,... be given quantities and m, n integers. Then 


D 


A B 


na 


Rule 1. If ax'"/” = y, then will x!" — area ABD. 
n 


m 


Because the x-coordinate of A is zero, Newton correctly evaluated the integral 
iy at™/"dt. Later in the De Analysi, he elaborated the demonstration of Rule 1. Newton 
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assumed that he had a curve and that the area under the curve was given by 


(1) z= n axintn/n 
m+n 


If o is an innitesimal increase in x, then the new abscissa would be x +0, and the 
increase in area (the area bounded by the curve, the x-axis, and the ordinate at x + 0) 
would be 


(2) ok oe ( n ) a(x +4 art 

m+n 
where oy is the increment by which the area increases. Newton then applied the binomial 
expansion to the right-hand side of this equation, subtracted (1) from (2), divided through 
by o, and discarded terms still containing 0, to arrive at the result 


m/n 
y=ax", 


He said that conversely, if the curve were y = ax”/”, the area under it would be 


n 
z= axntn/n 
m+n 


Although Newton seemed to grasp the relation between differentiation and integration, 
he argued obscurely. There was no attempt to explain the logic by which terms involving 
powers of o could be neglected in the calculation. 

After much persuasion, Barrow convinced his “pupil” that he should let Collins 
see the De Analysi. Collins quickly circulated copies among eminent mathematicians of 
his acquaintance, retaining a transcript of it for himself, before returning the original 
to Newton. It was Collins’s desire to offer the tract to the presses as soon as possible, 
but Newton had already conceived the bolder scheme of expanding De Analysi into 
a comprehensive account of his uxional—that is, calculus—methods. By 1671, the 
project had grown into the De Methodis Serierum et Fluxionum (On the Methods of 
Series and Fluxions). Between 1671 and 1676, Newton tried to arrange for the printing 
of this important work, either as a treatise complete in itself or as an appendix to one 
of his Lucasian lectures; but all the attempts failed, because booksellers found little 
market for severely mathematical publications. It rst appeared 65 years later in 1736 
in John Colson’s English translation, The Method of Fluxions and Infiite Series ; it was 
retranslated (1744) into Latin under the title Methodus Fluxionum et Serierum Infiitarum . 
Meanwhile, the substance of the manuscript was revealed privately to interested parties, 
and updated into the De Quadratura Curvarum (prepared in the period 1691-1693), 
which formed an appendix to the 1704 edition of Newton’s Opticks. 

In the De Methodis Fluxionum, Newton abandoned his use of in nitesimals in favor 
of uxions. Newton conceived of mathematical quantities as generated by a continuous 
motion analogous to that of a point tracing out a curve. Each of these owing quantities 
(variables) was called a “ent,” and its rate of generation was known as the “ uxion 
of the uent” and designated by a letter with a dot over it. Thus, if the uent was 
represented by x, Newton denoted its uxion by x; and denoted the uxion of x, the 
second uxion, by *, and so on. (In modern language, the uxion of the variable x 
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relative to an independent time-variable ¢ would be its velocity dx/dt.) The in nitely 
small part by which a uent was increased in a small interval of time, designated 0, was 
called the moment of the uent, and was denoted in uxional notation by xo. Newton 
did not actually introduce the familiar dot for uxions until 1691. Earlier he used the 
literal symbols p, g, and r for the uxions of x, y, and z. 

In the De Methodis Fluxionum, Newton stated the fundamental task of the calculus: 
“The relation of the uents being given, to nd the relation of their uxions [and con- 
versely].” He then illustrated this by several examples. The following extract shows the 
similarity between the approach taken by Newton and the modern method of differenti- 
ating a function: 


Thus let any equation x? — ax* + axy — y> = 0 be given and substitute x + xo for x, y + yo 
for y, and there will arise 


x? + 3x?x0 + 3xx0x0 + x30? — ax? — 2axxo — axoxo + axy + axyo 
+ ayxo + axoyo — y> — 3y’yo —3yyoyo — y> = 0. 


Now, by supposition, x? — ax? + axy — y3 = 0, which therefore, being expunged and the 
remaining terms divided by o, there will remain 


3x?x — 2axx + ayx +axy —3y7?y + 3xxxo0 — axxo +axyo — 3yyyo 
+ x00 — y300 = 0. 


But whereas o is supposed to be innitely little, that it may represent the moments of 
quantities, the terms that are multiplied by it will be nothing in respect to the rest; I therefore 
reject them, and there remains 


3x?x — 2axx + ayx +axy— 3y7y = 0. 


From this, it is clear that the method of uxions was not essentially different from that 
used in the De Analysi, nor were the obscurities of the earlier work removed. Newton still 
dropped all terms containing o once he allowed himself to divide through by o—yet o 
could not be treated as zero, because that would render the division illegitimate. The 
real advance was in treating moments xo and yo as varying with time, where earlier the 
moments o were xed bits of x and y. 

The dif culties in understanding Newton’s creation were due in part to the change 
of approach in each of his three works on the calculus. In nitesimals were emphasized in 
the De Analysi but abandoned in favor of a theory of uxions in De Methodis Fluxionum; 
and still later, that theory came to rest on prime and ultimate ratios in the De Quadratura 
Curvarum. The De Quadratura Curvarum, the last-written of Newton’s trio but the _ rst 
published, was the climax of his effort to establish the calculus on sound foundations. 
In earlier works, Newton neglected terms involving the quantity o with the doubtful 
justi cation that they were in nitely small compared with the other terms. Yet as long 
as o was a quantity, however small, it could not be rejected without affecting the result. 
Newton seems to have been aware of this, for in De Quadratura Curvarum, he declared 
that “in mathematics the minutest errors are not to be neglected.” Thus, he removed all 
traces of in nitely small quantities (“in the method of uxions there is no necessity of 
introducing gures innitely small”), replacing them with a new doctrine of ultimate 
ratios. According to Newton: “By the ultimate ratio of evanescent quantities is to be 
understood the ratio of quantities, not before they vanish, nor afterwards, but with which 
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OF 


ANALYSIS 


BY 


Equations of an infinite Number of 
Terms. 


1 HE General Method, which I had devifed fome confiderable Time 
ago, for meafuring the Quantity of Curves, by Means of Series, 
infinite in the Number of Terms, is rather flortly explained, 

than accurately demonfirated in what follows. 

2, Let the Bafe AB of any Curve AD have D 

BD for it’s perpendicular Ordinate; and call 

AB=x, BD=y, and let a, 4, ¢, &c. be given 

Quantities, and m and n whole Numbers, 

Then 

A 


The Quadrature of Simple Curves, 
RULETL 


mfe 


x” =Area ABD. 


an 
m+n 
The thing will be evident by an Example. 
1. If xt (—=1x%) sy, that is aia, and m2; it thall be tx 
= ABD. 


3. If ax* =y; it fhall be 


Te 2. Suppofe 


First page of the English translation (1745) of Newton’s De Analysi. (Printed by 


permission of the Johnson Reprint Corporation.) 


they vanish.” Although he may have been irting with the limit concept, Newton’s 
de nition was far from clear; even among his ablest admirers the word vanish caused 
much confusion and severe criticism. 

After Leibniz had returned to Paris in 1673, he maintained an active correspondence 
with Oldenburg, through which he was kept informed of the latest work of the English 
mathematicians. Oldenburg himself had never done any serious mathematics; thus, he 
was advised on all mathematical questions by John Collins (1625-1683), a self-educated 
man who developed a wide correspondence with the leading scholars of his time. In 
a letter (July 1676) from Oldenburg to Leibniz, based on a draft by Collins, mention 
was made that Newton had already developed a rule for nding tangents to algebraic 
curves. Probably as a result of this communication, Leibniz was moved, in the middle 
of October 1676, to make his return trip to Hanover by way of London and Amsterdam. 
During his week’s stay in London, he was permitted by Collins to examine a copy of 
Newton’s tract De Analysi, from which he made copious extracts, and also various letters 
of mathematicians that Collins had collected. There is much difference of opinion among 
the partisans of Newton and Leibniz about whether the uxional method, brie y exposed 
in the De Analysi, caught Leibniz’s eye or whether his interest was limited to the solution 
of problems by in nite series. But in transcribing portions of manuscripts con ded to 
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Collins, he opened himself to the suspicion that he had illicitly appropriated Newton’s 
ndings. 

Throughout the year, Leibniz had become keenly interested in the mathematical writ- 
ings of Newton and had asked Oldenburg for further information about them. Under the 
united persuasions of Oldenburg and Collins, Newton wrote Oldenburg two letters giving 
the status of his current research, with the request that Latin versions be transmitted to 
Leibniz. These celebrated letters, known now as the Epistola Prior of June 13, 1676, and 
the Epistola Posterior of October 24, 1676, established Newton’s priority for numerous 
results in his mathematical hoard. The rst letter contained the binomial theorem, as well 
as a summary of Newton’s more important work on series. Although nothing was said 
of his secret discovery of uxions, there was a vague hint that Newton had an important 
method in mind. Leibniz must have guessed that Newton’s methods were essentially 
similar to his own, for he wrote back to Oldenburg asking that Newton amplify the more 
decisive points in the letter. 

Newton could not refuse Oldenburg’s request to give Leibniz the information he 
wanted but was still disinclined to divulge his secret. The response to Leibniz’s pleas 
was a letter of 15 closely written pages sent through Oldenburg, a veritable treatise on the 
construction and application of in nite series. Having reached a point in the letter where 
an explanation of the method of uxions would logically follow, he carefully concealed 
the basic principle in a Latin anagram: 


b6accdel3ef fi3l9n4o04qrr4s8tl2vx. 


Anyone who could unscramble this jumble of six a’s, two c’s, one d, and so on, into a 
sentence might arrive at “Data zquatione quotcunque uentes quantitates involvent ux- 
iones invenire et vice versa.” Freely translated, this means “Given an equation involving 
any number of uent quantities, to nd the uxions, and conversely.” By this anagram 
Newton, without disclosing his method, could later establish a claim to priority in the 
invention of the calculus. 

Anagrams were frequently used by scientists in Newton’s time, and indeed long 
before that, as a means of establishing priority of discovery without revealing what had 
been found. The writer thereby had time to publish at his leisure. A notable instance 
occurred in 1610, when Galileo adopted the device to communicate to Kepler the news 
that the planet Saturn appeared to have close satellites, one on each side. (Actually, 
his telescope was too weak, and a better one would have revealed that the “handles,” as 
Galileo called them, were Saturn’s rings.) The triple nature of Saturn was announced by a 
jumble of letters that read, when properly disentangled, “I have observed the most distant 
planet is triple.” The mystery of Saturn was solved, nearly a half-century after Galileo’s 
rst observation, by Christiaan Huygens. Recognizing that the planet’s peculiarity was 
due to its encirclement by a thin plane ring, Huygens published (1656) his discovery 
in the cryptic form a’cod'e>g'h!i7l4m?n?o* p*q'r*s't*u>, meaning in English, “It is 
surrounded by a thin at ring, nowhere touching, and inclined to the ecliptic.” 

The Epistola Posterior later acquired great importance when it became part of the 
documentary foundation for the Royal Society’s report on the calculus priority dispute. 
It is now generally agreed that Newton’s anagram of transposed letters could not have 
given Leibniz much of a clue to the invention of the calculus; even the enunciation con- 
tained therein is so brief and obscure as to be practically useless. “If Leibniz,” wrote the 
mathematician Augustus De Morgan, “could have taken a hint, either from the preceding 
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letters in alphabetical order [in the anagram], or, had he known it, in their signi cant 
arrangement, he would have derived as much credit as if he had made the invention 
independently.” In any event, the forwarding of the Epistola Posterior was delayed for 
months, not to be conveyed to Leibniz before the end of June 1677. Leibniz replied 
immediately to the letter-packet, hoping to keep the correspondence alive. Addressed 
to Oldenburg, the response imparted a full and complete statement of the principles of 
his form of the differential calculus—notation apart, all but identical with Newton’s. 
Although Leibniz’s “noble frankness” appeared in marked contrast to Newton’s basic 
reluctance to discuss his thoughts, it did not extend to informing Newton of the week 
only lately spent examining Collins’s copy of the De Analysi. For Newton, however, 
the matter was closed; he had no further interest in continuing a correspondence that 
might steal his time and draw him into another controversy. He had already written to 
Oldenburg: 


I hope that this will so far satisfy Mr. Leibniz that it will not be necessary for me to write 
any more about the subject. For having other things in my head, it proves an unwelcome 
interruption to me to be at this point of time put upon considering these things. 


During the next several years matters stood thus. In 1684, Leibniz published the 
particulars of his differential calculus in a newly established scientic monthly, the Acta 
Eruditorum. (The Acta Eruditorum was established in Leipzig in 1682 and did in Germany 
what the Journal des Savants had been doing so well in France since 1665. Because it 
was published in Latin, the journal soon gained an international audience.) The paper, en- 
titled Nova Methodus pro Maximis et Minimis, itemque Tangentibus -.. (A New Method 
for Maxima and Minima, as well as Tangents ---), contained mechanical rules, with- 
out proof, for computing the differentials—which Leibniz always called differences—of 
powers, products, and quotients. It also had the familiar d notation for differentials. The 
rst published account of the calculus was too barren and obscure to make the subject 
generally understood. It was so unenlightening that the Bernoulli brothers called it an 
enigma rather than an explanation. Two years later, in another paper in the Acta Erudi- 
torum, Leibniz dealt brie y with the integral calculus; this marked the rst appearance 
in print of the integral notation fx (later {x dx) for summation. 

Thus it came about that although Newton had invented his method of uxions many 
years before Leibniz had invented his rival method of “differences,” Leibniz was the rst 
of the two to publish and to make known to the learned world his results. Had Newton 
secured publication for his De Methodis Fluxionum when it was originally written in 
1671, he would have had no competitor, and mathematical history might well have taken 
a different course. 

At this moment, relations between the two great contemporaries were harmonious, 
with no apparent trace of enmity. Even the rst edition (1687) of the Principia contained 
no accusation on Newton’s part, nothing but an ambiguous acknowledgment of Leibniz’s 
achievement in the eld. A scholium to Book II reads: 


In letters which passed between me and that most excellent geometer, G. W. Leibniz, ten 
years ago, when I signied that I knew a method of determining maxima and minima, of 
drawing tangents and the like, and when I concealed it in transposed letters --- the most 
distinguished man wrote back that he had also fallen upon a method of the same kind, and 
communicated his method, which hardly differed from mine, except in his forms of words 
and symbols. 
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NOVA METHODUS PRO MAXIMIS ET MINIMIS, ITEMQUE TAN- 
GENTIBUS, QUAE NEC FRACTAS NEC IRRATIONALES 
QUANTITATES MORATUR, ET SINGULARE PRO 
ILLIS CALCULI GENUS *). 


Sit (fig. 111) axis AX, et curvae plures, ut VV, WW, YY, ZZ, 
quarum ordinatae ad axem normales, VX, WX, YX, ZX, quae vo- 
centur respective v, w, y, x, et ipsa AX, abscissa ab axe, vocetur x. 
Tangentes sint VB, WC, YD, ZE, axi occurrentes respective in 
punctis B, C. D, E. Jam recta aliqua pro arbitrio assumta vocetur 
dx, et recta, quae sit ad dx, ut v (vel w, vel y, vel z) est ad XB 
(vel XC, vel XD, vel XE) vocetur dv (vel dw, vel dy, vel dz) sive 
differentia ipsarum v (vel ipsarum w, vel y, vel z). His positis, 
calculi regulae erunt tales. 

Sit a quantitas data constans, erit da aequalis 0, et dax erit 
aequalis adx. Si sit y aequ. v (seu ordinata quaevis curvae YY 
aequalis cuivis ordinatae respondenti curvae VY) erit dy aequ. dv. 
Jam Additio et Subtractio: si sit z—y + w+x aequ. v, erit 
dz—y-+ w+ x seu dv aequ. dz —dy + dw + dx. Multiplicatio: div 
aequ. xdv + vdx, seu posito y aequ. xv, fiet dy aequ. xdv + vdx. 
In arbitrio enim est vel formulam, ut xv, vel compendio pro ea 
literam, uty, adhibere. Notandum, et x et dx eodem modo in 
hoc calculo tractari, ut y et dy, vel aliam literam indeterminatam 
cum sua differentiali. Notandum etiam, non dari semper regressum 
a differentiali Aequatione, nisi cum quadam cautione, de quo alibi. 


Porro Divisio: d—vel (posito z aequ. 5 dz aequ. tvdy Fydv 


Quoad Signa hoc probe notandum, cum in calculo pro litera 
substituitur simpliciter ejus differentialis, servari quidem eadem 
signa, et pro + z scribi + dz, pro —z seribi —dz, ut ex addi- 


) Act. Erud. Lips, an. 1684, 
A page from Leibniz’s rst page on the differential calculus. Published in the 


Acta Eruditorum (1694). (From A Concise History of Mathematics by Dirk Struik, 1967, Dover 
Publications, Inc., N.Y.) 


In 1713, when the second edition of the Principia appeared, the priority dispute over 
the invention of the calculus was at its height. The scholium of the rst edition was 
allowed to stand, but contained the added phrase “and the concept of the generation 
of quantities,” so that the last clause read “which hardly differed from mine, except in 
his forms of words and symbols, and the concept of the generation of quantities.” Soon 
afterward, Newton denied that the scholium “allowed him [Leibniz] the invention of 
the calculus differentialis independently of my own.” When in 1726 the third edition of 
the Principia was issued, a new scholium appeared in the place of the one just quoted; 
neither Leibniz’s name nor his work was mentioned. With regard to the admission in the 
scholium of Leibniz’s rights as a second or simultaneous inventor, De Morgan remarks 
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that Newton was weak enough “ rst to deny the plain and obvious meaning, and secondly, 
to omit it entirely from the third edition of the Principia.” 

The death of Oldenburg in 1677 ended Leibniz’s tenuous link with the English math- 

ematicians, because there was no other intermediary suf ciently interested in carrying on 
the correspondence. By this time, Leibniz had arrived in Hanover, where he was to serve 
under three successive dukes—John Friedrich, Ernst August, and Georg Ludwig—in var- 
ious of cial capacities. Residence at the Court of Hanover (a Versailles in miniature, with 
Handel conducting the chamber concerts) meant intellectual isolation for Leibniz and the 
end of his mathematical aspirations. The r st of his masters, Duke John Friedrich, used 
him as a kind of all purpose civil servant: diplomat, counselor, and librarian. With his 
air for engineering, Leibniz also acted as a consultant on many technical projects, one 
of which was a scheme to increase the production of the Harz Mountain silver mines 
by using wind-powered pumps to clear the mine passages of seeping groundwater. But 
Leibniz’s functions and duties became increasingly trivial, with much time devoted to 
business of no permanent value. When the duke was succeeded by his brother Ernst Au- 
gust in 1679, Leibniz was appointed court historian and given the task of writing a history 
of the House of Brunswick. There was nothing that did not arouse a fruitful interest in 
him. In pursuit of genealogical lines, he made a series of travels (1687-1690) throughout 
the German states and Italy, ransacking archives to nd original documents. Leibniz de- 
clined, during his stay in Italy, an offer to be Vatican librarian, for its acceptance would 
have required that he change his religion. 

With the passing of Ernst August in 1698, the most brilliant period in Leibniz’s life 
came to a close. The old duke’s son, Georg Ludwig (afterwards George I of England), 
made little use of Leibniz’s services except to urge him to complete the history of 
the Hanoverian royal house. In 1700, Leibniz succeeded in persuading the elector of 
Brandenburg to found the Berlin Academy of Sciences, and was named its rst president; 
however, 10 years’ further effort was required to get the learned body properly running, 
with a meeting place of its own (Frederick the Great later remarked of Leibniz’s many- 
sided activities, “Leibniz himself comprises a whole Academy.”) Aside from occasional 
visits to Berlin—and to Vienna where he attempted to establish a similar academy— 
Leibniz grew into a recluse, not so much from personal inclination as from a lack of 
interest on the part of his masters. In his last years, Hanover became his prison, and his 
life there a misery. 


The Dispute over Priority 


We have seen that by 1693, Newton had written out three accounts of his approach 
to calculus. Although he had circulated manuscripts on the subject among friends in 
England, never once after the early 1670s did he evince any desire to make public 
his invention. This excessive reluctance to publish and apparent indifference to popular 
approval was not an indication of modesty, but a strong dislike for anything bordering 
on controversy. It was an ironic misfortune that Newton’s hesitation to secure priority 
of invention was punished by raising up a formidable rival, one who was to give him 
more trouble than what he had sought to avoid by withholding publication. 

The rst printed account of Newton’s long-hidden treasure appeared in the writings 
of John Wallis (1616-1703), the venerable Savilian professor of geometry at Oxford. 
Volume II of Wallis’s Opera Mathematica (1693), an augmented Latin translation of his 
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John Wallis 
(1616-1703) 


(Extract taken from A History of Science, Technology 
and Philosophy in the 16th and 17th Centuries, by A. 
Wolf. Reproduced by kind permission of Unwin Hyman 
Ltd.) 


Algebra, contained a direct statement of Newton’s method of uxions with its use of 
dots, “pricked letters,” x, ¥, ... for velocities, or uxions. In 1695, Wallis warned his 
young colleague at Cambridge not to delay further in putting forward his ideas: 


Your notions of uxions pass there [on the Continent] with great applause, by the name of 
Leibniz’s calculus differentialis. ... You are not so kind to your reputation, and that of the 
Nation, as you might be, when you let things of worth lie by you so long, till others carry 
away the reputation that is due you. I have endeavoured to do you justice in that point. 


The “justice” that Wallis had done was to state in the preface of Volume I of his collected 
Opera Mathematica (which did not appear until 1695, two years after the second volume) 
that Newton’s method of uxions was similar to Leibniz’s differential calculus, and that 
Newton’s method had been sent to Oldenburg in 1676 to be communicated to Leibniz. 
With that, the seeds of the great controversy on the invention of the calculus were already 
being sown. 

The rst serious volley in the battle between Leibniz and Newton and their re- 
spective supporters was red by Nicolas Fatio de Duiller, an obscure Swiss mystic and 
mathematician living in London. Fatio felt slighted at having been omitted from a list, 
compiled by Leibniz, of mathematicians capable of solving John Bernoulli’s problem of 
the path of quickest descent. He was determined to have his revenge. When he emigrated 
to London in 1687 he soon ingratiated himself with Newton and his inner circle of fol- 
lowers to such an extent that he was considered a possible editor for a new edition of the 
Principia. (Fatio is the author of the epigram on Newton’s tomb: “Let mortals rejoice 
that there has existed such and so great an ornament of the human race.”) In 1699, when 
Fatio prepared a tract on a Geometrical Investigation of the Solid of Least Resistance, 
he made a thinly veiled charge that Leibniz’s ideas were obtained from Fatio’s idol, 
Newton: 


Iam now fully convinced by the evidence itself on the subject that Newton is the rst inventor 
of this calculus, and the earliest by many years; whether Leibniz, its second inventor, may 
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have borrowed anything from him, I should rather leave to the judgement of those who had 
seen the letters of Newton, and his original manuscripts. 


Leibniz was indignant, but pointed out to friends that because he had been given credit 
in the rst edition of the Principia as an independent inventor of the calculus, Newton 
would doubtless vindicate him. When Newton maintained a haughty silence, Leibniz 
defended himself in the Acta Eruditorum, whose editors then refused to publish a reply 
written by Fatio. 

For almost ve years nothing more happened, until Newton’s Optiks came out in 
1704. Published with the main body of the work were two mathematical tracts; one was 
Tractatus de Quadratura Curvarum, dealing with his method of uxions. In the preface, 
Newton gave the following reason for including these tracts: 


In a letter written to Mr. Leibniz in the year 1676, and published by Dr. Wallis [in volume II 
(1699) of the Opera], I mentioned a method by which I found some general theorems about 
squaring curvilinear gure s.... And some years ago I lent out a manuscript containing such 
theorems; and having since met with some things copied out of it, I have on this occasion 
made it public. 


The reference to material purloined from a manuscript concerns Leibniz’s second visit 
to London. Newton became convinced that Leibniz had in some way gotten the key to 
the calculus from perusing De Analysi and other papers entrusted to Collins. 

The next January, Leibniz countered with an anonymous review in the Acta Erudi- 
torum of Newton’s two mathematical tracts. After criticizing the mathematical contents, 
Leibniz went on to claim that Newton’s method of uxions was his own calculus, merely 
renamed, in slightly different notation: 


The elements of this calculus have been given to the public by its inventor, Dr. Wilhelm 
Leibniz, in these Acts.... Instead of the Leibnizian differences, then, Dr. Newton employs, 
and has always employed, uxions, which are very much the same as the arguments of 
uent s produced in the least intervals of time; and these uxions he has used elegantly in 
his Mathematical Principles of Nature and in other later publications, just as Honoratus Fabri, 
in his Synopsis of Geometry substituted progressive motions for the methods of Cavalieri. 


This innocent-sounding statement provoked a storm of wrath in England. For Fabri was a 
notorious plagiarist, who had greatly damaged his reputation by his Synopsis of Geometry, 
in which he had published Cavalieri’s work as his own. The likening of Newton to Fabri 
was an outrageous insult, and together with the rest of the passage, it was taken as a 
direct accusation of plagiarism. 

This charge of plagiarism was hurled back at Leibniz in 1708 by John Keill, a 
Scottish mathematician soon to be professor of astronomy at Oxford. In a paper on The 
Laws of Centripetal Force, addressed as a letter to Halley, Newton’s “avowed champion” 
inserted the following: 


All these laws follow from that very celebrated arithmetic of uxions which, without any 
doubt, Dr. Newton invented rst, as can readily be proved by anyone who reads the letters 
about it published by Wallis; yet the same arithmetic afterwards, under a changed name and 
method of notation, was published by Dr. Leibniz in Acta Eruditorum. 


The paper was published in the Philosophical Transactions for 1710, and the copy sent 
to Leibniz was delayed until 1711. Leibniz demanded that the Royal Society intervene, 
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asking that “such vain and vociferous clamours” be suppressed and that a public apology 
be extracted from Keill. 

In the early spring of 1712, the Royal Society responded to Leibniz’s appeal by 
appointing a committee of 11 to examine all documents in the society’s possession that 
bore on the matter. Grandiloquently called ‘“‘a large committee of distinguished persons of 
several nations,” the group comprised only two foreigners, and only one of those two, De 
Moivre, was a mathematician. Six members of the committee were mathematicians, and 
at least seven were intimate personal friends of Newton, who had been president of the 
society since 1703. Halley was one of the committee, and as senior member, probably 
acted as its secretary. No one was the of cial representative of Leibniz, although the 
Prussian minister to England served on this ostensibly independent body. 

After a month’s research of the question, the committee delivered its report to the 
Royal Society. The report is a statement of the case for Newton, presented as if it were 
a set of impartial ndings. It left the question whether Leibniz was guilty of conscious 
plagiarism completely unresolved, and asserted gratuitously that Newton was the rst 
inventor of the in nitesimal calculus. The committee’s conclusion runs: 


That the differential method is one and the same with the method of uxions, excepting 
the name and the mode of notation; Mr. Leibniz calling those quantities differences, which 
Mr. Newton calls moments or uxions; and marking them with the letter d, a mark not 
used by Mr. Newton. And therefore we take the proper question to be, not who invented 
this or that method, but who was the rst inventor of the method. And we believe that 
those who reputed Mr. Leibniz the rst inventor, knew little or nothing of his correspon- 
dence with Mr. Collins or Mr. Oldenburg long before; nor of Mr. Newton’s having the 
method above fteen years before Mr. Leibniz began to publish it in the Acta Eruditorum of 
Leipzig. 

For which reasons, we reckon Mr. Newton the rst inventor; and are of the opinion, that 
Mr. Keill, in asserting the same, has been no ways injurious to Mr. Leibniz. 


Leibniz had never seriously questioned that Newton invented the method of uxions, 
but had claimed an independent discovery of the calculus. He had asked the society to 
vindicate him of the charge of having stolen the idea from Newton, merely fabricating 
another notation for it. Although the report af rmed that Newton was not a plagiarist 
(because he had been adjudged the rst inventor of the calculus), it failed to pronounce 
on Leibniz’s originality, and therefore on the truth of Keill’s accusation. The dispute 
could not, nor did it, terminate with the utterly illogical conclusion that Keill had not 
injured Leibniz by his abusive attack. 

The biased investigators based their judgment on a super cial examination of the 
available evidence. They assumed that Leibniz had seen a letter of December 1672 from 
Newton to that scientic intermediary, John Collins, in which “the method of uxions 
was sufciently described to any intelligent person.” Newton would always maintain 
that by reading this letter Leibniz hit on the invention of his differential calculus. It has 
been established that the Latin version that Oldenburg passed on to Leibniz was not an 
accurate rendering of the essential content of the letter, but only a résumé. The whole 
portion in which Newton explained his tangent method by an illustrative example was 
missing in the copy dispatched to Leibniz. Even allowing the possibility that Leibniz 
had inspected the original during his short visit to London in the autumn of 1676, this 
would have taken place six months after he had devised his own version of the calculus 
independent of Newton’s previous investigations. 
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The nished report was approved unanimously at a meeting of the Society in April 
1712 and ordered printed. No one bothered to inform Leibniz of the ndings. A sub- 
committee of three, including Halley as a member, was named to see it through the 
presses. The report was published the following January and appeared in Latin for the 
bene t of continental readers. It is known for short as the Commercium Epistolicum, 
the rst two words of its title Commercium Epistolicum D. Johanns Collinsii et Aliorum 
de Analysi Promota. Few copies were printed and these were not for sale but given as 
gifts to important individuals and institutions. 

Although Leibniz had his adherents among continental mathematicians, few were 
willing to risk their reputations for him. A notable exception, the bellicose John Bernoulli, 
dared to intervene on Leibniz’s behalf. On receiving a copy of the Commercium Epis- 
tolicum, Bernoulli wrote to Leibniz, “You are, at the start, accused before a tribunal 
which, it appears, is composed of the plaintiffs themselves and their witnesses.... You 
fall under the law, you are condemned.” He attempted indirectly to weaken the evidence 
by charging that “Newton was ignorant of the correct way of taking second differentials 
a long time after it was known to us.” Leibniz had little choice but to present his case 
in public, for that supposedly impartial body, the Royal Society, had heard no testi- 
mony from him. Without waiting to see the Commercium Epistolicum, he responded by 
publishing, under the cover of anonymity, a broadsheet known as the Charta Volans. 
Bernoulli’s letter was printed in it, without naming names, as the judgment of a “very 
eminent mathematician.” The Charta Volans raised the wrath of the English; Keill, New- 
ton’s gladiator-in-chief, exclaimed, “They have thrown all the dirt and scandal they could 
without proving anything,” and referred to Leibniz and his colleagues as “those Leipzig 
rogues.” 

Because Newton implacably resented any re ection on his honor, the dispute became 
yet more heated. Following the printing of the Commercium Epistolicum, there appeared 
an anonymous review of it (which has passed into history under the name of the Recensio) 
in the Philosophical Transactions of 1715. This too was the handiwork of Newton, though 
it was deceptively palmed off as Keill’s. The review, An Account of the Book Entitled 
Commercium Epistolicum, purported to be a rectication of the imperfect summaries 
published abroad. Full of unsupported innuendoes, it was a polemic against Leibniz and 
revealed the lengths to which Newton would go to destroy an enemy. It is reported that 
Newton once “had pleasantly” said to a friend that “he had broken Leibniz’s heart with 
his reply to him.” 

By waiting nearly 30 years before taking up his case, Newton had placed his rival 
at a disadvantage for which Leibniz was not prepared. Leibniz, when the battle was 
joined, lacked the textual evidence that would prove that he was not a latecomer to 
calculus (the Royal Society refused him access to the documents in its possession). 
Newton had more numerous allies; he had become something of a national hero in 
his old age and was fully supported by the members of the Royal Society, with their 
of cial organ the Philosophical Transactions. Leibniz commanded no such backing. As 
time wore on, his inuence at court had decreased more and more, so that toward the 
end of his life he found little consideration and at last only neglect from the Duke of 
Hanover. When his employer left Hanover (1714) to assume the crown of England as 
George I, Leibniz hoped to be given the post of royal historian; instead he was ordered 
to stay behind like a lazy schoolboy until he had completed the history of the House of 
Brunswick. 
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Leibniz spent the last decades of his life in philosophical speculation, with his 
thoughts set out for the most part in short occasional papers. The crown of his philosoph- 
ical thinking, the Theodicy (1710), was the only lengthy work published in his lifetime. 
Dealing with the question of whether the claims of rationalism were at odds with Christian 
doctrine, it became one of the most popular books of the eighteenth-century Enlighten- 
ment. Leibniz’s contention was that although any created world is bound to be imperfect, 
the evil that we see about us is the least possible amount of evil. This “best of all possible 
worlds” optimism was ridiculed later in the gure of Dr. Pangloss in Voltaire’s novel 
Candide (1759). 

Leibniz died in 1716 from a noxious potion he took during an attack of the gout. 
Unlike Newton, who was buried in Westminster Abbey amid great pomp and surrounded 
by eminent persons, Leibniz was buried in near solitude. There was not a single rep- 
resentative of the Court of Hanover at his funeral; the only mourner at the graveside 
was his faithful secretary. A friend wrote in his memoirs that Leibniz “was buried more 
like a robber than what he really was, the ornament of his country.” Little notice was 
taken either in Berlin or in London of the passing of the greatest mathematician Germany 
had produced in the seventeenth century, and only in Paris was an adequate eulogy to 
his memory pronounced before the Académie des Sciences. Diderot, the editor of the 
great Encyclopédie, said of Leibniz that he brought as much fame to Germany as Plato, 
Aristotle, and Archimedes together had brought to Greece. Yet not until 50 years later 
was his grave at Court Church in Hanover made recognizable by a suitable inscription. 

Newton, in his vendetta, continued to pursue Leibniz even beyond the grave. He 
superintended a second edition of the Commercium Epistolicum in 1722, with the Recen- 
sio pre xed as a kind of introduction and an anonymous new preface “To the Reader” 
(written by Newton himself). The title page appeared under the old date of 1712, which 
was tantamount to a declaration that this edition was an exact reprinting of the old; but 
numerous insertions and omissions were apparent when the edition was compared with 
the original text. Among the most notable variations was the addition of the sentence, 
“This Collectio was sent to Dr. Leibniz on June 26, 1676.” What gave signi cance to this 
statement was that the Collectio of John Collins—a 50-page set of memoranda and ex- 
tracts of letters presenting the history of the development of series expansions—contained 
the full text of Newton’s celebrated letter of December, 1672, the letter the Commer- 
cium Epistolicum states “suf ciently described the method of uxions to any intelligent 
person.” We know now that because the Collectio was too extensive a document for 
communication, Oldenburg sent Leibniz only a drastically curtailed survey of it. This 
is just a further instance of the carelessness on the part of those stooping to frame an 
accusation of plagiarism against Leibniz. 

The bitterness of the calculus priority dispute materially affected the history of 
mathematics in Western Europe. In England, it was regarded as an attempt by impudent 
foreigners to defraud Newton, her most eminent son, of the fruits of his genius. The 
natural reaction was that the purely geometrical methods preferred by Newton over the 
analytic ones were alone studied and used. English mathematicians were also blinded 
to the obvious advantages of Leibniz’s d notation over Newton’s “dottage” (the use 
of dotted or pricked letters to denote differential operations). For more than a century, 
excessive loyalty to the great man’s reputation kept them out of touch with developments 
on the European continent. The loss of cooperation was detrimental to both English and 
continental mathematics, but most especially to the English school. England produced 
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few creative mathematicians during the eighteenth-century “Age of Newton,” and none 
who could rightly be called great. 


Maria Agnesi and Emilie du Chatelet 


Few readers understood the elements of the new calculus contained in Newton’s 
Principia or the Acta Eruditorum articles written by Leibniz: neither of these was a 
proper textbook on the subject. Such a work, con ned simply to the differential calculus, 
did appear in 1696 when the Marquis de L’ Hospital published the lectures of his private 
teacher John Bernoulli, in a volume entitled Analyse des Infiiment Petits . Enjoying a 
wide circulation through its numerous editions, L’ Hospital’s Analyse dominated the eld 
for more than 50 years. In the middle of the eighteenth century, it found a worthy rival 
in the /nstituzioni Analitiche of Maria Agnesi. 

Maria Gaetana Agnesi (1718-1799), a brilliant linguist, philosopher, and mathe- 
matician, was the rst of 21 children of a professor of mathematics at the University 
of Bologna. Educated by private tutors, she early developed an exceptional gift for lan- 
guages; by her eleventh year she was uent in Latin, Greek, Hebrew, French, Spanish, 
and German, as well as her native Italian. Her father gathered in his home the most 
distinguished intellectuals from Italy and abroad, with his young daughter acting as host- 
ess. She would engage these scholars in academic disputations on a variety of scienti c 
and philosophical topics, usually speaking in Latin; and in ordinary conversation with 
a foreign visitor would respond in his own language. Her Propositions Philosophicae, 
based on these domestic discussions, appeared when Agnesi was 20 years old. 

Thereafter followed a decade of concentrated work—undertaken at rst for her own 
interest and subsequently for the instruction of her younger brothers—which resulted in 
the publication of the Instituzioni Analitiche (1748). It is the rst surviving mathematical 
work written by a woman. In two huge volumes containing more than 1000 pages, 
the Instituzioni Analitiche attempted a complete and unied treatment of algebra and 
analysis, one that would keep the reader abreast of the most recent advances in the 
differential and integral calculus. Widely acclaimed as a model of clarity and exposition, 
and translated from Italian into French (1775) and English (1801), it was soon superseded 
by the classic textbooks on the calculus published by Euler in 1768-1770 and 1775. The 
English translation of Agnesi’s book was done by John Colson, the Lucasian Professor of 
Mathematics at Cambridge who had also previously (1736) rendered Newton’s Principia 
from Latin into English. Colson went to the trouble of learning the Italian language late 
in life specically to translate the Instituzioni Analitiche, “so that British Youth might 
have the bene t of it as well as the Youth of Italy.” 

Of the many curves investigated in the Instituzioni Analitiche, Agnesi’s name is 
most frequently associated with the versed sine curve or versiera (from the Latin vertere, 
“to turn”), whose Cartesian equation is yx? = a*(a — y). Through confusion with the 
Italian word avversiera, which means “wife of the devil” or “witch,” this cubic curve 
has subsequently come to be known as the “witch of Agnesi.” 

Maria Agnesi is often said to have been the rst woman professor of mathematics on 
a university faculty. In recognition of her exceptional accomplishments, Pope Benedict 
XIV appointed her (1750) to the chair of mathematics and natural philosophy at the 
University of Bologna, but there is a difference of opinion as to whether she actually 
held that position. Some writers believe that she taught at Bologna from 1750 until 
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1752 while her father was seriously ill; other sources insist that Agnesi declined the 
Pontiff’s offer, having no ambition for public life and nding such a display of her talent 
distasteful. After her father’s death in 1752, she withdrew from all mathematical activity, 
devoting the rest of her long life to charitable projects and religious meditation. In 1771, 
Agnesi became directress of a home for the poor and inrm, a post she held until her 
death at 81 years of age. 

The year following the publication of Agnesi’s Institutzione Analytiche saw the 
sad, premature death of Emilie de Breteuil, the Marquise du Chatelet (1701-1749), the 
only Frenchwoman of her time to develop a serious talent for mathematics and science. 
Outside of Italy, the state-chartered scienti c societies—the Royal Society of London and 
the Académie Royale des Sciences—hindered rather than advanced women’s involvement 
in the scientic enterprise. Women were formally excluded from membership in these 
bodies. A woman of rank and privilege had more subtle ways of participating in scienti c 
discourse: she might be privately tutored by someone of recognized intellectual standing, 
or gain access to the informal salons where electrical and mechanical demonstrations 
were the great fashion. 

Emilie du Chatelet was well connected, born into one noble family and marrying 
into another. Like many aristocratic women of the period, she was highly educated in 
literature, poetry, and foreign languages; by the age of 12 she was uent in Latin, English, 
German, Italian, and Spanish. But du Chatelet’s persistent interest was mathematics. Her 
station enabled her to begin serious pursuit of the subject under the tutelage of two 
of the Académie Royale’s strongest mathematicians, Pierre-Louis de Maupertuis (1698— 
1759) and his young protégé Alexis-Claude Clairaut (1713-1765). De Maupertuis was 
renowned for having led, in 1736 with Clairaut, an expedition above the Arctic Circle in 
Lapland to measure the length of 1° of arc—that is, the length of an arc that subtended 
an angle of 1° at the earth’s center. Their calculations showed that the earth’s polar 
diameter was about 17 miles shorter than its equatorial diameter, conrming Newton’s 
conjecture that the earth attened toward the poles. On his return, Clairaut published a 
classic work on the earth’s shape, Théorie de la Figure de la Terre (1743). 

Emilie du Chatelet’s name is most often associated with that of the literary philoso- 
pher Voltaire, her longtime companion and sometime lover. When Voltaire wrote his 
popularization of Newton’s world system, Elements de la Philosophie de Newton (1738), 
du Chatelet was able to supply the mathematical expertise that he lacked. The preface in- 
deed gives full credit to her collaboration, implying that her contribution was the greater 
of the two. In 1738, both partners entered (she without informing him) the Académie’s 
prize competition for the best essay on the nature and propagation of re. Although 
neither was successful in the contest—the prize was divided between Euler and two 
lesser professionals—the Académie agreed to print their essays anyway, along with the 
winners’ entries. 

Du Chatelet next produced an introductory physics textbook for the bene t of her 
son. Designed to replace the outmoded Cartesian works then available, the /nstitutiones 
de Physique (1740) introduced Leibniz’s principles to a French audience, together with 
Newton’s viewpoint. 

In 1745, du Chatelet began the undertaking for which she is most remembered: a 
French translation, with detailed commentary, of Newton’s Principia. The translation 
project was to take many years to complete, and in fact it remained uncompleted at her 
untimely death. 
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When Mme. du Chatelet was nearly 44, an age at which most women of her time 
were grandmothers, she became unexpectedly pregnant. Not anticipating that she would 
survive childbirth, she dedicated all her efforts to completing her translation of Newton’s 
work. She was indeed working at her desk at a friend’s country estate when her daughter 
was born. She placed the baby on a quarto volume of geometry, Voltaire wrote to his 
friends, until she could call her maid. Initially all seemed well and du Chatelet’s suite 
was lled with visitors, but a few days later she became ill; she died of an infection 
within the week. Her death was sensational news in Europe’s intellectual circles, and she 
was given a funeral be tting a queen. No one seems to have given more thought to her 
infant, who died shortly after the mother. Mme. du Chatelet’s old friend Clairaut saw to 
the publication of her work on the Principia; not until 1759 did it appear in print, as a 
two-volume edition. 

It may be said of Emilie du Chatelet that she was more an interpreter of the accom- 
plishments of others than a creator of original science. Yet by her great contribution, the 
French scientic community was convinced that Newton’s law of universal gravitation, 
a British import, explained the behavior of the solar system. 

Clairaut went on to ever-greater fame in celestial mechanics by determining the 
time of the next return of Halley’s comet. Halley had boldly predicted that it would 
be visible again in late 1758 or early 1759. The actual arrival was dif cult to forecast 
because the comet’s motion would be slowed by the gravitational attraction of the large 
planets Jupiter and Saturn, both of which it would pass closely. After months of laborious 
calculation, Clairaut appeared before the Académie on November 14, 1758, to announce 
that the comet would reach the point on its orbital path closest to the sun—the perihelion 
of the orbit—on April 13 of the following year. He emphasized, however, that owing 
to a lack of accurate values for the masses of Jupiter and Saturn, the given data might 
vary by a month on either side of the estimate. When the comet rounded its perihelion 
exactly one month earlier than the day assigned by Clairaut, his brilliant feat was hailed 
throughout France. At the same time it provided visual con rmation of Newton’s work 
on the dynamics of the heavenly bodies. 


2P,, -_ (n 1)Ph-1 = P,, +h Pai 
and P? = Pr-1i(Pr+t = P,,). 


n 


3. Verify Leibniz’s famous identity, 
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2. Given that P, =n! denotes the number of 
permutations of n objects, establish the following 
identities from Leibniz’s Ars Combinatoria: l+x 


for —1 <x <1, by rst calculating by long division 
the series 


=l—-xtx?—-x34--., 
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and then integrating termwise between 0 
and x. 


5. Prove that 


\ l+x > Poe ae 
a ee 
"el Tox ,° 5" % 


for — 1 <x <1, and hence 
log2 = 2 Hee ee = 
aces ec ae ae ae 
6. Supply the details of the following derivation, due to 


Euler, of the in nite series expansion for log(1 + ~x): 


(a) Show that log(1 + x) can be given by the limit 


log(1 +x) = lim n[(1 + x)!" — 1]. 


[Hint: Since 
d d 
ade 1+ ae elt log(1+x) 
it = ae ] 


= (1+x)" log(1 + x), 


the above limit is the value of the derivative of 
(1+ x)" atu=0.] 
(b) Next use the binomial series expansion of 
(1 + x)'/" to obtain 
(/n—1) 5 


log +x) = lim E + ———x 
noo 2! 
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3! 
a (1/n — 1)(1/n — 2)(1/n — 3) ee], 
4! 
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(c) From the fact lim (- - k) = —k, conclude 
n>o \n 
that 
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7. Show that the binomial theorem, as stated by Newton 
in his letter to Oldenburg, is equivalent to the more 
familiar form 


rr—1l) , 
x 
2! 


(+x) =1lt+rx+ 


r(r — 1)(r — 2) 
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where r is an arbitrary integral or fractional 
exponent. The necessary condition |x| < 1 for 
convergence was not stated by Newton. 


8. Use the binomial theorem to obtain the following 
series expansions. 


(a) Utxyl=1l—-x4x?-334.-. 


+ (-1)"x" +---. 
(b)  (l—xy? =142x43x7+... 
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9. Prove that the inequality 2 <(1+1/n)" <3 holds 
for all integers n > 1. [Hint: By the binomial 
theorem, 


10. Supply the missing details in the following derivation 
of Newton’s series 


(a) From the expansion 


1 
1+ x4 


=1]-—x44x8 
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and termwise integration, show that 


Vx? 1 1 
dx=14+--- 
[(S)e ue a 


(b) Verify that 
1/1 4x? i 1 
;)dx =~ = 
o \l+x 2Jo \L 42x +x? 


1 
+ ———_ J dx 
2] 


=if dx 
2d 4 1452+ a? 


(c) Use the integration formula 


dx 1 x 
eee! = — arctan — 
a <p x a a 


to get 
1/7! dx 1 
= —— = —[arctan(1 + J2 
aa Won : 


—arctan(1 — J/2 y] 
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11. Newton’s method for approximating a real root of 
the equation y = f(x) is to start with an estimated 
value x, and in place of the curve use the line 
y — f(x1) = f'(%1)(« — x1), which is tangent to the 
curve at the point (x;, f(x:)). The intersection of this 
tangent with the x-axis, 


f(%1) 
f'(x)’ 


gives the next approximation of the desired root. 
Newton applied (1669) his method of tangents to 

x3 —2x —5=0. Find to ve decimal places the root 
of this cubic lying between 2 and 3. 


=X 


12. (a) Apply Newton’s iterative method to the 
polynomial f(x) = x" — a, where n is a 
positive integer, to show that if x; is an 
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approximation to ./a, then 


(n—1)x; ta/xt 
n 


XQ = 


is a better approximation and so on. 

(b) Find 3/2 correct to ve decimal places. [Hint: 
Take x; = 1 asa rst approximation and use 
part (a).] 


13. The construction of the witch of Agnesi curve can be 
accomplished as follows: Consider a circle, with 
diameter OA = a, lying between two parallel 
tangents. Draw a secant through the lower point O of 
tangency and meeting the upper tangent at B and the 
circle at C. Let the point P be the intersection of 
lines CP and BP parallel and perpendicular, 
respectively, to the lower tangent. 


To see that the locus of the point P as the secant 
varies is the desired curve, show that 


(a) Triangles BPC and OAB are similar, whence 
(BP)(AB) = (OA)(PC). 

(b) If O is the origin of a rectangular coordinate 
system and P and C have coordinates (x, y) 
and (u, v), respectively, then 
(a — y)x = a(x —u) and so u = (xy)/a, while 
v=y. 

(c) Substituting the values u and v into the equation 


ee) 


yields y(x? +a?) =a. 
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CHAPTER Q 


The Development of Probability Theory: 
Pascal, Bernoulli, and Laplace 


The theory of probabilities is at bottom nothing but common sense reduced to calculus; it enables us 
to appreciate with exactness that which accurate minds feel with a sort of instinct for which ofttimes 
they are unable to account. 

LAPLACE 


According to legend, probability theory began as a 


The Origins of Probability Theory branch of mathematics with the correspondence be- 


tween Blaise Pascal and Pierre de Fermat in 1654. 
Graunt’s Bills of Mortality In ne detail this is wrong. Years before Pascal and 

Fermat ever thought of de ning the “true worth of 
a chance,” isolated problems of a probabilistic nature had been tackled by some mathe- 
maticians. It would be more appropriate to say that Pascal and Fermat supplied vital links 
in a chain of reasoning that gave us probability theory as we now know it. The dif culty 
in trying to trace this chain to its origin is that probability theory started essentially as an 
empirical science and developed only later on the mathematical side. The subject had its 
twin roots in two fairly distinct lines of investigation: the solution of wagering problems 
connected with games of chance, and the processing of statistical data for such matters 
as insurance rates and mortality tables. 

Straightforward counting of population and economic wealth goes back to antiquity. 
The numbering of the people of Israel, Emperor Augustus’s balance sheets on the Ro- 
man Empire, and William the Conqueror’s inventory of his possessions, the Doomsday 
Book (so named by the English because there was no appeal from it) are a few of the 
most outstanding historical examples of gathering data. Although quantitative facts were 
collected at various times, such information was of less value to a feudal society than 
to a society that based its economy on trade and manufacture. Systematic and extensive 
statistical investigations began only when various institutions of the new merchant class, 
particularly insurance companies, called for the probabilistic estimation of certain events. 

The oldest type of insurance seems to have been designed for protecting merchant 
vessels and was in vogue even in Roman times. In the fourteenth century, the rst ma- 
rine insurance companies were established in Italy and Holland; and the idea spread to 
other countries by the sixteenth century. (The famous Lloyd’s of London was founded 
sometime before 1688.) Although a life-insurance policy was known to have been 
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underwritten by a small group of men in London in 1583, a well-organized company for 
this purpose was not established until 1699. Certainly some sort of scientic determina- 
tion of probabilities was required to put these operations on a sound actuarial footing. 
Yet for many years, the Dutch nanced public business through annuities—loans paying 
an annual interest for life to the lender—which were so badly calculated that the towns 
were regularly losing money; despite a brisk commerce in these contracts, no government 
appeared to have made the cost of an annuity a function of the age of the purchaser. 

The London merchant John Graunt (1620-1674) was the rst to draw an extensive 
set of statistical inferences from mass data. In 1662, Graunt produced a tract entitled 
Natural and Political Observations Made upon the Bills of Mortality, a work that may 
be said to have launched the discipline we now call mathematical statistics. The Bills 
of Mortality, on which Graunt based his deductions, were originally weekly and yearly 
returns of the number of burials in several London parishes. The practice of keeping 
such tallies seems to have arisen as early as 1532 out of a desire to know about the 
current state of the Plague. The bills were regularly made out for all of London starting 
in 1563, although they were not actually published until 1625; they gradually became 
more explicit in the information they imparted. An attempt was made to classify all 
deaths in London with respect to several causes (only two, disease and accident, were 
listed at rst). The sex of the victim was distinguished, but the bills did not specify an 
age at the time of death. The data were obtained by searchers, usually “ancient matrons,” 
sworn to their of ce, and their work included viewing the body and inquiring about the 
disease or casualty that had led to death. 

Graunt had noticed that most of those who regularly bought the bills made little 
use of them. They merely scanned the total number of burials, looked for something 
unusual among the listed causes of death, and stored up the results for gossip at the 
next social occasion. Sure that better use could be made of this information, Graunt set 
out to collect as many of the past bills as were available. The statistics for the 57 years 
from 1604 to 1661 were reduced to a series of tables, which he published at some length 
in his Natural and Political Observations. Although Graunt claimed that his pamphlet 
required less than two hours’ reading, it was studded with many conclusions of varying 
generality and validity, drawn from inspecting the data. Among the important statistical 
regularities Graunt observed: the number of male births was greater than the number 
of female births; women live longer than men; the number of people dying from most 
causes except epidemic disease was fairly constant from year to year; at the suitable age 
for marriage the numbers of women and men were about equal, so that “every woman 
may have a husband, without the allowance of polygamy.” On the light side, contrasting 
burial statistics for men and women with physicians’ claims that they had twice as many 
female as male patients, Graunt concluded that either physicians generally cured women’s 
in rmities—or the compensating numbers of men died from their vices without recourse 
to medical aid. 

In studying the Bills of Mortality, Graunt inquired “how many die usually before 
they can speak, or how many live past any assigned number of years.” Because the bills 
lacked age data on childhood mortality, Graunt selected certain of the listed causes of 
death—such as thrush, teeth and worms, abortives, convulsion, infant complaints, and 
smothering—which would indicate a child up to the age of four or ve. To the numbers 
associated with these childhood causes of death, he added an estimated proportion of the 
deaths from more general causes such as smallpox, measles, and plague. Deaths among 


The Origins of Probability Theory 441 


LON DON'S Dreadful Vifiation: 


Or, A Correction of All the 


Bills of Mortality 


For this Prefent Year: 

Beginning the 27" of December 1564, and 
ending the 19th. of December following : 
Asalfo, The GENERAL or whole years BILL: 
According to the Report made to the 
Kino’s Molt Excellence Majelty, 


By the Company of Parife Clerks of London. 
Snes sn case ad 


BE 
aes Ae 


i= 


a, y 
Nee Ie 

Z, = 
AS i, 


- --¢ 
—ay 
— 


> p——___~¢ 
St sean 


— 


2 


7 ae LONDON: 
1 Printed and are to be fold by &. Coses living in Alderfgate-fircet, 
’ Printer to the faid Company t 6 6 §. 


Xe > 


f 


Bills of Mortality recorded by London parish clerks. (“Bills of Mortality” [p. 183] in 
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the elderly were more readily counted, as “aged” was a category listed by the searchers. 
Graunt assumed that this number represented persons over the age of 70, because “no 
man can be said to die properly of Age, who is much less.” 

To form some sort of mortality table, Graunt concluded from such conjectures that 
36 out of every 100 people die by the age of 6, and at the other end of the life scale, 
hardly anyone lives to be 75. He then made an estimate of how many people die in each 
successive decade and summarized the results in a famous table, the London Life Table. 
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Age Survivors 


0 100 
6 64 
16 40 
26 25 
36 16 
46 10 
56 6 
66 3 
76 1 


Graunt’s importance both as statistician and empirical probabilist lay in this novel 
idea: a group of births followed through life and gradually (in those days it was not 
so gradual) reduced in number by death. Although based on scanty and uncertain data 
and containing the curiously arbitrary notion of a constant death rate after age six, 
the London Life Table represented a tremendous advance from simple death totals to 
a new and graphical method of representing the age pattern of mortality. (The deaths 
in each decade really amount to about 3/8 of the survivors at the beginning of the 
decade.) 

These researches secured for Graunt the honor of being one of the charter fellows of 
the Royal Society of London when it was founded in 1660. The 119 original fellows were 
scientists, doctors of medicine, and doctors of divinity, noblemen, lawyers, civil servants, 
and literary men; Graunt was the only shopkeeper. There is a story that the of cers were 
uncertain whether to admit Graunt to their number, because he was only a tradesman in 
the city, and they sought the opinion of Charles II on the matter. The king replied that 
“they should certainly admit Mr. Graunt, and if they found any other such tradesmen, 
they should be sure to admit them also without delay.” As it happened, the society was 
able to follow this advice while Charles was still alive and elect another distinguished 
shopkeeper a fellow in 1679. Antoni van Leeuwenhoek, of Delft, was recognized for his 
pioneering microscopical observations. 

Graunt’s work drew the attention of others, notably the Huygens brothers in Holland, 
to the question of the length of life. Ludwig Huygens wrote to his older brother Christiaan 
(1629-1695) in 1669: “I have just been making a table showing how long people of a 
given age have to live... . Live well! According to my calculations you will live to 
about 565 and I to 55.” From Graunt’s gures, Christiaan then constructed his own 
mortality curve (the earliest known graph produced from statistical data) and used this 
to dene such notions as the probability of death in a given time and the probability of 
surviving to a given age. These ideas were put on a sound mathematical footing in 1657, 
when Christiaan Huygens wrote De Ratiociniis in Ludo Aleae (On Reasoning in Games 
of Chance), the rst text on probability theory to be published. It is only fair to point out 
that Huygen’s treatise was prompted to a large extent by the work of Pascal and Fermat, 
with which he became acquainted on a visit to France in 1655. 
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Games of Chance: Dice and Cards 


The other source to which probability theory owes its birth and development is 
gambling. Gambling originated in the early stages of human history; it was not the 
invention of a single people but appeared at many places in the world. It is reasonable 
to guess that it evolved from some sort of divination rites, most likely divination by 
lot. (These rites are schemes whereby the deity consulted is given an opportunity of 
expressing its wishes.) The question was posed by the priest or suppliant, the dice were 
cast on the sacred ground, and the answer of the god deduced. Nothing was random, 
there was no chance; because the gods had in uence over the course of earthly events, 
they could in particular interfere with the throwing of the dice. In this way, the ancient 
and universal practice of betting may have developed from casting lots in religious 
ceremonies. 

Almost every primitive culture engaged in some form of dice play. The predecessor 
of the die, and the most common gambling device of early peoples, was a tarsal bone (the 
astragalus) from the hind foot of a hooved animal. Archeologists digging in prehistoric 
sites have uncovered many of these hard, solid bones among the debris. The astragalus, 
because of its peculiar oblong shape, can rest on only four sides, two of them broad 
and two narrow. The four sides differed suf ciently in appearance to make it possible 
to distinguish among them, so that the early astragali were not marked. One of the 
games in ancient Greece consisted in throwing four astragali together and noting which 
sides fell uppermost. The most widely used rule attributed the highest value to the 
throw that showed different faces of each of the four bones (the throw of Venus). The 
throw least valued among all others was called the dog, so that to throw the dog was 
to lose. This perhaps accounts for the phrase “going to the dogs,” that is, playing a 
losing game. 

The six-sided die may have been obtained from the astragalus by grinding it down 
until a rough cube was formed. Whether the die evolved in this or in some other way, 
it was known well before the birth of Christ. The earliest known dice were excavated 
in northern Iraq and it is estimated they date from 3000 B.c. They were made of red 
pottery, and the faces marked with from one to six dots (probably because there was no 
convenient symbolism for numbers). About 1400 B.c., the present arrangement of dots, 
with two partitions of seven imprinted on opposite faces, was introduced. 

One question we shall never clearly resolve is when people rst started to play 
games of chance. In the Egypt of 3500 B.c., a board game called “Hounds and Jackals” 
was played, in which counters were moved according to certain rules by throwing the 
astragali. In reference to a famine in Lydia (about 1500 B.c.), Herodotus wrote in his 
History: 


For some time the Lydians bore the afiction patiently, but nding that it did not pass away, 
they set to work to devise remedies for the evil. Various expedients were discovered by 
various persons; dice, hucklebones [astragali] and ball, and all such games were invented, 
except table [an early form of backgammon?], the invention of which they did not claim as 
theirs. The plan adopted against the famine was to engage in games one day so entirely as 
not to feel any craving for food, and the next day to eat and abstain from games. In this 
way they passed eighteen years. 
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In the same vein, Girolamo Cardan wrote in his Liber de Ludo Aleae [Book on Games 
of Chance] that various games were invented during the 10-year siege of Troy (circa 
1200 B.C.) to bolster the morale of the surrounding Greek soldiers, who suffered from 
boredom. At some unknown time, dice nally supplanted astragali as instruments of 
play, and because cards were not introduced into Europe until the 1300s, gaming for 
amusement or prot must have been conducted mainly with dice for a thousand years. 

By the time of the emergence of Rome, gambling with dice was a commonplace 
recreation, not only among the educated classes but also among the middle and lower 
classes. We are told that the Emperor Claudius (10 B.cC—A.D. 54) was so devoted to 
dicing that he wrote a book called How to Win at Dice, and that he used to play while 
driving, throwing on a board especially tted into his carriage. Another regal player, Em- 
peror Marcus Aurelius (121-180), was accompanied everywhere by his personal croupier. 
Gaming reached such popularity with the Greeks and Romans that it was found neces- 
sary to issue laws forbidding it, except during certain seasons. Although gambling was 
in great favor among the Greeks and Romans, it was strictly forbidden by the Jews under 
penalty of death. The reason for the condemnation was that the gambler always expects 
to win, and therefore to get something for nothing; the rabbis considered this immoral 
and akin to robbery. 

Throughout the Middle Ages, the Christian Church appears to have carried out a 
campaign against playing with dice and also cards. Their bans and prohibitions were not 
directed against the games as such but rather against the accompanying vices of drinking 
and swearing. For example, participants in the Third Crusade (A.D. 1190) were issued 
instructions that no person below the rank of knight was to gamble for money, whereas 
knights and clergy might play but could not lose more than 20 shillings in 24 hours. 
Medieval history is full of such attempts to prohibit gaming, with just such results as 
might be expected. In the thirteenth century, Louis IX of France (1214-1270) issued an 
edict forbidding not only the play of dice, but even their manufacture in his realm. 

The exact origin of card playing is murky, with the invention variously credited to 
the Egyptians, the Chinese, and the Indians. It is known that card playing was introduced 
into Europe through the Crusades (the Crusaders also learned chess from the Arabs) and 
became rmly established in Western society during the 1300s. The rst playing cards 
made in Europe were hand-painted, a luxury of great beauty available only to the upper 
classes. With the invention of printing, cards were manufactured in quantity; the presses 
of Johann Gutenberg turned out the 78-card tarot deck in the same year as his famous 
Bible (1440). About 1500, the French developed the present-day variety of spades, hearts, 
clubs, and diamonds. The now stylized face cards were originally portraits of actual 
persons; for instance, the four kings were the biblical David (spades), Charlemagne 
(hearts), Alexander the Great (clubs), and Julius Caesar (diamonds), representing the 
four monarchies or empires of the Jews, the Franks, the Greeks, and the Romans. 

No sooner did playing cards become widespread than special rules were issued 
limiting their use—an effort as ineffective then as today. A series of English laws was 
passed during the reign of Edward III (1312-1377) that added dice and cards to the 
list of unlawful amusements in order to promote manly sports, such as archery. In 1397 
the provost of Paris issued an edict prohibiting playing certain games on working days, 
among them games of cards. Nothing indicates more the persistence of gambling than 
the continued efforts on the part of civil and ecclesiastic authorities to control the evils 
associated with it. 
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One might assume that during the several thousand years of dice play preceding, 
say, the Renaissance, some rudiments of a probability theory would have appeared. Yet 
no link between mathematics and gambling was suspected. There is little evidence to 
suggest that anyone thought that calculating the frequency of dice falls was possible, 
or even that each face would turn up with equal probability. Several explanations for 
the late emergence of probabilistic notions have been offered. Possibly the dice used 
were imperfectly balanced, precluding noticeable regularity in their falls; or perhaps the 
absence of a suitable mathematical notation hampered a critical investigation into the 
laws of chance. A more powerful reason might be that randomness itself was an idea 
alien to the prevailing modes of thought. Not only according to the Christian Church’s 
teachings on absolute truth but in accord also with far more ancient ideas of divination, 
God or the gods directed earthly events. Whatever the reasons might have been, it is 
undeniable that a doctrine of chance happenings took a remarkably long time to develop. 

Cardan’s Liber de Ludo Aleae contains the rst reasoned considerations relating 
games of chance to a rudimentary theory of probability. Written around 1550, the 
manuscript was discovered posthumously among Cardan’s papers in 1576. It was not 
printed until 1663; but Cardan’s lectures were popular, so the contents were no doubt 
familiar to his pupils and friends. Cardan gave a rough de nition of probability as a 
ratio of equally likely events: the probability of a particular outcome was the sum of the 
possible ways of achieving that outcome divided by “the whole circuit,” the totality of 
possible outcomes of an event. (The “whole circuit” is 6 for one die.) Cardan investi- 
gated the probabilities of casting astragali and one or more dice, and he undertook to 
explain the occurrence of card combinations for the game of primero (similar to poker). 
A signi cant passage in a chapter entitled “On the Cast of One Die” reads: 


I am as able to throw 1, 3, or 5 as 2, 4, or 6. The wagers are therefore laid in accordance 
with this equality if the die is honest and, if not, they are made so much the larger or smaller 
in proportion to the departure from true equality. 


For the rst time, we nd a transition from empiricism to the theoretical concept of a fair 
die. In making it, Cardan probably became the real father of modern probability theory. 

One of the earliest problems that can be classi ed as a question in probability theory 
concerns the fair division of stakes between two players when the game is interrupted 
before its conclusion. This is rst found in print in Fra Luca Pacioli’s Summa de Arith- 
metica, Geometrica, Proportioni, et Proportionalita (1494), where among the “unusual 
questions,” he proposed the following: 


A team plays ball so that a total of 60 points is required to win the game and the stakes are 
22 ducats. By some incident, they cannot nis h the game and one side has 50 points, and 
the other 30. What share of the prize money belongs to each side? 


The form of the problem suggests that it is Arabic in origin, although it is not contained 
in Fibonacci’s Liber Abaci (1202), which brought many Arabic puzzles to the West. The 
question is certainly not original with Fra Luca, having appeared in Italian mathematical 
manuscripts as early as 1380. 

Fra Luca’s response to the “problem of points” is that the stakes should be divided 
in the proportion 5:3, the ratio of the points already scored. In 1539, in a treatise entitled 
Practica Arithmeticae Generalis (Practical General Arithmetic), Cardan wrote with acid 
superiority about the errors in Fra Luca’s work, “And there is an evident error in the 
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determination of the shares in the game problem as even a child should recognize, while 
he [Fra Luca] criticizes others, and praises his own excellent opinion.” Cardan pointed 
out that the solution did not take into account the number of points yet to be won by 
each team. He then propounded an alternative solution, equally incorrect: namely, if x 
and y are the number of points won by each of the teams when the total number required 
to win is z, the stakes should be divided in the ratio 


142434---+(—x)1+2434+---+@ -y). 


Cardan’s arch enemy, Tartaglia, also dealt with fair stakes in his monumental Generale 
Trattato (General Treatise) of 1556. In a section entitled “Error di Fra Luca dal Borgo.” 
Tartaglia pointed out, “His [Fra Luca’s] rule seems neither agreeable nor good, since if 
one player has, by chance, ten points and the other no points, then by following this rule, 
the player who has the ten points would take all the stakes, which obviously does not 
make sense.” As opposed to Fra Luca, who recommended that the stakes be divided in 
proportion to the number of points won by each player, Tartaglia recommended that the 
deviation from half of the stake should be proportional to the difference of the points 
won by the players. That is, Tartaglia’s rule required that if z were the total number of 
points needed for completion of the game and x and y were the points in hand, then the 
player with the most points should take a portion 

1 x-y 


2 2z 


of the stake. (Thus, the total stake in Fra Luca’s problem would have been divided 
in the ratio 2:1.) Again we have an incorrect solution, because a fair division should be 
proportional to the probability of winning the whole stake. Toward the end of the section, 
Tartaglia seems to have given up, with the remark, “Therefore I say that the resolution of 
such a question is judicial rather than mathematical, so that in whatever way the division 
is made there will be cause for litigation.” The “problem of points” remained in most 
arithmetic texts until well into the 1600s. The problem is so dif cult that its solution (the 
correct ratio is 7:1) by Pascal in the spring of 1654 may well be considered a decisive 
breakthrough in the history of probability theory. 


The Precocity of the Young Pascal 


Blaise Pascal (1623-1662) is popularly linked with Fermat as one of the joint 
founders of probability theory. There is no doubt that Pascal was an exceptionally ver- 
satile man. It is unusual to nd a person who is at the same time a gifted writer and a 
religious philosopher, as well as a creative mathematician and an experimental physicist. 
He is best known to the general reader as author of the rst great prose classics in the 
modern French language, Lettres Provinciales and Pensées. As a mathematician, Pascal 
has been described as the greatest might-have-been in history. His mathematical repu- 
tation rests more on what he might have done than on what he actually accomplished. 
During much of his life, his researches were impeded by poor health—he suffered from 
intolerable stomach pains accompanied by violent migraine headaches—and religious 
concerns. Pascal’s three bursts of mathematical activity fell almost wholly in 1640, 1654, 
and 1658; hardly enough time to create the body of work of a great mathematician. 
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Blaise Pascal 


(1623-1662) 


Blaise Pascal’s mother died when he was but three years old, and he was brought up 
by his father, Etienne Pascal, a local judge who was himself a mathematician of some 
stature. Etienne Pascal was so desirous of supervising the education of his children, 
especially that of the precocious Blaise, that he sold his post and moved the family 
to Paris in 1630. Etienne’s theories on education were somewhat unorthodox, and the 
young Pascal was reared without ever having gone to school or ever having studied at 
a university. He was taught exclusively by his father, who would permit no one to help 
or to interfere with his plan for teaching the boy. From an early age, Blaise appears to 
have been a sickly child, subject to nervous disorders. It is not clear whether Etienne 
Pascal feared that the charm of mathematics would distract the little boy from other 
interests or whether he was afraid of overtaxing his son’s fragile health, but according 
to the father’s planned course of education, the youngster was to be kept from studying 
mathematics until he was about 15 years old. To enforce this prohibition, all textbooks 
on mathematics were locked up, out of sight, and the subject was a forbidden topic in 
the daily instruction. But Blaise Pascal was one of those prodigies whose genius would 
have surmounted any method of schooling or restraints on his curiosity. Although the 
elder Pascal was an excellent tutor, he made a poor psychologist. 

At the age of 12, the youngster’s natural curiosity got the better of his father’s 
educational scheme. Blaise insisted on knowing what geometry was, and when he was 
given evasive answers, he set to work on it himself. Etienne Pascal entered the playroom 
quietly one day and found his son drawing charcoal gures on the oor tiles. Calling 
lines and circles “bars” and “rounds,” the boy had been working out for himself the 
principles of geometry, and was at the moment trying to prove his guess that the sum 
of the angles of a triangle is two right angles (Euclid’s Proposition 32). With tears of 
joy the father made quick revision of his educational plans; Blaise was given a copy of 
Euclid’s Elements to study. 

Although the substance of this oft-told tale is no doubt true, it is frequently exag- 
gerated to assert that the little Pascal rediscovered for himself all the propositions of 
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Euclid up to the thirty-second and found them in the same order in which Euclid set 
them forth. But the order of Euclid’s rst 32 propositions is not so logically inevitable 
that any mathematical innocent, however gifted, would reproduce them as they stand. A 
more credible explanation is that with the help of the axioms that he had worked out, 
and through his own course of reasoning, Pascal reached Proposition 32. For a youngster 
of 12 to have discovered this on his own is already enough to remind us of a Mozart 
composing symphonies at the same age. 

A feature of the time was the formation of sociable groups in which the arts and 
sciences could be discussed seriously, but informally, outside academic settings. When 
Father Marin Mersenne founded an “academy” to exchange information on mathematical 
and scientic topics of the day, Etienne Pascal was one of the rst members. The evident 
genius of young Blaise, then 14 years old, prompted his father to bring the youth to the 
weekly gatherings and even encouraged him to contribute his own share of ideas. Having 
mastered Euclid’s Elements without assistance, the boy was now making rapid progress 
in geometry. The bent of his mind, eager to know the reason for everything, could have 
found no more satisfying subject to study. At 16, the young Pascal put his mathematical 
abilities to the test before Mersenne’s august circle by offering for discussion a handbill 
that bore the simple title Essai pour les coniques. Printed in 1640 on a single piece of 
paper, this broadsheet is, for all its brevity, one of the most fruitful pages in the history 
of mathematics. 

The Essai contained the statements of a number of general theorems of a projective 
nature, including the equivalent of what has since become known as Pascal’s mystic 
hexagon theorem: If a hexagon is inscribed in a circle, the three points of intersection 
of pairs of opposite sides lie on a straight line. To spell it out in more detail, Pascal had 
this to say: Inscribe any six-sided polygon in a circle (or more generally, in a conic) and 
label the vertices A, B, C, D, E, and F.. Extend a pair of opposite sides, say, AB and 
DE, until they meet at a point P; extend another pair of opposite sides, say, AF and CD, 
until they meet at a point Q; nally extend the third pair of sides so that they meet at 
a point R. The mystic hexagon theorem states that P, Q, and R will always lie on one 
straight line. 

(By convention, the extensions of parallel opposite sides are presumed to meet at a 
ctitious point on the line, called the point at infinity.) 


F 


Pascal closed his essay by saying, “There are many other problems and theorems, 
and many deductions that can be made from what has been stated above.” The Essai pour 
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les coniques was an announcement of a treatise he was preparing. Mersenne reported 
that in this treatise Pascal had deduced no fewer than 400 propositions on conic sections 
from the mystic hexagon theorem (unfortunately, the work was never published). One of 
the propositions is the construction of the tangent to a conic at a given point; for this, it 
is enough to regard the tangent at a point £ as a line through two coincident points and 
apply the mystic hexagon theorem to these and four other points of the conic. 


Amid the chorus of praise that greeted the Essay, the voice of Descartes was the 
exception. In reply to an enthusiastic letter from Mersenne, he could only grumble, “I 
cannot pretend to be interested in the work of a boy.” The Essai pour les coniques was 
an auspicious beginning for a mathematical career, but it was not Blaise’s habit to think 
long about any particular aspect of mathematics. He soon turned his attention, and limited 
energy, in another direction. 

At the end of 1639, Etienne Pascal was appointed by the government to a high 
administrative post, where his principal task was to straighten out the chaotic tax records 
of the province of Normandy. The sight of his father wearily coping, night after night, 
with endless columns of gures put an idea into Blaise’s head. Because these calculations 
required a machinelike precision, why not turn over this drudgery to its proper agent, 
a machine? The thought of using a mechanical calculator to speed up the execution of 
the arithmetic process had occurred to other scientists. Already, a clumsy device called 
Napier’s rods, or Napier’s bones, was used in some countinghouses. Invented by the 
Scotsman John Napier in 1617, this calculator was similar to a multiplication table with 
movable parts. The rst nine multiples of a unit were inscribed on rods made of wood or 
bone, which were placed side by side, and then matched to produce the desired product 
of numbers. 

For two years Blaise was completely absorbed in designing a clockwork appa- 
ratus that would reduce the two operations of elementary arithmetic—addition and 
multiplication—to the simple movement of gears and shafts. The new contrivance, which 
was called the Pascaline, resembled the desk calculator of the 1940s; a complete rev- 
olution of a numbered wheel communicated a movement of one unit to the wheel of 
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the next higher order. A complex mechanism the size of a shoe box (14 inches long, 5 
inches wide, and 34 inches high), it would add a column of eight gures. 

Because the greatest shortcoming of the “arithmetic machine” was the excessive 
friction of the gears, Pascal spent the next 10 years constructing some 50 models of 
various design and materials. The nal version, for whose manufacture a royal monopoly 
had been obtained, was exhibited in Paris in 1652. Pascal had hopes of enriching himself 
by this invention; but its high price—100 livres was quite a sum—limited its sale and 
rendered the Pascaline more a curiosity than a useful device. The mechanism had no 
immediate successors, nor did it establish a vogue, so that it would be only fair to call 
Charles Babbage (1792-1871), rather than Pascal, the prophet who foresaw the future of 
the computer. 

In 1651, Etienne Pascal died and Blaise was left alone in Paris, unfettered by family 
associations; he is described as being, for the next two or three years, a man of the 
world leading a dissolute life. If he allowed the charms of society to capture him, Pascal 
had in no sense given up mathematics and science entirely. During this “profane period” 
of 1651-1654, he conducted various experiments on the pressure exerted by gases and 
liquids. (Earlier, in 1648, Pascal had demonstrated that the height of a column of mercury 
varied with altitude, by having measurements carried out simultaneously at the top and 
foot of a small mountain.) In this period, he also made numerous discoveries relating 
to the binomial coef cients, which were set forth in his Traité du Triangle Arithmétique 
(1654). And he laid the foundations of probability theory through his correspondence with 
Fermat. More than ever, he felt himself possessed by a zeal for scientic activity which, 
aside from its intrinsic interest, conferred on him a certain renown in the intellectual 
world and in society. But at the age of 31, Pascal experienced an intense religious 
ecstasy, which was to transform his life completely. 

November 23, 1654, was a dividing point in Pascal’s life; on that day, he resolved to 
abandon the study of mathematics to devote himself single-mindedly to religious activity. 
A commonly dismissed story is that his conversion was provoked by an accident on the 
bridge at Neuilly over the Seine. As the tale goes, Pascal was driving in a carriage with 
either four or six horses when the animals bolted. At a place on the bridge that had no 
protective barrier, the runaway horses leaped into the water and would have dragged 
the carriage after them had not the traces miraculously snapped. Pascal is supposed to 
have regarded this lucky escape from death as a symbol of God’s will that he should 
thereafter consecrate his talents to the Christian faith. If we are to believe Pascal, his 
religious crisis was the result of a mystical experience in which God came to him and 
spoke for two hours. On emerging from his vision, Pascal seized a piece of paper and 
wrote at headlong speed an incoherent account of his revelation, copying it afterward 
onto parchment. Both were found after his death, sewed into the lining of his clothing 
near his heart. 

During the years 1655-1658, Pascal composed those works that are at the center of 
his literary importance. At this time, the Catholic Church was involved in a doctrinal 
controversy over the matter of divine grace. The antagonists in the dispute were the 
Jesuits and another group within the Church known as the Jansenists, because of their 
adherence to the beliefs of Cornelius Jansen, Bishop of Ypres. When one of Pascal’s 
friends, embracing the Jansenist cause, was condemned for heresy, Pascal undertook a 
vigorous defense. In a series of 18 satirical pamphlets, under the title A Letter Written to 
a Provincial by One of His Friends, Pascal displayed the moral teachings of the Jesuits 
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Pascal’s Essay pour les Coniques (1640). (From A Concise History of Mathematics by Dirk 
Struik, 1967, Dover Publications, Inc., N.Y.) 


in the worst light. Written under the pseudonym Louis de Montalte and addressed to 
a nonexistent friend in the country, these Provincial Letters were published secretly at 
great risk to printer and author. (In seventeenth-century France, Catholicism was the 
state religion and ridiculing sacred things was a political crime.) Yet the Letters caused 
a sensation, nding their way into every corner of France. The number of copies run 
off was enormous for the time (12,000 for the rst letter printed in 1656) and the 
readership estimated at a million. In time, the controversy attending the Provincial Letters 
was forgotten, leaving only the clarity of Pascal’s thoughts and the persuasiveness of 
his ideas. 

At the height of the popularity of the Provincial Letters, Pascal suddenly abandoned 
the world of clandestine journalism to write an apology for the Christian religion. How- 
ever, his always fragile health began to fail steadily in 1658; he died four years later at 
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The calculating machine built by Pascal in 1642. (Extract taken from A History of 
Science, Technology and Philosophy in the 16th and 17th Centuries, by A. Wolf: Reproduced by kind 
permission of Unwin Hyman Ltd.) 


the age of 39, before the book was completed. The accumulated notes in which he had 
jotted down his thoughts in preparation for this monumental project were subsequently 
collected and published as the Pensées de M. Pascal sur la Religion. As the Provincial 
Letters must be considered a masterpiece of polemical religious writing, so the Pensées 
must be regarded as a vindication and exaltation of faith. These works remain classics 
for their literary value in an age when their content has ceased to be important. Their 
inuence on the literature of France and of the world was permanent. 


Pascal and the Cycloid 


For only one brief period, toward the middle of 1658, did Pascal’s thoughts revert to 
mathematics. This was in connection with a curve that was enjoying a considerable vogue 
in its own special world, the cycloid (the curve traced out by a point on the circumference 
of a wheel, as the wheel rolls along a straight line). On account of its elegant properties 
and the endless quarrels that is has engendered among eminent mathematicians who 
have pretended to its favour, the cycloid has been called the Helen of Geometry. The 
accompanying gure shows a point P ona cycloid determined by three different positions 
of the circle. 
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It happened that one night Pascal was unable to sleep owing to the tortures of a 
violent toothache. To take his mind off the pain, he began to re ect about the cycloid; 
after a time he realized that his tooth had ceased to ache. Taking this as a sign of divine 
permission to continue, Pascal worked furiously for eight days to solve anew many of 
the problems of the cycloid, such as the area under one arch or the volume of the solid 
obtained by revolving the curve about the base line. The solutions were later published 
in a brief History of the Cycloid and in four Letters written under the pseudonym Amos 
Dettonville (an anagram on the name Louis de Montalte, which Pascal had made famous 
through the Provincial Letters). Perhaps he did not want the world to know that he had 
lapsed from grace and engaged again in the science that he had once renounced. 

Although not publicly available, the solution of the quadrature of the cycloid curve 
had been achieved some years before Pascal by Gilles Personne de Roberval (1602— 
1675), professor of mathematics at the Collége Royal in Paris and a member of the 
circle that gathered around Mersenne. The publication of his proof seems to have been 
delayed until 1693, when his Traité des Indivisibles appeared. Roberval’s secrecy can 
perhaps be attributed to the customs governing the triennial renewal of the professorial 
chair he occupied. The renewal was made on the basis of an open mathematical com- 
petition, the questions for which were propounded by the incumbent. By not disclosing 
his methods, he could have provided himself with a favorable set of questions for the 
coming mathematical contests. (Roberval was indeed proclaimed winner of the 1634 
competition, and held his of ce for the rest of his life.) But this practice also resulted in 
loss of credit to Roberval for priority of discovery. 

To determine the area under half of one arch of the cycloid traced out by a circle of 
diameter OC = 2a, Roberval devised the following procedure: For each point P on the 
cycloidal arch, extend the semichord DF to a point O where DF = PQ. The locus of O 
is a new curve OQB, which Roberval called the “companion” to the cycloid curve. 


Cc B 


O A 


Roberval rst proves that the companion curve divides the rectangle OABC into two 
equal pieces, each of which has area zra’. Indeed, the rectangle as a whole has altitude 
2a and a base whose length is equal to the semiperimeter za of the generating circle. Also 
all the horizontal lines that make up the region between the cycloid and its companion 
(shown shaded) correspond with, and are equal to, all the lines of the semicircle OFC. 
Roberval argues that the “sum of these lines” in this gure must therefore yield the area 
of the semicircle, namely, 57a. It follows that 


OABP = OPBO + OABO = $na? + ma* = 31a’. 


Upon doubling, the area under an arch of the cycloid is found to be three times that of 
the generating circle. 

The work of Pascal on the cycloid had one by-product of surprising importance: It 
served as the immediate inspiration for Leibniz in his invention of the differential and 
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integral calculus. One of Pascal’s Letters, entitled Traité des sinus du quart de circle 
(1658), involved certain calculations that resemble the evaluation of the de nite integral 
of the sine function. Pascal was, however, unable to hit upon the crucial point that would 
give a general theory of integration. Leibniz later wrote how when he was reading the 
Traité a light suddenly burst on him and then he realized what Pascal had not realized. 
(in 1703, in a letter to the mathematician James Bernoulli, he remarked that Pascal 
sometimes seemed to have bandages over his eyes.) Perhaps if Pascal had not died so 
young, or if he had not abandoned mathematics for the religious life, he would have had 
the honor accorded Newton and Leibniz for the discovery of the calculus. 


De Méré’s Problem of Points 


The year 1654 is a landmark in the history of probability theory; indeed, it is usually 
called the date on which this science began. Bothered by certain problems in gambling, 
a member of the French nobility, the Chevalier de Méré, sent some questions to Pascal. 
His interest aroused, Pascal turned in correspondence to Pierre de Fermat, the leading 
mathematician of the day, and in the active interchange of letters that ensued, both men 
produced the basic results on which the subject now rests. 

The initiator of this correspondence, Antoine Gombaud, Chevalier de Méré (1607— 
1684), was an interesting character. A soldier with a brilliant record, a linguist and 
classical scholar, he had traveled in England, Germany, Spain, and even America. The 
perfect courtier, de Méré became the self-appointed mentor of society, devoting his life 
to the vocation of teaching good manners. A favorite guest in the great chateaux, he 
was acquainted with most of the leading mathematicians of the period, including Pascal. 
Because de Méré was not a member of the continuance of what had been Mersenne’s 
circle, they may have met in less academic surroundings—probably at dicing, which was 
a fashionable diversion in the drawing rooms of the seventeenth century. Pascal was 
then at the beginning of his so-called worldly period, and de Méré took some credit for 
having brought it on. 

De Méré made a precarious living at cards and dice, wagering according to a pri- 
vate arithmetic of probabilities. Although not an accomplished mathematician, he knew 
enough about the subject to pose some interesting problems for which he got the ideas 
at his customary gambling resorts. That he had a good opinion of his own mathematical 
talent is clear from an excerpt of a letter written to Pascal sometime after 1656: “You 
must realize that I have discovered in mathematics things so rare that the most learned 
among the ancients have never thought of them and which have surprised the best mathe- 
maticians in Europe.” Leibniz, on the other hand, called him “intelligent but half-learned 
and half-comprehending.” 

One of de Méré’s gaming problems gave Pascal the occasion to begin his now 
famous correspondence with Fermat. In a letter dated July 29, 1654, Pascal wrote: 


I haven’t time to send you a solution of a dif culty which has puzzled M. de Méré. He has 
good intelligence, but he isn’t a geometer and this, as you realise, is a bad fault. 


A little later in the same letter Pascal reported on de Méré’s views: 


The Chevalier de Méré said to me that he found a falsehood in the theory of numbers for 
the following reason. If one wants to throw a six with a single die, there is an advantage 
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in undertaking to do it in four throws, as the odds are 671 to 625. If one throws two sixes 
with a pair of dice, there is a disadvantage in having only 24 throws. However, 24 to 36 
(which is the number of cases for two dice) is as 4 to 6 (which is the number of cases on 
one die.) This is the great “scandal” which makes him proclaim loftily that the theorems are 
not constant and Arithmetic is self-contradictory. 


De Méré’s reasoning in the passage just cited seems to be this. A perfect die has six 
faces, marked with from one to six dots; any one of these faces is equally likely to show 
up on a single cast, because nothing in the shape of the die or method of throwing it 
favors a particular face. Thus, the probability that a six will appear in a single throw is Z. 
This might be restated by saying that the probability of not getting a six is 2. If the die 
is thrown twice, there are 6 - 6 possible outcomes and 5 - 5 of them do not produce a six 
either time, so that the probability of not getting a six in two throws is (2). Continuing 
the argument, it can be concluded that in n throws the probability of not getting a six 
is (2)". Hence, the opposite situation, having a six turn up at least once, occurs with 


probability 
(7 ) n 
1-—[-]. 
6 


To have a better than even chance, one must have 


S\ all 
b= =) Ss. 
6 2, 
It turns out that when n = 4, 


(3 ‘ ,_ £25 _ 67) 

(z) 7 1296 1296’ 
which is greater than 5. Thus, de Méreé could pro tably bet on throwing a six at least 
once in four throws with a single die. 

When throws with two dice are considered, the computations become more laborious. 
Suppose we paint one die red and the other green so that we can tell them apart. Let 
(a, b) denote the event in which the red die shows an a and the green die shows a b. Then 
all possible outcomes for a throw of these colored dice can be exhibited in a six-by-six 
array: 


(1.1) (1.2) (1.3) (1.4) (1.5) (1.6) 
(2.1) (2.2) (2.3) (2.4) (2.5) (2.6) 
3.1) G.2) (G3) G4) 6.5) (.6) 
(4.1) (4.2) (4.3) (4.4) (4.5) (4.6) 
(5.1) (5.2) (5.3) (5.4) (5.5) (5.6) 
(6.1) (6.2) (6.3) (6.4) (6.5) (6.6) 


Each of the combinations in which the faces have different values is counted twice, 
because they represent different throws. For instance, the (1, 2) that corresponds to a one 
on the red die and a deuce on the green die is a different throw from the reverse order 
(2, 1). A direct count shows that there are 36 possible outcomes in casting a pair of dice. 
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De Méré inquired into the number of rolls necessary to ensure a better than even 
chance of getting a double six. He thought that the smallest advantageous number of 
tosses should be 24, but patient observation had shown him that 25 throws were required. 
The answer 24 is gotten by an ancient but mistaken gambling rule that runs as follows: 
Consider a situation in which there is one chance in N of winning on a single trial, and 
let n be the number of trials required to have a better than even chance of success. De 
Mére’s “rule” is that n/N is a constant. In the case in which one attempts to make a six 
with a single die, N = 6 and n = 4, so that n/N = z. Hence if N = 36, as for two dice 
in tosses aiming at a double six, m must be 24. 

Pascal had little dif culty in disposing of de Méré’s “great scandal.” In the case of 
two dice, there are 36 possible throws and 35 of these do not give a double six; hence, the 


probability of not getting two sixes is 3. If the dice are cast n times, then the probability 


of no throw of double six is (33)", This means that the opposite event, getting double 


sixes at least once, occurs with probability 


i-(2) 
36 


When n = 24 and 25, it is found that 


Therefore, 25 is the smallest number of rolls of two dice that offers a probability greater 
then 4 of achieving a double six. 


The arithmetic triangle, now generally known as Pas- 


Pascal’s Arithmetic Triangle cal’s triangle, is an in nite numerical table in “triangular 
form,” where the nth row of the triangle lists the succes- 
The Traité du Triangle sive coef cients in the binomial expansion of (x + y)". 
Arithmetique 

(« + y)° 1 Row 0 

(x+y)! 1 1 Row 1 

(+y/ 1 2 1 Row 2 

(x+y) 1 3 3 1 Row 3 

(x+y) 1 4 6 4 1 Row 4 

(x+y)p 1 5 10 10 5 1 Row 5 


Thus, for example, the numbers 1,5, 10, 10,5, and 1 in our Row 5 are the coef cients 
in the expansion of (x + y)°. The beginnings of the binomial theorem were found rela- 
tively early in the development of mathematics—the identity (x + y)? = x*+2xy + y 
occurred, in geometric language to be sure, in Book II of Euclid’s Elements. Therefore, 
tables of binomial coef cients appeared in one form or another long before the publica- 
tion of Pascal’s famous Triangle Arithmétique. It is largely an accident of history that the 
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arithmetic triangle should be named after Blaise Pascal rather than for one of his many 
anticipators in this arrangement of numbers. 

A triangular arrangement of binomial coef cients through the eighth power, written 
in rod numerals and a round zero sign, is found in the treatise The Precious Mirror 
of the Four Elements, written by the Chinese mathematician Chu Shih-Chien in 1303. 
But a knowledge of the triangle is probably much older, for Chu spoke of the diagram 
as an ancient method that was not his. Chinese references indicate that the triangle 
appeared in a work of Chia Hsien about 1050. Indeed, a similar arrangement of binomial 
coef cients was known to the Arabs about the same time that it was being used in China. 
Omar Khayyam (circa 1050-1130), known in the West as the author of the Rubaiyat, 
referred in On Demonstrations of Problems of Algebra and Almucabola to a lost work 
in which he dealt with the arithmetic triangle. Another Persian mathematician, Al-Tusi 
(circa 1200-1275), in a work called Collection on Arithmetic by Means of Board and 
Dust, approximated the value of the square root /a” +r by a+r/[(a + 1)” —r], where 
the denominator was calculated by the binomial expansion. For this purpose, Al-Tusi 
furnished a table of binomial coef cients in triangular form up to the twelfth power. 
Thus, the so-called Pascal triangle, like the so-called Pythagorean theorem, is in reality 
the product of a much earlier Eastern culture. 

The rst triangular arrangement of the binomial coef cients to be printed in European 
books appeared on the title page of the Rechnung (1527) of Peter Apian (1495-1552). 
Apian, a professor of astronomy at the University of Ingolstadt, is interesting if only 
because he taught in the Germanic tongue at a time when the prevailing custom was to 
use Latin. Although the arithmetic triangle had not previously been described in the West 
until depicted in Apian’s text, it seems to have been discovered almost simultaneously 
by several authors of the 1500s. Michael Stifel (1486-1567) in his Arithmetica Integra 
(1544) calculated the device as far as the seventeenth line though, as he pointed out, 
there was no reason to stop there. Stifel’s diagram took the form 

1 


21 35 35 

28 56 70 

36 84 126 126 
0 45 120 210 252 


FP OCOMONADMN PWN 
— 
Nn 
N 
=) 


where each column after the rst starts two places lower than the preceding one. 
Other schematic arrangements for the binomial coef cients were given by the old ri- 
vals Tartaglia and Cardan. Tartaglia in the Generale Trattato (1556) gave the numbers 
in the triangle through the eighth line, claiming the idea as his own invention. Cardan, 
in the work Opus Novum de Proportionibus (1570), presented a table to 17 lines, citing 
Stifel as the original discoverer. 

Although Pascal was not the originator of the arithmetic triangle, being nearly the 
last of a long line of “discoverers,” his name is forever linked with the triangle because 
he was the rst to make any sort of systematic study of the relations it exhibited. The 
merits of Pascal’s work in this regard are enough to justify the use of his name. The 
printing of Pascal’s Traité du Triangle Arithmétique was  nished toward the end of 


458 


Chapter 9 The Development of Probability Theory: Pascal, Bernoulli, and Laplace 


1654 (Fermat received his copy sometime before September of 1654), but as Pascal had 
withdrawn from worldly matters, it was not generally distributed before 1665. The work 
is an exposition of the properties and relations between the binomial coef cients, and it 
includes a few general probability principles. In particular, in a section entitled Utilization 
of the Arithmetic Triangle to Determine the Number of Games Required Between Two 
Players Who Play a Large Number of Games, Pascal applied the arithmetic triangle to 
the problem of stakes in games of chance. 

In the Triangle Arithmétique, Pascal did not write the arithmetic triangle from the 
top down as we now do, but instead expressed the table as shown: 


36 


Parallel ranks 


Perpendicular ranks 


The numbers on the mth upward-sloping diagonal in this arrangement give the coef cients 
in the expansion of the binomial (x + y)’~!. Thus, the gure that modern texts call 
Pascal’s triangle differs from the triangle examined by Pascal himself by a rotation 
through 45°. In his scheme, Pascal called positions in the same vertical column cells 
of the same perpendicular rank, and those in the same horizontal row cells of the same 
parallel rank. Cells in the same northeast-running diagonal were said to be cells of the 
same base. 

Pascal observed that his table could readily be enlarged by af xing further numbers, 
obtained without recourse to the binomial theorem. Once the Is in the top line and left- 
hand column have been set down, any other entry is the sum of the number directly 
above the entry and the number immediately to the left of the entry. As he put it: “The 
number of each cell is equal to that of the cell which precedes it in its perpendicular rank, 
added to that of the cell which precedes it in its parallel rank.” For instance, underneath 
the 5 in the second row and alongside the 10 of the third, a new number can be placed; 
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this new entry is 15, the sum of 5 and 10. Pascal had no claim to originality here, for this 
generating rule had been accurately explained by several earlier European and Arabic 
“discoverers” of the arithmetic triangle. 

When the arithmetic triangle is written in its present-day form, 


1 
1 1 
1 2 1 
1 3 3 1 
1 4 6 4 1 
1 5 10 10 5 1 


it is equally easy to obtain the next horizontal row of numbers from any given row; each 
entry in the interior of the triangle is the sum of the numbers nearest (diagonally) to it in 
the row above. According to this rule of addition, the next horizontal row of the triangle, 
which has not been Iled in, would consist of 


1, 14+5=6, 5410=15, 10+10=20, 104+5=15, 54+1=6, 1 


Knowing how to write down the arithmetic triangle permits quick computation of (x + y)” 
for small values of m. For instance, our newly added row furnishes the coef cients for 
(x + y)®, so that 


(x + y)® = x9 + 6x5 y + 15x4y? + 20x73 y? + 15x7y* + Oxy? + y®. 


Straightforward expansions of (x + y)" had been known long before Pascal’s time, but 
these were obtained by direct multiplication, not by applying any recursion formula for 
the coef cients. 


Custom has settled on (") as the standard symbol for the coef cient of x”~"y" in 
r 


the expansion of (x + y)", so that we can write 


(x + y = n xt n xn! a n xn? Pr he ey 
Jp = 0 1 y ) y 
n n—-1 n n 
- xy + y". 
n—1 n 


Euler designated the binomial coef cients by ( *) in a paper written in 1778, but not 
r 


published until 1806, and used the device [=| in a paper of 1781, published in 1784. The 
r 


notation ( ), which was to become the more common convention, appeared about 1827. 
la 


Stated in this symbolism, the property of the numbers (") on which the arithmetic 
r 


(2)+C)-C7) 


This rule, of course, does not give a formula for the binomial coef cients but tells us 
only how to build them from two numbers on the previous base. It would be desirable 


triangle is based is 
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to have a computation scheme that is independent of such earlier knowledge. Contained 


within Pascal’s treatise there is an explicit formula for ( ) to wit, 
r 


n\ n(n—V(n—2).-G@—r+)) 
(")- [2-3 au7 FT 


If we use the space-saving notation m! as an abbreviation for the product of the rst 
n integers, then when | <r <n-—1, the foregoing formula can be expressed more 


simply as 
n\ _ n} 
rr) r\n—r)l 


Of special interest to us is the symbol 0!, which we shall de ne to have the value 1. With 
n! 


—r)! 


this convention, the expression represents ( ) even for r = 0 andr = 2. For 
r r 


\(n 


in these cases, 


(‘) n! (") n! 
= | = —— and = | = ——. 
0 Ol(m — 0)! n n\(n —n)! 


The symbol n!, called “n factorial,’ was rst introduced by Christian Kramp of 
Strasbourg in his Elements d’Arithmétique Universelle (1808) as a convenience to the 
printer. It did not gain immediate acceptance, and the alternative symbols |m and x(n) 
were used until late in the nineteenth century. Some English textbooks suggested the 
reading “n-admiration” for n!, because the exclamation point ! is a note of admiration. 

Pascal listed 19 properties, or consequences as he called them, of the binomial 
coef cients that could be discovered from the arithmetic triangle. Among them are the 
identities 


n n—1 n—2 n—3 r-1 
(")=( )+( )+( Jae( ) (Consequence 1 
r r—-1 r—-1 r—1 r—1 
n n 
( )=( ) (Consequence V) 
r n—-r 
n n n n 
= (S)+()+(S)t--+(%). (Consequence VIII) 
0 1 2 n 


Loosely interpreted, these tell us that each number in the triangle is the sum of the 
entries in the preceding “northeast-running” diagonal, beginning with the entry above 
the given number; that the numbers in any base are symmetric with respect to the middle 
term or terms; and that the sum of the numbers in the mth base (that is, the sum of the 
coef cients in the expansion of (x + y)") equals 2”. A less obvious relation developed 
by Pascal involves the ratio of successive terms in any base of the triangle. Consequence 
XII amounts to the statement 
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though Pascal had no symbols like ” available to him. He formulated this arithmetic 
FE 


fact in words, instead of symbols, as 


In every arithmetic triangle, if two cells are contiguous in the same base, the upper is to the 
lower as the number of cells from the upper to the top of the base is to the number of those 
from the lower to the bottom, inclusive. 


Consequence XII is of historical importance because its proof involves the rst ex- 
plicit formulation of the demonstrative procedure known as induction. All in all, it would 
perhaps be best for us to use the term “mathematical induction,” for the word “induction” 
by itself conveys an entirely erroneous idea of the method. It really has nothing to do 
with induction in the sense commonly understood in the experimental sciences, where it 
signies drawing a general conclusion from particular cases. Mathematical induction is 
not a method of discovery but a technique of proving rigorously what has already been 
discovered. 


Mathematical Induction 


Although the method is too well-known to need an elaborate explanation here, let 
us brie y examine the principle of mathematical induction. It is perhaps the single most 
useful tool in the mathematician’s kit. A theorem provable by mathematical induction 
involves a proposition—a formula, a statement of equality or of inequality, or the like— 
about an integer n that is to be shown true for all positive integers n, or at least for 
all integers greater than or equal to some one integer mo. Such a proposition is usually 
denoted by the symbol P(n), allowing us to speak of individual cases such as P(1) or 
P(10) corresponding to the integers | and 10. The proof involves two stages. The rst 
part establishes that the proposition P(7) is true for the lowest meaningful value of n, say, 
n = ng. The second part of the process consists in constructing a proof of the proposition 
P(k + 1) based on the assumption that P(A‘) is true. The rigorous veri cation that P(k) 
implies P(A + 1) is a purely deductive procedure using logic and known mathematical 
facts. Because of the “hereditary nature” of P(n), we know that P(no + 1) is true because 
P(no) is true. Then because P(np + 1) is true, P(mp + 2) is also true, and so on, until 
all the positive integers n > no have been exhausted. 

To recapitulate, the proof of a proposition P(n) by the mathematical induction con- 
sists in showing that 


1. The statement P(no) is true for some particular integer no. 
2. The assumed truth of P(x) implies the truth of P(A + 1). 


We can then conclude that the proposition P(m) is true for all n > 19. The proof of 
condition | is usually called the “basis for the induction,” and the proof of condition 2 is 
called the “induction step.” The assumptions made in carrying out the induction step are 
known as the “induction hypotheses.” Induction has been likened to an in nite row of 
dominoes, all standing on edge and arranged in such a way that when one falls it knocks 
down the next in line. If either no domino is pushed over (if there is no basis for the 
induction) or the spacing between them is too large (the induction step fails), then the 
complete line will not fall. 
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Title page of the arithmetic of Peter Apian, showing the Pascal triangle. (From Rara 
Arithmetica by David Eugene Smith, published by Chelsea Publishing Company in 1970.) 


In a typical illustration of mathematical induction, let us verify the assertion that 
the sum of the cubes of the rst m positive integers is equal to the square of the nth 
triangular number. The appropriate statement P(n) in this case is 


1 2 2 1% 
PED +3 peta a [MOD =r 
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We shall show that P(1) is true, and show how from P(k) we can prove P(k + 1). The 
veri cation of P(1) presents no dif culty, since it says that 

12 4 pe 
~ 4 


We now go to the so-called induction step. Suppose that P(x) is true, so that for this 
value of k, 


p=1 


_ PE+1? 
=<. 


To conrm that P(k+ 1) holds, we simply add the term (k + 1)° to both sides of the 
last equation, to obtain 


PL atau? 


2 2 
P4243 4---+h +(k+1% = eer set 

ke + IPP + 4k + DI 

> 4 

(k +1 (k + 2) 

= F , 
which is just the statement P(k + 1). Notice that in this calculation we merely assumed 
P(k) to be true to show that it would guarantee the truth of P(k+ 1). We did not 
concern ourselves with the actual validity of P(x‘), but only with the fact that P(A + 1) 
is a consequence of the assumed truth of P(x). Having thus veri ed conditions | and 
2, we see that P(n) must hold for all positive integers according to the principle of 
mathematical induction. 

Although Pascal gave an eminently satisfying explanation of mathematical induction, 
the gift of the idea of “reasoning by recurrence” can be found earlier, in the work of 
Francesco Maurolico (Franciscus Maurolycus). Still, Pascal was the rst to recognize the 
value of this logical process and, through his Triangle Arithmétique, brought it into the 
common domain of the working mathematician. Fermat gave no evidence in his writings 
that he knew of the technique, although his method of in nite descent may be described, 
loosely, as “reverse induction.” 


Francesco Maurolico’s Use of Induction 


Francesco Maurolico (1494-1575) is generally acknowledged to have been one of the 
foremost mathematicians of the sixteenth century. Born in Sicily of Greek parents, he was 
an ordained priest, at one time an abbot, and for many years a professor of mathematics 
at Messina. Except for short stays in Rome and in Naples, he lived his whole life in 
his native Sicily. Unfortunately, most of Maurolico’s books were not published until 
after his death and hence came too late to have much in uence on the sixteenth-century 
renaissance of mathematical thought. For instance, of his treatises on the treasures of 
classical mathematics—a text on the Conics of Apollonius and a collection of writings 
of Archimedes—the works of Archimedes did not appear until 1685, the rst edition of 
1594 having been lost at sea. In Maurolico’s time, only the rst four books of Apollonius’s 


464 


Chapter 9 The Development of Probability Theory: Pascal, Bernoulli, and Laplace 


Conics had survived in the original Greek. He tried to reconstruct the then missing Book 
V on maxima and minima from the brief references that the great Greek geometer had 
provided in the preface to the entire work. (The reconstruction of the lost works of 
classical antiquity, and of the last four books of the Conics in particular, was something 
of a vogue in Renaissance Italy.) Maurolico’s reconstruction of the Conics was completed 
in 1547, but not published until 1654, by which time several other versions had appeared. 

Maurolico, one of the great geometers of his time, also occupied himself with a 
geometric analysis of lenses, spherical mirrors, and the human eye. The chief record 
of his optical research, Photismi de Lumine et Umbra (1611) anticipated many of the 
ndings in Kepler’s own Astronomiae Pars Optica of 1604, although Kepler seems to 
have been unaware of his predecessor’s work. Maurolico came remarkably close to the 
correct explanation of the origin of the rainbow. 

The greatest number of Maurolico’s mathematical writings are gathered together 
in his Opuscula Mathematica and Arithmeticorum Libri Duo, which were published as 
companion pieces at Venice in 1575. In the Pythagorean tradition, the Arithmeticorum 
contains a variety of the properties of integers noted by Nicomachus in the Jntroduc- 
tio Arithmeticae, to which Maurolico added his own simple and ingenious proofs. For 
example, his Proposition 11 shows that the mth triangular number plus the preceding 
triangular number is equal to the mth square number. Expressed in modern notation, this 
is t, + t)—1 = Sn, Where ¢, is represented by the formula 


_ a(n+ 1) 


Maurolico’s simple proof is this: By the rule of formation of triangular numbers, ¢, = 
tn-1 +n. Adding ¢,—-; to both sides of this equation gives 


th =1+2434---4+7 


ty + ty—1 = 2tp-1 +n 
=(n—lntn=n’=Sy, 


which was the relation to be established. 

The rst result in Maurolico’s Arithmeticorum that is actually proved by mathemat- 
ical induction is the proposition that the sum of the rst m odd integers is equal to the 
nth square number; in present-day symbols, this is 


14+34+54+7+---+(Qn-l=n’. 
Maurolico’s proof is essentially translated as follows: 


By Proposition 13 [this proposition asserts that n? +(2n + 1)=(n+1)?] the rst square 
number | added to the following odd number 3 makes the following square number 4; 
and this second square number 4 added to the third odd number 5 makes the third square 
number 9; and likewise the third square number 9 added to the fourth odd number 7 makes 
the fourth square number 16; and so successively to in nity the result is demonstrated by 
repeated application of Proposition 13. 


This is a clear case of mathematical induction, with Maurolico’s Proposition 13 
providing the justi cation for the induction step. 
Let us put the proof in the format that we have used earlier. Here 


14345474+-:-+(2n-l=n’ 
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is the statement P(n). First observe that P(1) is true, since 1 = 1°. Now, to obtain 
P(k +1) from P(k), assume that 


14345474+---+(2k-1l =P. 
Adding the next odd integer 2k + 1 to both sides gives 
14-345474+---+(2k-1)+Qk4+1) =P +(2k+1 
=(k+1), 


which is just P(A + 1). Therefore, the assumed truth of P(x) implies the truth of P(k + 1). 
It follows that P(7) is true for all positive integers n. 

Pascal was, as far as we know, the next person to use mathematical induction, and he 
did this repeatedly in the Triangle Arithmétique in connection with the arithmetic triangle 
and its applications. The induction proofs of Maurolico were presented in a somewhat 
sketchy style, but Pascal’s followed more nearly modern lines. Although nowhere in 
the treatise was Maurolico’s name mentioned, it is clear that Pascal was familiar with 
that part of the Arithmeticorum in which Maurolico used the new logical process. In a 
well-known letter from Pascal (written under the pseudonym Amos Dettonville) to the 
mathematician Carcavi, Pascal referred to Maurolico for a proof of the proposition that 
“twice the nth triangular number minus n equals n*.” He then said, “Cela est aise par 
Maurolico.” (“That is easy, using Maurolico’s [method].”) 

Contained in the proof of Pascal’s Consequence XII of the Triangle Arithmétique, 
the result that we would write as 

n 
(") r+l 


n nr’ 
r+l1 


there is an explicit formulation of the principle of mathematical induction. The rst part 
of Pascal’s argument reads as follows: 


Although this proposition has an in nite number of cases, I will give a rather short 
demonstration by supposing two lemmas. 

The rst one, which is self-evident, is that this proposition occurs in the second base; 
because it is apparent that @ is to o as | is to | [that is, form = 1, 


(0) ox 
oo 


The second one is that if this proposition is true for an arbitrary base, it will necessarily 
be true in the next base. From which it is clear that it will necessarily be true in all bases, 
because it is true in the second base by the rst lemma; hence by means of the second 
lemma, it is true in the third base hence in the fourth base, and so on to in nity. 

It is therefore necessary only to prove the second lemma. 


Pascal described here precisely the logical process now called mathematical induction. 
It is interesting to translate Pascal’s proof of his second lemma into modern notation, 

for as he said, “This proof is based only on the assumption that the proportion occurs in 

the preceding base, and that each cell is equal to its preceding plus the one above it.” 
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As indicated in the statement of the lemma, we assume that the ratio in question 
holds in the kth case. That is, for all permissible values of r(r = 0,1,2,...,4— 1), 


suppose 
k 
r r+1 


( k\ k-r 
r+l1 


On the basis of this assumption, it is to be proved that 


k+1 
r r+1 


k+1\ (k+1)-r' 
r+l1 

We start by using the rule of formation of the numbers in the arithmetic triangle (namely 

that each entry is the sum of two numbers in the preceding base), to write 
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If the induction hypothesis is now applied to the quotients of binomial coef cients that 
appear on the right-hand side of this equation, it follows that 


a an — 


() 


(**') 
r I+[/A+1—-r)) — &+)/K+1-r) r+ 


(1) ~~ (k-n/e+D]4+1  (k&+D/7+)D  &k4+1-r’ 
r+1 


which completes the induction step. Thus, the validity of the desired ratio for a certain 
value of n, namely, n = k, implies its validity for 4 + 1. This is just what Pascal’s second 
lemma asserts. 

Maurolico and Pascal had no special designation for the technique of reasoning by 
recurrence. The earliest mathematician who appeared in the xing of a name to this 
process of argumentation was John Wallis; in his Arithmetica Infinitorum (1656), he 
found per modum inductionis the ratio of the sum of the squares 0,1,4,9,..., n? to the 
product n?(n + 1). Augustus De Morgan’s article Induction (Mathematics) in the Penny 
Cyclopedia (1838) suggested the name successive induction for the method, but at the 
end of the article De Morgan incidentally referred to it as mathematical induction, which 
is the rst use of the term. The expression complete induction attained popularity in 
Germany after Dedekind used it in a paper in 1887. In the present century, the name 
mathematical induction has gained complete ascendancy over other descriptive terms. 


From this, we arrive at 
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7. Show that the coef ci ent of x” in the expansion of 


9.2 Problems : Fi 
(x+ +) is a 


8. Find the coef cient of the middle term in the 


1. Prove by mathematical induction each of the 
following identities: 


n(in+1 ; = 2n 
i: 14-49494.04%= ( ) expansion of (x — 1/x)*”. 
: 9. (a) Forn > 1, use induction to prove that 
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(c) 1-242-343-44.---+n(n+)D= 0 1 2 np 
n(n + In + 8 (b) From part (a), conclude that the sum of the 
3 . numbers in any base of Pascal’s triangle is 
(d) 1224+3245?+.--4(2n-1P = . twice the sum of the numbers in the preceding 
base. 
e) 14+2-243-2?4.--4+n27 1 = 
(e) aie 10. For n > 1, derive the following identities involving 
1 1 , 1 1 the binomial coef cients: 
” Ta Sa oa Gea honeD (a) (1) +2(5) #3(5) +--+ 0(") = nar. 
n 1 2 3 n 
~ In+1° [Hint: The left-hand side equals 
2. Use mathematical induction to prove that for n > 1, 
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Consider the identity 0 1 2 
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x — ye = x*(x — y)+ y(x* — y*)] +(" )| for n > 2.] 
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3. Prove each of these divisibility statements by "4 "4 - i 
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4. Use mathematical induction to verify the formula - z 
-()()-Ge- 
1(1!) + 272!) 4+ 38) +---+aM!)=(n4+ D!I- 1 1 3 5 
rales i [Hint: Use part (b) and Problem 9(a).] 
; 1 (/n 1 /n (-1)" [n 
5. Prove the following: d a ee = yes 
© (au) se) a) 
(a) 2” <n! foralln > 4. 1 
(b) n° <2” for all n > 10. = aa [Hint: The left-hand side equals 
n 


6. Prove that for n > 1: 
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11. Prove that for n > 1: 


12. 
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if and only if0 <r < s(n — 1). 
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if and only ifn —1>r > 3(n—1). 
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if and only if 7 is an odd integer and 


r=i(n—1). 


Creating an analogy with Pascal’s arithmetic triangle, 
Leibniz formed the harmonic triangle: 
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The Bernoullis and Laplace 


Christiaan Huygens’s Pamphlet 
on Probability 


The right-hand edge consists of the reciprocals 
of the positive integers. (These are the so-called 
harmonic numbers.) Each number not on this edge is 
the difference of two entries, one diagonally above 
and to the right, and the other immediately to the 
right of the number in question. Thus, 

1 1 1 1 1 
0 1 30 4 5 

(a) Write the next horizontal base of the harmonic 
triangle. 

(b) Prove that the numbers in the nth horizontal 
base of this triangle are the reciprocals of the 
numbers in the corresponding base of the 
arithmetic triangle divided by n + 1, that is, if 
("] denotes the rth entry in the nth base of the 
harmonic triangle, 


1 
Garces a, Osrs 


[Hint: Use mathematical induction on n and the 
recursion formula 


Oe C]-Cail 


(c) Show that the harmonic triangle is symmetric 
by verifying that 


Ci forO<r <n. 


In the seventeenth century, the Dutch 
Republic—a league of provinces of northern 
Netherlands—was the wealthiest and most 
prosperous nation in Europe. A major part of 
the country’s economic success resulted from 
its dominance in overseas trade; a_ eet of 


new Dutch ships was specially designed for economy, carrying maximum cargo with a 


small crew. 


The religious and intellectual freedom to be found in the Dutch Republic distin- 
guished it from other European societies of the time. Although the Calvinist church was 
dominant, those of other persuasions were free to practice their faith. Descartes, fearing 
the censure of the Catholic Church, chose to live for 20 years in the Dutch Republic 
where he was able to publish the Discours de la Méthode pour bien conduire sa Raison. 
The book trade in general was carried on with relative freedom. Amsterdam, with its 
host of printers (some 90 of them in 1650) became the chief center of publishing on 
the continent. Many works were produced there for export. Simon Stevin’s Die Thiende, 
was translated as Disme: The Art of Tenths, or Decimall Arithmetike. This was the _ rst 
appearance of the word “decimal” in print. 
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Christiaan Huygens 
(1629-1695) 


(Extract taken from A History of Science, Technology 
and Philosophy in the 16th and 17th Centuries, by A. 
Wolf. Reproduced by kind permission of Unwin Hyman 
Ltd.) 


The Dutch Republic was not without its own universities. The oldest was Holland’s 
Leiden University, founded in 1575, with those in Groningen and Utrecht being estab- 
lished in 1614 and 1616, respectively. Their renown attracted a substantial number of 
students from other countries, among them Henry Oldenburg. A leading Dutch scienti c 
gure was Antoni van Leeuwenhoek, a producer of superior microscopes, the best of 
which achieved magni cation up to 270 times. This pioneering microscopist made the 
rst observations of bacteria. Because the country had no scientic organization compa- 
rable to the Royal Society of London or the Académie des Sciences, van Leeuwenhoek 
tended to communicate the results of his research through letters to these bodies. 

The leading continental scientist of the late seventeenth century was Christiaan Huy- 
gens (1629-1695), a native of the Hague. Taught by tutors at home until the age of 16, he 
attended the University of Leiden from 1647 until 1649. At Leiden he learned advanced 
mathematics from Franz van Schooten. He subsequently spent two years in the study of 
law at the short-lived College of Orange in Breda. An allowance from his well-to-do 
father allowed the young Huygens to devote himself to a life of scientic enquiry. By 
1666 his renown as a physicist, astronomer, and mathematician was such that Louis XIV 
offered him a senior position in the newly founded Académie des Sciences, along with 
a substantial stipend and free living quarters. His stay in Paris lasted until 1681 when he 
returned to his Dutch homeland to recover from illness. Part of his 15 years in France 
was spent guiding the mathematical development of the young Leibniz. 

The revocation of the Edict of Nantes in 1685 ended any thoughts Huygens had of 
returning to Paris, for the rights of Huygens, a Protestant, were now severely curtailed 
in Catholic France. However, he did spend three months in England in 1689 to attend 
a session of the Royal Society (he had been a fellow since 1663) as well as to meet 
his celebrated contemporary, Isaac Newton, for the rst time. Huygens had immediately 
read and greatly admired a copy of Newton’s Principia, which had been dispatched to 
him on its publication. 

In the last years of his life, Huygens brought out the treatise Traité de la Lumiere, 
describing his radically new wave theory of light. In the later part of the book, he studies 
the phenomenon of double refraction, that is, the property of certain crystals (Huygens 
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worked with Iceland spar, calcite) to refract an incoming ray of light into two rays each 
traveling in a different direction. 

The need for reliable methods for determining longitudes at sea was of paramount 
importance for a maritime country like Holland. This led Huygens to invent the pendulum 
clock in 1656, and secure its patent the following year. This was an enormous step 
forward for navigators and also astronomers, who could now time the progress of the 
stars, sun, and planets with accuracy. Unfortunately, the period of oscillation of a simple 
pendulum remains approximately constant only when the arc of its swing is kept small— 
a requirement hard to meet on a pitching and rolling ship. To design an accurate marine 
clock, Huygens translated the mechanical problem into the mathematical problem of 
nding a tautochronous (Greek for “equal time”’) curve, that is, a plane curve down which 
a body, descending under gravity, will reach the lowest point in exactly the same time no 
matter where it starts. Because he was familiar with the cycloid problems proposed by 
Pascal in 1658, Huygens soon realized that the desired curve was an arch of an inverted 
cycloid. (For this reason, the cycloid is sometimes called the tautochrone.) Thus, if the 
pendulum bob is compelled to describe a cycloidal path, all its swings whether through 
large or small arcs will take the same time: the pendulum’s period is independent of how 
high the bob swings. 

The practical problem confronting Huygens was how to constrain the pendulum 
bob to follow a cycloidal path rather than the circular arc that it would normally de- 
scribe. His solution was to control the unequal oscillations of the pendulum by allowing 
the exible part of the suspension to wrap itself alternately about two curved metal 
strips, or “cheeks” as he called them. In this way, the bob would be pulled back from 
its unrestricted swing. After many trials, Huygens made the surprising discovery that 
the correct path is executed when the curved cheeks themselves have the form of a 
cycloid. 

An important by-product of Huygens’s investigations was his formula T = 27./1/g 
relating the oscillation period 7 of a simple pendulum undergoing small swings with its 
length /. This afforded a practical way of measuring g, the acceleration of gravity, with 
greater precision than that afforded by any direct experiments on falling bodies. Huygens 
found it to be 32.17 feet per second per second, at the latitude of Paris. 

The earliest work on the mathematical treatment of probability is the De Ratiociniis 
in Ludo Aleae of Huygens. It appeared rst as an appendix to Frans van Schooten’s 
Exercitationes Mathematicae (Mathematical Exercises), printed in Leyden in 1657. In 
the Exercitationes Mathematicae, van Schooten took up the most important individual 
results achieved by his pupils, one of whom was Huygens. Although Huygens wrote the 
original draft in Dutch, Latin was still thought to be the most appropriate language for 
expounding mathematics. When Huygens experienced some dif culty in coining suitable 
Latin terms for Dutch technical language, van Schooten provided the translation that was 
nally published—later the De Ratiociniis in Ludo Aleae came out in the vernacular. 
The most memorable of Huygens’s discoveries in the tract was the important concept 
of mathematical expectation, or as he called it, “the value (price) of the chance” to win 
in a game. For half a century, this little pamphlet was essentially the only text available 
on probability theory. Then, in the early 1700s, it was superseded by elaborate treatises 
from the hands of James Bernoulli and Abraham De Moivre. 
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The Bernoulli Brothers: John and James 


The Bernoulli family, Protestant in faith, ed Antwerp in 1583, after it had been 
captured by Catholic Spain. They rst sought refuge in Frankfort, moving shortly there- 
after to Switzerland, where the family settled in the free city of Basel. Basel had become 
a university town in 1460, a center for early printing that had attracted such Renaissance 
scholars as Erasmus, and a haven for the Huguenots during their long persecution. The 
founder of the Bernoulli dynasty married into one of the oldest families of the city and 
established himself as a prosperous merchant. During a century, this gifted family pro- 
duced no fewer than eight mathematicians, several of whom brought fame to the country 
of their adoption. The rst to attain prominence were the brothers James (Jacques, Jacob) 
Bernoulli and John (Jean, Johann) Bernoulli, grandsons of the fugitive from Antwerp. 
Their careers form the connecting link between the mathematics of the seventeenth and 
eighteenth centuries. 

James Bernoulli (1654-1705), carrying out his father’s wish for him to enter the 
ministry, took a degree in theology at the University of Basel in 1676. Meanwhile, he 
secured additional training in mathematics and astronomy, much against the wishes of 
his parents. Later in life, he chose as his motto /nvito patre sidera verso (“I study the 
stars against my father’s will”). During the years 1676—1682, Bernoulli traveled widely in 
France, England, and the Netherlands, familiarizing himself with the work of the leading 
mathematicians and scientists. There is no record that he encountered Christiaan Huygens 
at this time, but in England he met both Robert Boyle and Robert Hooke. In 1682, he 
opened a school for mathematics and science in his native city of Basel, presumably 
distilling the fruits of his new learning. Five years later, Bernoulli was appointed professor 
of mathematics at the University of Basel, holding the chair until his death. 


James Bernoulli 


(1654-1705) 


(Extract taken from A History of Science, Technology 
and Philosophy in the 18th Century, by A. Wolf: Repro- 
duced by kind permission of Unwin Hyman Ltd.) 
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The essentials of the differential calculus rst appeared in print in a six-page paper 
by Leibniz in the Acta Eruditorum of 1684, and the essentials of the integral calculus 
followed in 1686. Marred by misprints and poor exposition, these papers encountered an 
almost universal lack of comprehension. It is said that when the aging Huygens in 1690 
wanted to master the newly proposed methods, there were not half a dozen men quali ed 
to expound on the subject. One of the rst of the Continental mathematicians to achieve 
full understanding of Leibniz’s abbreviated presentation was James Bernoulli, who taught 
the techniques to his younger brother John. (John, in turn, instructed L’Hospital in the 
calculus, and the latter passed the knowledge on to Huygens.) Unlike the reticent New- 
ton, Leibniz corresponded extensively with fellow mathematicians on the subject of the 
in nitesimal analysis. He soon produced a group of enthusiastic adherents, who were 
convinced of its great potentialities and tried to expand the calculus in several different 
ways. Its rapid development into an instrument of great analytical power and_ exibility 
was largely due to the ability of the Bernoulli brothers. The Acta Eruditorum, in which 
Leibniz had published most of his papers, also opened its pages to the mathematical 
contributions of the Bernoullis. They achieved, with Leibniz, such a productivity that 
by the late 1600s almost all of what we now call elementary calculus had been created, 
along with the beginnings of ordinary differential equations. 

It is curious that James Bernoulli should have been born in that same year in which 
the correspondence between Pascal and Fermat laid the foundations of probability theory, 
for of all Bernoulli’s contributions to mathematics, the work for which he is best known is 
the Ars Conjectandi [The Art of Conjecturing]. He had been writing the book off and on 
for 20 years but was never satis ed with it. When he died in 1705, the all-but-complete 
manuscript was given to his 18-year-old nephew, Nicholas Bernoulli, with a view to 
editing it for publication. Nicholas, who was a former pupil of James, had already turned 
his attention to the theory of probability and so seemed the person best tted to nish 
the project. Apparently the young man did not consider himself adequate to the task; 
by his advice, the work was nally given to the printer in the state in which its author 
had left it. The Ars Conjectandi was published in Latin in 1713, eight years after James 
Bernoulli’s death. 

The Ars Conjectandi comes in four parts. The rst is a reproduction of Huygens’s 
De Ratiociniis in Ludo Aleae, accompanied by a commentary on all but one of Huy- 
gens’s propositions. Bernoulli’s commentary, which often offers alternative proofs of the 
fundamental propositions and in some cases extends them, is of more value than the 
original tract of Huygens. The second part of the Ars Conjectandi contains practically all 
the standard results on permutations and combinations in the form in which they are still 
expressed. Bernoulli said that others had treated these topics before him, especially van 
Schooten, Leibniz, and Wallis, and so intimated that his subject matter was not entirely 
new. In this part, he gave an array 
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that was substantially the same as Pascal’s arithmetic triangle, and provided the rst 
adequate proof of the binomial theorem for positive integral powers. Bernoulli’s proof 
was by mathematical induction, a method he rediscovered from reading the Arithmetica 
Infinitorum (1659) of Wallis. Newton seems to have rst stated the binomial theorem 
in general form for fractional exponents in a famous letter (1676) to Henry Oldenburg, 
the secretary of the Royal Society of London. But Newton gave no regular proof of his 
theorem, contenting himself with illustrating it by actual multiplication in some well- 
chosen examples. In 1774, Euler devised a proof for negative and fractional exponents, 
although the argument was faulty in that he failed to consider the convergence of the 
in nite series that arises in the case of negative exponents. 

The third part of the Ars Conjectandi consists of 24 problems relating to the various 
games of chance that were popular in Bernoulli’s day, and that were designed to furnish 
examples for the theory that had gone before in the book. The nal part of the treatise is 
entitled “Applications of the Previous Study to Civil, Moral and Economic Problems.” It 
was left incomplete by the author but nevertheless must be considered the most important 
part of the whole work. After a general discussion of philosophical problems connected 
with probability theory—probability as a degree of certainty, moral versus mathematical 
expectation, and so on—Bernoulli gave a proof of the celebrated theorem that bears his 
name. That he was proud of this result, the rst limit theorem of probability theory, is 
apparent from his own words in the Ars Conjectandi: 


This is that problem which I am now about to impart to the public after having suppressed 
it for twenty years. It is dif cult and it is novel, but is of such excellent use that it places a 
high value and dignity on every branch of this doctrine. 


Bernoulli’s theorem (which the French mathematician Poisson later called the “law 
of large numbers’’) is accurately stated as follows: If p is the probability of an event, 
if k is the actual number of times the event occurs in nv trials, if € > 0 is an arbitrarily 
small number, and if P is the probability that the inequality |k/n — p| < € is satis ed, 
then P increases to | as n grows without bound. Of course, it may happen that for some 
n, no matter how large, the difference |A/n — p| will be greater than € in a particular 
sequence of 7 trials. What Bernoulli’s theorem asserts is that those sequences in which 
the inequality |k/n — p| <€ is satised are much more likely to occur. 

With Bernoulli’s theorem, the Ars Conjectandi ended abruptly. It is especially to be 
regretted that the promised applications to matters of economics and politics remained 
un nished, because Bernoulli’s theorem was to become the stepping-stone from insignif- 
icant problems with urns containing colored balls or dice tossing and card playing to 
valuable and scienti cally justi ed applications of probability theory (such as mathemat- 
ical statistics, demography, and the theory of random errors). 

John Bernoulli (1667-1748), like his older brother James, ran counter to his father’s 
plans regarding his work in life. Originally scheduled to take charge of the family’s 
thriving trading interests, he escaped an unsuccessful apprenticeship as a salesman by 
getting permission to study medicine. Simultaneously, he was being privately tutored by 
his brother in the mathematical sciences. When Leibniz’s papers began to appear in the 
Acta Eruditorum, John mastered the new methods and followed in James’s footsteps as 
one of the leading exponents of the calculus. Through the intervention of Huygens, he was 
appointed professor of mathematics at Groningen in the Netherlands in 1695. At the death 
of James in 1705, he succeeded to his brother’s vacated chair at Basel, where he remained 
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for the rest of his life. Unfortunately, antagonism had developed and steadily increased 
between the two brothers, and their early collaboration eventually changed to rivalry. 

The custom of issuing challenge problems to rival scholars was resumed by the early 
practitioners of the calculus. One such problem, proposed by Leibniz in 1687, was that 
of nding an isochronous curve; that is, a curve along which a body affected only by 
gravity will fall with constant velocity. In the Acta Eruditorum of 1690, James Bernoulli 
published his solution to the problem of constant descent; solutions had already been 
given by Huygens in 1687 and Leibniz in 1689. The problem led him to the differential 
equation 


dy\/b?y — a2 = dxVa}, 


from which Bernoulli concluded the equality of the integrals of the two sides. In solving 
the equation the word “integral” was used for the rst time in its present mathematical 
sense. The required curve turns out to be a semicubical parabola (in simplest form, 
y> =x’). 

As a counterproblem, James Bernoulli used the same article in the Acta to challenge 
mathematicians to nd the theoretical form assumed by a exible chain, hanging freely 
under its own weight from two support points. This was an old problem, going as far 
back as the fteenth century. Galileo had maintained (1636) that the chain hangs in the 
shape of a parabola; but in 1646 the 17-year-old Christiaan Huygens proved that this 
cannot be the case, though at the time he was unable to determine the correct form. Three 
solutions were published in the Acta in 1691, coming from Leibniz, John Bernoulli, and 
the much older Huygens. As published, none of the three revealed the approach by 
which the result was obtained; but we know from their later publications that Huygens 
relied predominantly on classical geometric methods, whereas Leibniz and Bernoulli gave 
much more direct proofs by applying the new differential calculus. The actual shape of 
the suspended chain is a curve known as the catenary (the name given it by Huygens 
from the Latin catena, meaning “chain”), whose equation is y = (e + e~™)/2a. 

In a sense, the solution of the catenary problem was a test between the Archimedean 
style of freely partitioning plane gures into “in nitely small pieces” and the new trends 
as embodied in the work of Leibniz and Bernoulli. Calculus, in the eighteenth century, 
was to become less geometrical and took on a more analytic character. 

The solution of the catenary problem was John Bernoulli’s rst independent pub- 
lished work, and immediately assured the 24-year-old of a place among the leading 
mathematicians of the day. He was no doubt immensely proud that he had been success- 
ful, whereas his older brother, who posed the problem, had sought its solution in vain. 
Perhaps some perceived patronizing manner on the part of his sibling led the younger 
brother to brag, 


The efforts of my brother were without success. For my part, I was more fortunate, for I 
found the skill (I say it without boasting; why should I conceal the truth?) to solve it in full. 
... It is true that it cost me study that robbed me of rest for an entire night. 


Not entirely overshadowed, James Bernoulli went on to solve various generalizations 
of the problem, in which the hanging chain was allowed to be elastic, or of variable 
density. 

For years there had been a certain friction between the brothers stemming from 
the arrogance of the older one in dealing with the younger, and from the desire of 


The Bernoullis and Laplace 475 


the younger brother for fame. What nally brought the quarrel into the open was the 
famous problem of quickest descent, or the brachistochrone problem (the name is derived 
from the Greek words meaning “shortest” and “time”). In a lea et addressed (1696) to 
“the shrewdest mathematicians of all the world,” John Bernoulli posed the challenge of 
determining the curve down which a particle will slide in the shortest possible time from 
a given point to another not directly below. The search for the brachistochrone path had 
begun with Galileo, who believed incorrectly that the curve sought was an arc of a circle. 
Leibniz provided the answer on the day he received notice of the challenge, and accurately 
predicted a total of exactly ve solutions—from Newton, Leibniz, L’ Hospital, and the two 
Bernoullis. All these solutions were printed in the May issue of the Acta Eruditorum of 
1697. John Bernoulli rst found an incorrect solution, then tried to substitute his brother’s 
proof for his own. The path of quickest descent turned out to be part of a cycloid, that 
fateful curve of the seventeenth century. When the Bernoullis found out that the solution 
involved an arc of an appropriate cycloid, they were amazed. As James Bernoulli pointed 
out in publishing his proof, a curve that had been investigated by so many mathematicians 
that there seemed to be nothing more to discover about it, now suddenly displayed an 
entirely new property. By itself the problem may not seem so very important, but its 
solution laid the foundations on which Euler, Lagrange, and Legendre were later able to 
construct an entirely new branch of mathematics, called the calculus of variations. 

John Bernoulli’s bitterness increased when a French nobleman and amateur mathe- 
matician, the Marquis de L’ Hospital (1661—1704), published under his own name various 
discoveries communicated to him by Bernoulli. Not quite convinced that he could master 
the newly invented calculus by himself, L’ Hospital engaged the services of Bernoulli to 
spend several months in Paris in 1691 instructing him in the subject. L’ Hospital even 
induced Bernoulli, in exchange for a yearly allowance, to continue these lessons after 
the latter’s return to Basel. It was a curious nancial arrangement, in which Bernoulli 
agreed to send L’Hospital and no one else his latest discoveries in mathematics, to be 
used as the marquis might please. This correspondence was subsequently organized by 
L’Hospital into Analyse des Infiniment Petits (1696), the rst textbook on the differential 
calculus to appear in print. Supplying what many had awaited, namely an elementary 
introduction to the new in nitesimal analysis, its in uence and popularity dominated the 
whole of the eighteenth century. In the preface, L’ Hospital acknowledged his indebted- 
ness to Leibniz and Bernoulli, especially “the young professor at Groningen,” but only in 
very general terms: “I have made free of their discoveries, so that whatever they please 
to claim as their own I frankly return to them.” John Bernoulli watched, with appar- 
ent jealousy, the growing success of his protégé’s book. Following the author’s death, 
he wrote letters claiming that much of the content of the Analyse was really his own 
property, practically accusing L’Hospital of plagiarism. Matters were claried in 1921, 
when a manuscript of Bernoulli on the differential calculus, dating from 1691-1692, was 
discovered in the Basel University library. A comparison of Bernoulli’s manuscript and 
the text of L’Hospital revealed a considerable overlapping, so that John Bernoulli seems 
to have been the true author of almost all of mathematical substance in the Analyse. One 
of the most noted contributions of Bernoulli that was incorporated in the work is the 
theorem, since known as L’Hospital’s rule, for nding the limiting value of a fraction 
when numerator and denominator both tend to zero at the same time. 

Leibniz recognized the importance of good notation in mathematics, writing to John 
Bernoulli, “As regards signs, I see clearly that it is to the interest of the Republic of 
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Letters, and especially of students, that learned men should reach agreement on signs.” In 
their correspondence, they discussed both the name and principal symbol of the integral. 
Leibniz favored the name calculus summatorius and i as a symbol, bearing witness to 
the summation of a number of in nitely small areas. When Leibniz used the / in his 
1686 paper, it did not quite take its modern form. It was simply the small letter s as 
printed at that time, which resembled the modern type form for f. Bernoulli put forth 
the name calculus integralis, the idea being that some whole area was obtained by the 
summation of its parts, and the capital / as a sign of integration. In the end, Leibniz and 
Bernoulli reached a happy compromise, adopting Leibniz’s elongated S for the symbol 
of the integral and Bernoulli’s name integral calculus for the inverse of the calculus 
differentalis. The word “integral” itself was rst used in print by James Bernoulli in 
the Acta Eruditorum for 1690, although John Bernoulli characteristically claimed the 
introduction of the term for himself. 
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De Moivre’s Doctrine of Chances 


Returning to the development of probability theory, we nd that the next big mile- 
stone was the publication of De Moivre’s Doctrine of Chances: or, a Method of Calcu- 
lating the Probability of Events in Play (1718). This was the rst English treatise on the 
subject. Abraham De Moivre (1667-1754) was a French Protestant who was forced to 
seek asylum in London after the revocation of the Edict of Nantes and the expulsion of 
the Huguenots (1685). There he took up his lifelong, unpro table occupation as a private 
tutor in mathematics. He had hoped to obtain a university professorship, but he never 
succeeded in this, partly because of his non-British origin. A friend of Newton, he was 
appointed in 1710 to the partisan commission by means of which the Royal Society of 
London sought to review the evidence in the Newton—Leibniz dispute over the origin of 
the calculus. De Moivre supported himself for many years by solving problems proposed 
to him by wealthy patrons who wanted to know what stakes they should offer in games 
of chance. It is probable that the interest of these gentlemen led to De Moivre’s memoir, 
the Doctrine of Chances; it contained numerous problems on throwing dice and drawing 
balls of various colors from a bag, as well as questions relating to life annuities. After 
both sight and hearing failed, he sank into a state of lethargy, sleeping 20 hours a day. 
Like Cardan, De Moivre predicted the day of his own death, or so the story goes. Finding 
that each day he was sleeping a quarter of an hour longer than on the preceding day, 
he calculated that he would die in his sleep on the very day in which he slept up to the 
limit of 24 hours. De Moivre died in 1754, at the age of 87; the cause of his death was 
recorded as “somnolence.” 

Because factorials increase with extreme rapidity, it is desirable to have an approxi- 
mate expression for n! that is convenient to manipulate both practically and theoretically. 
In the rst edition of the Doctrine of Chances, De Moivre obtained a result (namely, 
for very large n, it is known that n! ~ cn"*'/2e~"), which is complete except for the 
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determination of an unknown constant factor c. In the second edition of 1738, he gave 
the nal formula with due credit to Stirling: “I desisted in proceeding farther till my 
worthy friend Mr. James Stirling, who had applied after me to that inquiry, discovered 
that c = 27.” Hence, the celebrated approximation, 


n! & /27nnn"e", 


which is usually known as Stirling’s formula, is at least as much the work of De Moivre. 

Before the 1770s, the calculus of probabilities had been largely restricted to the study 
of gambling and actuarial problems. The works of James Bernoulli and De Moivre had 
excited interest about the prospects for wider applications—to the estimation of errors in 
observation, to changes in the composition of population, or to the study of regularities 
that arise in political and social phenomena (for instance the measure of mean duration of 
life or marriage). But these had been little more than touched on. It was largely Laplace 
who went on to broaden the mathematical treatment of probability to areas of scienti c 
research well beyond games of chance. 


The Mathematics of Celestial Phenomena: Laplace 


Pierre Simon Laplace (1749-1827) came from relatively humble origins in Nor- 
mandy, a fact that in false shame he later found convenient to forget. Between the ages 
of 7 and 16, the young Laplace was educated in a school attached to a Benedictine priory 
in Beaumont; most of his teachers were members of the order. He entered the University 
of Caen at 16, intending to study theology, but soon decided that his true vocation lay 
in mathematics. Laplace seems to have remained at Caen for ve years, where he wrote 
his rst mathematical paper and was named a “provisional professor.” When he decided 
to go to Paris, he was suitably furnished with several letters of recommendation to Jean 
d’Alembert, then the leading mathematician in France. The letters remained unnoticed, 
but Laplace caught d’Alembert’s eye by addressing a long paper to him on the general 
principles of mechanics. This brought the enthusiastic reply, “You need no introduction; 
you have recommended yourself; my support is your due.” 

For men of science born without the advantages of rank or wealth, there were 
only a limited number of remunerative positions open in France in the 1700s. Apart 
from the opportunities to teach, one of the few salaried positions available was the 
coveted membership in the Académie Royale des Sciences; the small number of places 
(about 60 at a time) together with the large number of candidates provoked a high 
degree of competition for the honor. Mainly through d’Alembert’s in uence, Laplace 
was appointed professor of mathematics at the Paris Ecole Militaire in 1769 and elected 
to the select Académie in 1773. He was now launched on a career that was to become truly 
illustrious. 

The great success of Newton’s Principia made celestial mechanics an attractive eld 
for research, and as soon as Laplace reached Paris, he developed a de nite purpose for his 
life: to write a great treatise that would embrace all that was known about the mechanics 
of the heavens. He did not permit this activity to be greatly interrupted by the French 
Revolution or by the successive governmental changes through which he lived. During 
the most tumultuous period of French history, Laplace contrived to win appreciation 
and support for his talent from each ruling party in turn. Even though the stories about 
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Laplace’s political opportunism might be exaggerated, they are too much of a kind not 
to represent some core of truth. 

In 1785, as examiner of the scholars of the royal artillery corps, Laplace happened 
to examine a 16-year-old sublieutenant named Napoleon Bonaparte (1769-1821); and 
fortunately for both their careers, the examinee passed. When a coup d’état brought 
Napoleon to power in 1799, Laplace set aside whatever republican principles he might 
have had and offered his complete support to his old friend. Napoleon rewarded this 
devotion by naming Laplace minister of interior, but dismissed him six weeks later in 
favor of his brother, Lucien Bonaparte. Looking back years later, Napoleon quipped that 
Laplace had “sought everywhere for subtleties, had only problematic ideas, and carried 
the spirit of the ‘in nitely small’ into administration.” Yet if Napoleon held a low opinion 
of Laplace as an administrator, it did not dim his respect for him as a mathematician. 
In compensation for having removed him from of ce, Napoleon raised Laplace to the 
senate and made him a count. Laplace eventually became president of the senate. Because 
the recipient of patronage is likely in turn to be generous with compliments, it is not 
surprising that Laplace dedicated the next volume (Volume Three, 1802) of his Mécanique 
Céleste to “Bonaparte, the Paci cator of Europe to whom France owes her prosperity, 
her greatness and the most brilliant epoch of her glory.” After the senate voted to depose 
Napoleon in 1814 (Laplace had had the foresight to absent himself from Paris during 
this awkward moment), Laplace shifted his allegiance to the Bourbons. When the French 
monarchy was restored, Louis XVIII recognized this apparent loyalty by bestowing the 
title of marquis on Laplace and naming him to the Chamber of Peers. 

The man who became known as the Newton of France died in March 1827 at age 78. 
His contemporaries were quick to note that the month and year were exactly a century 
after the death of Newton himself. When those around his deathbed were recalling to 
him the great discoveries he had made, Laplace replied, “What we know is but a little 
thing; what we are ignorant of is immense.” 
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Although posterity may fault Laplace for his political maneuvering, his mathematical 
ability is beyond question. Laplace’s greatest achievement was the Traité de Mécanique 
Céleste, published in ve large volumes over 26 years (1799-1825). This modern A/- 
magest was designed, in the author’s own words, “to solve the great mechanical problems 
of the solar system and to bring theory to coincide so closely with observation that empir- 
ical equations should no longer be needed.” The Mécanique Céleste completed the work 
Newton had begun, for it showed that all the movements of the known members of the 
planetary system were deducible from the law of gravitation. Within its 2000 pages were 
incorporated all the important discoveries of the previous century—of Newton, Clairaut, 
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d’Alembert, Euler, and Lagrange—plus an immense amount of material that was origi- 
nal with Laplace himself. Even though it is granted that scientists of the time were far 
from conscientious in their citation practices, Laplace did not adequately acknowledge 
the contributions of others, but left the distinct impression that all the results contained 
in the Mécanique Céleste were his own. 

The Mécanique Céleste was one of the great scientic works of the early 1800s. 
John Playfair in a review (1808) spoke of it as “the highest point to which man has 
yet ascended in the scale of intellectual attainment.” The mathematical exposition in 
the Mécanique Céleste was extremely concise and not easy reading. In preparing the 
manuscript for press, Laplace was frequently unable to reconstruct the details in his 
chains of reasoning. If he was satis ed that a conclusion was correct, he was content to 
insert the optimistic remark, “It is easy to see that .. .” and give the result without further 
explanation. The American astronomer Nathaniel Bowditch (1773-1838), who translated 
four of the ve volumes into English, observed, “I never came across one of Laplace’s 
‘Thus it plainly appears’ without feeling sure that I had hours of hard work before me 
to Il up the chasm and nd out and show how it plainly appears.” 

There is a well-known anecdote of an encounter between Laplace and Napoleon over 
the Mécanique Céleste. When Laplace presented Napoleon with a copy of the monumental 
work, the latter teasingly chided him for an apparent oversight: “They tell me that you 
have written this huge book on the system of the universe without once mentioning its 
Creator.” Whereupon Laplace drew himself up and bluntly replied, “I have no need for 
that hypothesis.” Greatly amused by the comment, Napoleon mentioned the incident to 
the mathematician Lagrange, who is reported to have exclaimed, “Ah, but it is a beautiful 
hypothesis, it explains so many things.” 

Although many historians see Laplace’s famous reply to Napoleon as an indication 
that he was not always ingratiating himself with the current holder of power, it is possible 
to read into the story something that does not appear at rst glance. From an extensive 
comparison of earliest recorded and modern observations, Edmund Halley discovered 
in 1675 an irregularity in the motions of the planets Jupiter and Saturn. He noted that 
for many centuries the average angular velocity of Jupiter around the sun had been 
continually increasing, while for Saturn this measure had been continually decreasing. 
The implication was that the mean distance from the sun was always decreasing for 
Jupiter and increasing for Saturn. If the irregularities were to become progressively 
larger, the ultimate fate of Jupiter would be to fall into the sun, whereas Saturn would 
edge off course and be lost altogether in the solar system. Newton speculated that these 
seeming aberrations were somehow due to the mutual attraction of the planets but was 
unable to provide a mathematical theory that would account for it. In his opinion, the 
solar system was the deliberate design of the Creator (“This most beautiful system of 
the sun, planets and comets could only proceed from the counsel and dominion of an 
intelligent and powerful Being.”) and at some moment God would intervene so as to 
redress the disorder introduced by the action of the planets on one another. 

In a series of memoirs presented to the Académie des Sciences during the years 
1784-1788, Laplace proved that the gradual changes in the elements of planetary orbits 
were periodic, during a very long period, about 900 years in the case of the mean motion 
of Jupiter and Saturn. This periodicity meant that an increase in the eccentricity of an orbit 
would be followed, not by a catastrophe, but by a decrease reestablishing the status quo. 
The need for divine intervention to correct erratic behavior now seemed less likely than 
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Newton had assumed. This is what Laplace probably had in mind when he eliminated 
the action of God in celestial mechanics. Far from being an atheist, he thought of God 
as creating the solar system in the beginning in such a way that He did not subsequently 
need to intervene directly in its affairs. 


Mary Fairfax Somerville 


If any important work of the period required an explanatory commentary, it was cer- 
tainly Laplace’s Mécanique Céleste. Mary Fairfax Somerville (1780-1872), an extraordi- 
nary, self-educated Scotswoman who had studied the treatise in Edinburgh, was persuaded 
by the Society for the Diffusion of Useful Knowledge to prepare a popular exposition of 
this work of Laplace for English readers. She was nearing 50 years of age, lacked formal 
training, and had never written for publication. Yet Laplace is said to have observed that 
Mary Somerville was the single woman who understood his great treatise. According 
to one story, when Laplace dined with the Somervilles he told Mary Somerville—not 
realizing that she had been the widow of Samuel Grieg—‘‘Only two women have ever 
read the Mécanique Ceéleste, and both are Scotch, Mrs. Grieg and yourself.” 

Although she was unsure of her quali cations, she accepted the assignment with 
the understanding that she would write in secret and that if the manuscript proved 
unsatisfactory it would be destroyed. Adding full mathematical explanations and dia- 
grams to make Laplace’s work comprehensible, her popularization was completed in 
1830 under the title The Mechanisms of the Heavens. The rst printing was small, 750 
copies, because apparently the printer was dubious of the nancial success of a book on 
such a subject. Nevertheless, it met with such great praise that the preface, containing 
the necessary mathematical background, was issued separately in 1832 as A Preliminary 
Dissertation on the Mechanisms of the Heavens, and both it and the previous volume 
went through many printings. They became required textbooks on higher mathematics 
and astronomy in British universities for a century, “as essential works to those who 
aspire to the highest places in examinations.” Hailed by the English press a “Queen of 
Nineteenth-Century Science,” Mary Somerville was the last of the great amateur ex- 
positors of science, producing compilations of the state of knowledge in a number of 
different areas. Chief among these were her Connection of the Physical Sciences (1834) 
and Physical Geography (1848). Physical Sciences ran through some 10 editions during 
the next 40 years and was translated into French, German, and Italian. Her last work, 
the two-volume Molecular and Microscopic Science, was published when she was 89 
years old. Accorded numerous honors, Mary Somerville was elected to the American 
Philosophical Society in 1869, at the same time as Charles Darwin. 

Laplace also wrote a popular work, Exposition du Systeme du Monde (1796), which 
was addressed to the nonprofessional reader. Serving in effect as a preface to the highly 
technical Mécanique Céleste, the Exposition put forth a speculative hypothesis about the 
origin of the solar system. Laplace was trying to account for the remarkable circumstance 
that the motions of all the members of the solar system were in the same direction. (The 
motion of the satellites of Uranus is in the opposite direction, but this was not known 
at the time.) Laplace suggested that the bodies of the solar system originated from a 
vast, rotating cloud of incandescent gas, a solar nebula. In cooling, such a cloud would 
condense and contract, forming a more concentrated uid mass, whose rate of rotation 
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would be increasing in the process. A stage would be reached in which the cohesive 
forces could no longer hold it together, and small pieces would then be thrown off; these 
would quickly cool and coalesce to form separate planets. This “nebula hypothesis” struck 
the imagination of the age and was in vogue for nearly a century. It has not survived as 
Laplace put it forth, but in modi ed form represents what is now thought to take place 
in the birth of stars. 


Laplace’s Research in Probability Theory 


Laplace’s genius was also shown in his contributions to probability and to the emerg- 
ing theory of mathematical statistics. From 1774 on, he published a series of memoirs that 
were eventually expanded into a mature nal work, the 464-page Théorie Analytique des 
Probabilités (1812). Laplace seems to have left probability untouched between 1786, 
with a paper on the demography of France, and 1809, when his attention once again 
turned to the subject. In the interval, he was preoccupied with celestial mechanics, incor- 
porating his own research and that of other astronomers in the Mécanique Céleste; and 
he also had such governmental responsibilities as serving on the Committee on Weights 
and Measures, which established the metric system. In 1809, Laplace moved back into 
probability theory by way of several papers on the analysis of probable error in observa- 
tions. The result of greatest novelty here was his derivation of the so-called least-squares 
rule for minimizing the uncertainty in a number of independent observations. Legendre 
had published the rule in 1805, and this may have been what turned Laplace back to 
probability theory after a lapse of a quarter-century. 

The memoirs from his earlier years were revised mathematically, buttressed with 
new material, and published as the Théorie Analytique des Probabilités. By presenting the 
solution of almost every classical problem of probability theory, he traced the evolution 
of the subject, at the same time systematizing and extending the previously known but 
often uncoordinated results of many mathematicians. The remarks on the obscurity and 
lack of rigor in Laplace’s exposition that the Mécanique Céleste called forth apply also 
to the Théorie Analytique des Probabilités. In a review (1837), De Morgan wrote: 


The Théorie Analytique des Probabilités is the Mont Blanc of mathematical analysis; but 
the mountain has this advantage over the book, that there are guides always ready near the 
former, whereas the student has been left to his own method of encountering the latter. 


Although the reading may have been dif cult, his work sustained three editions (1812, 
1814, and 1820) and four supplements (1816, 1818, 1820, and 1825) during Laplace’s 
lifetime. 

Beginning with its second edition of 1814, the Théorie Analytique des Probabilités 
contained a lengthy preface of 153 pages, which was often republished as a separate book- 
let under the title Essai Philosophique sur les Probabilités. The Essai bears the same rela- 
tion to its subject as the Exposition du Systeme du Monde does to the Mécanique Céleste, 
although this time the technical treatise preceded the popularization. It was almost en- 
tirely devoid of mathematical symbols or formulas, and its aim was to acquaint a broader 
circle of readers with the fundamentals of probability theory and its applications without 
resorting to higher mathematics. Indeed, large portions of the Essai are transcriptions into 
everyday language of certain parts of the Théorie Analytique des Probabilités, where the 
same material was developed more mathematically and more succinctly. Both the Essai 
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Philosophique sur les Probabilités and the Exposition du Systeme du Monde originated 
in Laplace’s responsibility for giving lectures on elementary mathematics during the year 
1795-1796 at the Ecole Normale, one of the several new educational institutions within 
which the scientic life of France remained alive during the Revolution. 

The Théorie Analytique des Probabilités consists of two books, the rst of which, 
entitled The Calculus of Generating Functions, is devoted to analysis. In Book II, or 
General Theory of Probability, Laplace divided his subject into three sections: the theory 
of probability proper, limit theorems, and mathematical statistics (not yet distinguished 
as a separate discipline). On the rst page of Book II, he gave what is regarded as the 
classical de nition of the probability of an event: 


The probability of an event is the ratio of the number of cases favorable to it, to the number 
of possible cases, when there is nothing to make us believe that one case should occur rather 
than any other, so that these cases are, for us, equally possible. 
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The general formula embodied in this statement, namely, 


number of favorable outcomes 
Pr [event], = —_—_________—_ 
total number of outcomes 


requires that all outcomes be equally possible. The interpretation of equipossibility as the 
insuf ciency of grounds for preferring one outcome over another was not original with 
Laplace. In the Ars Conjectandi, Bernoulli wrote, “All cases are equally possible, that is 
to say, each can come about as easily as any other.” However, before the publication of 
Ars Conjectandi, no well-circulated work used the notion, and Bernoulli himself did not 
use it much. 

Proceeding to the basic results of probability theory, Laplace derived, among other 
things, the widely used addition and multiplication theorems of probabilities. The modern 
formulations of these follow: 


1. If A and B are mutually exclusive events (that is, they cannot both occur at the 
same time), then 


Pr[A or B] = Pr[A] + Pr[B]. 


2. If A and B are two independent events (that is, the occurrence of one doesn’t 
affect the probability of the other), then 


Pr[A and B] = Pr[ A] Pr[B]. 


For instance, if a card is drawn at random from a standard bridge deck and a second 
card drawn from a pinochle deck we might ask about the probability that both are aces. 
A bridge deck contains 52 cards of which 4 are aces, and a pinochle deck of 48 cards 
has 8 aces. Denoting the events of getting an ace from a bridge and a pinochle deck by 
A and B, respectively, we have 


4 8 1 
52 48 78° 

Laplace went on to consider a whole series of problems drawn from models offered 
by games: the probability of drawing all the numbers of a lottery at least once in a given 
series of selections; the probability of selecting a given number of balls of one color 
from an urn; the probability of selecting numbered balls from an urn in a speci ed order; 
and so on. Many of these involved what is now called a Bernoulli process or a Bernoulli 
trials experiment. Huygen’s problems often were a point of departure for Bernoulli in 
developing new results in the Ars Conjectandi. His commentary on Proposition 12 of 
Huygen’s De Ratiociniis in Ludo Aleae set forth what must have been the rst mention 
in print of a series of independent trials with constant probability—hence the name, 
a Bernoulli trials experiment. In essence, a Bernoulli trials experiment is a sequence 
of independent, repeated trials in which each trial has only two possible outcomes, a 
success or failure. If the probability of success on a single trial is denoted by p and 
the probability of failure by g = 1 — p, then p and q remain constant from trial to trial. 
Bernoulli showed that the probability of observing exactly 7 successes in n such trials 
was expressed by the rth term of the expansion for (p + q)"; that is, 


Pr[A and B] = Pr[A] Pr[B] = 


: n 7 < 
Pr[r successes and n —r failures] = ( ) pq: 
r 
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To see how Bernoulli derived this, let us consider one particular way in which exactly 
r successes can occur in 7 trials, namely, the instance SS---SFF---F of 7 successes in 
the rst r trials followed by n —r failures in the remaining trials. Because the trials 
are independent, the probability of this particular event is the product of p’ for the r 
successes and q”~" for the n —r failures: 


Pr[S---SF---F] = p’q"”. 
Now the total number of ways in which the 7 trials can be divided into exactly r successes 
and n —r failures equals ("). Because these individual ways are mutually exclusive 
r 


events, each having the probability p’q"~", the sum of their probabilities is 


n rr nr 
("loro 
which is the probability sought. 

Bernoulli’s formula might best be understood by two examples, one in which the 
formula applies and another in which it does not. If six well-balanced dice are thrown, we 
might ask what is the probability that exactly two of the dice have top faces showing at 
least a ve. This can be viewed as a Bernoulli trials experiment in which the probability 
p of success is 7 (a ve or a six showing) and failure g = 1—p= 2. The desired 
probability is therefore 


6\ (1\* (2\4 1\ (16\ 240 
Pr[2successes and 4 failures] = (5) (=) (=) =15 (3) (=) = 359" 


Suppose, for a second illustration, that at a county fair a prize is offered to any 
person who in a sequence of four shots can hit the target three times in a row. What 
is the probability that a person will win the prize if his probability of hitting on any 
given shot is 3? Considering the shots to be unrelated, we know that they constitute four 
Bernoulli trials with probability p = ; for success. If it was simply a matter of nding 
the probability of three and only three successes in four trials, Bernoulli’s formula would 
tell us that this occurs with probability 


AN /3N7 71 
Pr[3successes and | failure] = — —}. 
ay ay NA 


But the outcome SSFS, although it involves three successes, will not win any prize, 
because the three shots that hit the target are not three in a row. The prize will be won 
only in the three mutually exclusive cases SSSF, FSSS, and SSSS, whence the probability 
sought is 


3 3 4 
Pr[SSSF] + Pr[FSSS] + Pr[SSSS] = (3) (;) + (3) (3) + (3) - = 


Among the many problems Laplace solved in the Théorie Analytique des Probabilités 
is the celebrated “needle problem” due to Georges Louis Leclerc, Comte de Buffon 
(1707-1788). Buffon, who was the author of a 36-volume Histoire Naturelle, posed 
(1777) the following: 
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A large plane area is ruled with equidistant parallel lines, the distance between two consec- 
utive lines of the series being a. A thin needle of length / < a is tossed randomly onto the 
plane. What is the probability that the needle will intersect one of the lines? 


Using integral calculus, one can show that the formula p = 2//za gives the probability p 
that the needle will fall across a line. Laplace was the rst to point out that by throwing 
the needle many times and assuming the statistical estimate for p, one can use this 
formula to arrive at an approximate value for 2. Sometime between 1849 and 1853, 
Johann Wolf, professor of astronomy at Bern, performed the actual experiment. In his 
demonstration, the width between the parallels was taken to be 45 mm and the needle 
used was 36 mm long. The needle was tossed 5000 times, and it cut a line 2532 times. 
Hence the relative frequency was 


2532 
— = 0.5064. 
5000 
Taking this value for the theoretical probability of intersections and setting 
72 
— = 0.5064 
45a 


produced mz ¥ 3.1596, which differs from the known value of z by less than 0.02. The 
best result of this kind was obtained in 1901 by Lazzerini, who by carrying out only 
3408 trials, found z correct to six decimal places. Laplace was led to extend Buffon’s 
problem to a grid of two mutually perpendicular sets of equidistant parallel lines. If the 
distances between members of the sets are a and J, the probability that a needle of length 
I (less than a and 5) will intersect one of the lines is 


2a +b)—P? 
p=———. 
scab 


The latter part of the Théorie Analytique des Probabilités consists of the many ap- 
plications that were radically to enlarge the realm of probabilistic analysis. In particular, 
Laplace posed and solved numerous questions in demography, the one area for which as- 
sembling statistically signi cant information had begun by the end of the 1700s. Laplace 
considered such matters as the estimated increase in life span if a certain disease, say 
smallpox, was to be eliminated; the probability that the existing ratio of male to fe- 
male births in Paris would continue for the next hundred years; and the evaluation of 
the correctness of witness testimonies and court decisions—Laplace observed that the 
probability of a correct unanimous verdict increases with the number of judges. 

Estimating the population of a given nation had been considered an important prob- 
lem in political arithmetic since the 1600s, for population was seen as an index to the 
prosperity of a nation. Demographers proposed to reach estimates of the population by 
determining a factor by which the average annual number of births should be multiplied 
to give the approximate total population. With data from a sample census, Laplace calcu- 
lated the ratio of the population m in the sample districts to the annual number v of births 
in these districts. He therefore estimated the whole population of France from the formula 


where N is the annual number of births for the nation as a whole. His sample, taken 
from 30 districts scattered over France, produced the gures 2,037,615 inhabitants and 
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215,599 births during a three-year period (1800, 1801, 1802). Assuming the number 
of births in France each year to be | million, he concluded that the whole population 
contained people in the amount 


P=3 a> 10° = 28,352,845 
7 215,599 a rae aes 


Perhaps the more signi cant problem was his estimation of the degree of error in this 
gure (the rst estimate of error inherent in sampling) and the computation of how large 
a sample would be needed to reduce the range of probable error to speci ed limits. 

Although the Théorie Analytique des Probabilités is a monument to probability the- 
ory, it had a certain negative effect on the development of this discipline. The followers 
of Laplace promulgated the view that the political and social sciences held the same 
certainty as the natural sciences, and hence the analysis of social phenomena should fall 
within the scope of probability theory. This led to various unjusti ed and inappropriate 
applications of the theory, extending its methods to areas of no relevance to the subject, 
such as the correctness of court decisions. Many of these unsubstantiated applications had 
so little validity that they cast on probability theory a suspicion of quackery, holding it a 
kind of mathematical entertainment unworthy of serious consideration. The necessity of 
reexamining the logical foundations of the subject to secure its position as a full- edged 
mathematical science became all too clear. 

The concept of mathematical expectation, one of the most important notions of 
probability theory, was introduced by Huygens as “the value of a chance.” We probably 
owe the modern terminology to van Schooten, because the word expectatio appears for 
the rst time in his translation of Huygen’s tract; for a long while spes (“hope”) ran it 
a close second choice. To give a precise de nition: if the possible outcomes of a game 
or experiment are given numerical values, a, a2,..., da, and these outcomes occur with 
probabilities p1, p2,..., Pn, then the mathematical expectation E of the game is 


E =a, pi + a2p2 +++ +anPn. 


If the game has an expectation of zero, then it is said to be fair. The “expectation” is not 
to be interpreted as the value that will necessarily occur on a single play of the game 
but rather as the average payoff in a long run of similar gambles. 

An example will illustrate this idea. Suppose a gambler rolls a fair die after which 
the casino pays him as many dollars as there are dots showing on the upper face. We 
may denote the individual outcomes by 1, 2,3, 4,5, and 6 (the possible number of dots 
showing), each occurring with probability z. Thus, the gambler’s expectation per game is 

[eee ect oy Seer cate eon ae 
= a ow as an ae or 
21 #7 


1 
= —(14+24+34+44+5+6)=—=-. 
poe es tO) = 


This sum might be taken as a fair price to pay for the privilege of playing the game. In 
a long sequence of games, a gambler paying this price would have an average winning 
per game near zero. 

For another illustration of mathematical expectation, consider the game of roulette. 
The wheel has 37 equally spaced slots, which are numbered from 0 to 36, inclusive. If 
a man playing roulette bets, say $1, on a given number and if the ball comes to rest in 
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the slot bearing that number, he receives $36 from the croupier; that is, he recovers his 
bet and receives in addition 35 times the amount wagered. The player wins $35 with 
probability a since all the slots are the same size, and loses $1 with probability 38, so 
that his expectation is 

1 36 1 


: 37 t 35 37 


This means that the player can expect to lose about 2.7 cents per bet in the long run. 

Mathematical expectation need not be limited to games of chance but can be applied 
to any experiment that is repeated frequently and under identical conditions. Laplace, 
in his Essai Philosophique sur les Probabilités, calculated the expectation of a subse- 
quent sunrise if it was given that the rising sun had been observed 1,826,623 times in 
succession, or each day for the previous 5000 years. 


Daniel Bernoulli, Poisson, and Chebyshev 


Probability theory abounds in paradoxes that wrench the common sense and trap the 
unwary. The most famous of these was rst set forth in a letter of Nicholas Bernoulli, 
written in 1713 to Pierre Rémond de Montmort: 


Two players A and B agree to play a game in which A tosses a coin. The game is to continue 
until the rst head appears. Player B will give player A a coin if a head appears on the rst 
throw, two coins if a head appears for the rst time on the second throw, four coins if on 
the third throw, 8 if on the fourth, and so on. What amount should A pay B before the start 
of the game to make it a fair game? 


In 1738, Daniel Bernoulli (1700-1782) tackled the problem proposed by his older 
cousin and published his investigations in the Proceedings of the St. Petersburg Academy 
of Sciences. As a result, the problem has subsequently become famous under the title of 
the St. Petersburg problem, or the St. Petersburg paradox. 

The question is: What sum should A pay B for the privilege of taking part in this 
game? For the game to be fair, A should agree to pay as an entrance fee the number 
of coins equal to A’s mathematical expectation of gain. Let us therefore calculate this 
expectation. The number of outcomes is in nite, because there is no logical limit to the 
number of tails that might appear before the rst head shows up. Thus, the expectation 
involves the sum of an in nite number of terms (an in nite series). If there is a sequence 
of n—1 tails before the rst head appears, then B is to pay out 2”~' coins for this 
outcome; and the probability that heads will show up for the rst time on the nth toss is 
(3)". The total expectation of A is therefore given by 


1(4) +2(4) +4($) 4-427 (4) +-- 


re 
= 2 + 3 + 2 a ae Fy 
which is in nite. The result seems to say that in order to play, A must rst deposit with 
B an innite sum of money; or put another way, no matter how much money A should 


offer B to enter the game, A would still come out the winner if the game was repeated 


490 


Chapter 9 The Development of Probability Theory: Pascal, Bernoulli, and Laplace 


Daniel Bernoulli 


(1700-1782) 


(Smithsonian Institution.) 


on the same terms enough times. From a commonsense standpoint this seems palpably 
absurd. A person would not pay any considerable sum for the advantage that seems to 
be offered him in this game. 

The French scientist Buffon conducted an experiment in tossing coins and found that 
in 2084 games, 1061 gave heads on the rst toss, 494 on the second, 232 on the third, 
137 on the fourth, 56 on the fth, 29 on the sixth, 25 on the seventh, 8 on the eighth, 
and 6 on the ninth. Computing the various expectations, he concluded that B would have 
paid A a sum of 10,057 coins for the 2084 games, so that a modest entrance fee for 
each play would make the game fair. The “paradox” in the St. Petersburg paradox is that 
there is a discrepancy between the results of mathematical reasoning and the dictates of 
common sense and experience. 

The St. Petersburg paradox puzzled mathematicians for generations and became one 
of the most hotly debated problems in the whole realm of probability theory. Numerous 
explanations were offered, some of them bordering on the ridiculous. Probably the rst 
was given by Daniel Bernoulli, who distinguished between what he called “mathemat- 
ical” expectation and “moral” expectation. If the mystical notion of moral expectation 
is applied to the problem in place of mathematical expectation, then a nite value is 
obtained for the expected gain. d’Alembert sought to resolve the paradox by introduc- 
ing two forms of possibilities, “metaphysical” and “physical” (that a head would appear 
for the rst time after 1000 throws was metaphysically possible, but quite impossible 
physically). Others preferred to reconcile the mathematical theory with common sense 
by pointing out that the fortune of the person whom we call A is necessarily nite, so 
that the problem is inherently impossible. 

There is still no generally accepted verdict about how to treat the St. Petersburg 
paradox, except to note that mathematical expectation gives a meaningless result when 
applied to this problem. If one makes additional assumptions not warranted by the express 
conditions of the game, it is possible to arrive at a solution. For instance, suppose that 


The Bernoullis and Laplace 491 


instead of player B’s paying out the amounts 1, 2,4,...,2”~', ... to A, these are replaced 
by amounts 1,7, r?,...,r"—!,... where 0 <r <2. Then the mathematical expectation 
of A’s gain is 


e=i(3)+r(3) +2() 4-4 (BY + 
2 2 2 2 
Se ()e (C) eee) & 
2 2X2 2 \2 2/2. 
=s[t+(S)+(S) ++ (G+ 
2 2 2 2 
1 


i 4 
~F faee) 2=7 


> 


which is nite. For instance, if r = 3, then A should pay the sum of 2 coins to play 
the game. However, as r approaches the value 2, the expectation grows larger without 
bounds; that is to say, it becomes in nite. 

Siméon Denis Poisson (1781-1840) was a faithful follower of Laplace in applying 
probabilities to social areas where signi cant statistical information was already available. 
Indeed, he seemed to have taken it as his mission to continue much of the mathematical 
work of his mentor and collaborator, Laplace. Poisson was born in the town of Pithiviers, 
some 50 miles south of Paris, the son of a junior of cial in the local government. 
After the death of his father, the young boy was sent to live with an uncle, a surgeon, 
in the hope that he too might take up a medical career. But Poisson tended to grow 
faint upon observing the most minor of operations and expressed no interest in the 
profession. 

In 1798 he enrolled instead at the recently founded Ecole Polytechnique, having 
placed rst in its entrance examination. Poisson’s unusual mathematical ability was 
quickly recognized by Laplace and Lagrange; inside of six weeks at the school, he 
had generalized a theorem of Lagrange. With the backing of Laplace, who had come to 
look upon the young man as a son, Poisson was given a lecturer’s position immediately 
upon his graduation in 1800 at the age of 18. Six years later, when Fourier left his po- 
sition to enter Napoleon’s civil administration, Poisson succeeded him as full professor. 
Poisson devoted his academic career to teaching at the Ecole Polytechnique. Regarded as 
an excellent instructor, he once remarked, “Life is good for only two things, discovering 
mathematics and teaching mathematics.” 

In his relatively brief lifetime, Poisson wrote well over three hundred papers on 
mathematics and applied problems. Today his name is attached to a version of the poten- 
tial equation, a differential equation important in the mathematical theory of electricity 
and magnetism, and to a type of probability distribution often used to describe the num- 
ber of events that will occur in a specied period of time. Poisson’s main work on 
probability was the book Recherches sur la probabilité des judgements (1837), the last 
hundred pages of which is devoted to the matter re ected in the title. From his analysis 
of numerous previous court decisions, he determined the probability of a French citizen 
accused of a crime being convicted or acquitted. Poisson contributed treatises of con- 
siderable merit, particularly the two-volume Traité de méchanique (1811, 1833) and the 
Théorie mathématique de la chaleur (1835), a work on heat conduction. 
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Following the deaths of Laplace and Poisson, interest in probability theory tem- 
porarily receded into the background in western Europe. Fortunately, the emerging St. 
Petersburg school of mathematics was to make the subject a focus of its attention. The 
“school,” whose members either taught at or had been educated at St. Petersburg Uni- 
versity, centered around the foremost Russian mathematician of the nineteenth century, 
Pafnuty Chebyshev (1821-1894). Chebyshev came from the Russian nobility, his father 
having been an of cer in the Napoleonic wars. His early training in mathematics was at 
Moscow University, from which he graduated in 1841 and received a master’s degree 
two years later. Because suitable teaching positions were dif cult to obtain in Moscow, 
Chebyshev accepted an appointment as assistant professor at St. Petersburg University, 
where he was to remain until retirement 35 years later. Soon after his arrival in St. Pe- 
tersburg, he was asked to help in the preparation of a new edition of Euler’s collected 
number theoretical works. The task seems to have inspired Chebyshev to write a doc- 
toral dissertation in this area of mathematics, the degree being awarded in 1847. He was 
elected to membership in the St. Petersburg Academy of Sciences in 1859, and almost 
simultaneously was made a full professor at the university. 

While Chebyshev was prominent in many areas of mathematics, his legacy rests 
primarily on contributions to probability theory and the theory of numbers. One of 
his earliest publications, the celebrated Mémoire sur les nombres premiers of 1850, 
provided the rst signicant advance in the study of the function z(x), which desig- 
nates the number of primes not exceeding x. He also proved (1854) that for any integer 
n > 3 there is always a prime number between n and 2n — 2; this result is often called 
Bertrand’s Conjecture after the French mathematician Joseph Bertrand who veri ed, in 
1845, the assertion for all n < 6 - 10°. Chebyshev’s long-standing interest in probability 
went back to his master’s thesis at Moscow University. In 1866, he derived the law of 
large numbers in a form that generalized the Laplace-Poisson version and later (1887) 
established an equally important result known as the central limit theorem. Perhaps his 
greatest in uence on the development of probability theory was the insistence that it 
be treated as a mathematical science with its laws, particularly those involving limits, 
deduced rigorously. The effect was to broaden the scope of its applications to various 
branches of the natural sciences and engineering. Chebyshev attracted talented young 
mathematicians to the eld, his most celebrated pupil being Andrei Markov (1856— 
1922). Markov introduced the notion of “chained probabilities’—now called Markov 
chains—wherein the probability of an event depends only on the outcome of the event 
directly preceding it. 

By the turn of the twentieth century, mathematics displayed an ever greater empha- 
sis on abstraction and formalization. The Moscow mathematician Andrei Kolmogorov 
(1903-1987) furthered the effort during the late 1920s by developing a system of ax- 
ioms for probability theory. These appeared in his Grundbegriffe der Wahrscheinlichkeits- 
rechnung [Foundations of the Theory of Probability], published in Berlin in 1933. His 
approach was linked to the new theory of sets, essentially de ning probability as a 
set-function. 

Kolmogorov takes as his universe of discourse U, all possible outcomes (so-called 
elementary events) of an “experiment.” Each subset 4A of U (an event of the experiment) 
is assigned a real number 0 < P(A) < | as its probability. In particular, P(U) = 1. If 
two subsets A and B of U have no elements in common, then the probability of their 
union P(A U B) is P(A) + P(B). 
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From these simple beginnings, Kolmogorov goes on to derive further properties, 
such as P(@) = 0 for the empty set and P(A) = 1 — P(A) for the complement A of A. 
There are more involved consequences, such as P(AM B) = P(A) + P(B)— P(AU B) 
and P(A) < P(B) when A C B. Its axiomatic formulation gave to probability theory the 
formal validity enjoyed by other branches of mathematics. 


9.3 Problems 


. In the Liber de Ludo Aleae, Cardan held that since 


the probability of throwing a 6 with a single die is :, 
the probability that a 6 will appear when a pair of 
dice is rolled should be 2( +) = i. What is the correct 
probability? 


. An Italian nobleman who was an amateur 


mathematician and an inveterate gambler had, by 
continued observation of a game in which three dice 
were simultaneously thrown, noticed that a sum of 10 
was achieved more often than a sum of 9. He 
expressed his surprise at this and asked Galileo for 
an explanation. Calculate the probability, by 
enumerating the possible equally likely cases, that 
each sum will be rolled. (Assume that each of the 
three dice is painted a different color.) 


. The letters of the word “tailor” are written on 


different cards, and after the cards are thoroughly 
shufed, four are drawn in order. What is the 
probability that the result spells the word “oral”? 


(a) An urn contains four black and ve white balls. 
A ball is drawn at random and then replaced, 
after which a second ball is drawn. Find the 
probability that the rst is black and the second 
is white. 

(b) Ifa pair of fair dice is rolled, what is the 
probability that the sum shown is either exactly 
9 or less than 4? 

(c) A coin is tossed, then a card is drawn from an 
ordinary deck of 52 cards, and then a die is 
rolled. What is the probability of obtaining a 
head on the coin, an ace from the deck, and a 5 
on the die? 

(d) If one letter is chosen at random from the word 
boot and one letter from the word toot, what is 
the probability that both letters match? 


. For any two events A and B, not necessarily 


mutually exclusive, the formula 


Pr[A or B] = Pr[A] + Pr[B] — Pr[A and B] 


holds. Use this to solve the following: 


(a) <A card is selected at random from an ordinary 
deck of 52 cards. What is the probability that it 
is either an ace or a spade? 

(b) If an integer between 1 and 1000, inclusive, is 
selected at random, what is the probability that 
it begins with | or ends in 1? 


. Let A and B be events with Pr[B] 4 0. Then the 


probability of A given B written Pr[A|B], is 


Pr[4 and B 
Pr[4|B] = — 


Use this formula to solve each of the following: 


(a) Two fair dice are rolled and the numbers on the 
uppermost faces noted. Determine the 
probability that one die shows a 4, given that 
the sum of the numbers is 7. 

(b) An urn contains red balls marked 1, 2,3, a 
white ball marked 4, and blue balls marked 
5,6, 7, 8,9, and a ball is selected at random. 
Determine the probability that it is red, given 
that the ball is known to have an even number. 

(c) A single card is drawn from a standard deck of 
52 cards and replaced by a joker, and then a 
second card is drawn. What is the probability 
that both cards are aces? 


. Find the probability that A will win each of the 


following two games proposed by Huygens in the De 
Ratiociniis in Ludo Aleae: 


(a) A wagers B that given 40 cards of which 10 
are of one color, 10 of another, 10 of another, 
and 10 of yet another, 4 will draw 4 so as to 
have one of each color. 

(b) Twelve balls are taken, 8 of which are black 
and 4 white. A plays with B and undertakes, in 
drawing 7 balls blindfolded, to pick 3 white 
balls. 


. In the Ars Conjectandi, Bernoulli obtained the sum 


of the series 
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10. 


11. 


Chapter 9 


by rst writing 


1 1 1 1 
S=1l+-+2-+-4+:---+ 
n+1 
1 


1 1 
S=14+-+—-+4+.---4+-— 
2. 3 aes n + n+l 
and then subtracting. Use this technique to obtain the 
appropriate sum. 


(a) A family has ve children. If the probability of 
birth of either sex is assumed equally likely, 
what is the probability that exactly four of the 
children are boys? 

(b) Suppose that a die is made by marking the 
faces of a regular dodecahedron with the 
numbers | through 12. What is the probability 
that on exactly three of six tosses, a number 
less than 4 turns up? 

(c) Assume that the probability that a married 
couple will get a divorce within the next 10 
years is 4. Given eight randomly selected 
couples, nd the probability that all eight will 
have the same partners after 10 years. 

(d) An urn contains three red balls and two white 
balls. A ball is drawn, then replaced. Find the 
probability that a white ball will be drawn four 
times in a row. 


Prove that the probability of obtaining at least r 
successes in n trials of a Bernoulli trials experiment, 
where the probability of success on a single trial is 
Pp, is given by 


Pr[at least r successes in n trials] 


—_ n r = n—r n r+ 
= (")e (La gy ads (." ie . 


(ee eee eee ("ora apie 


Using the formula in Problem 10, solve the 
following: 


(a) Ifa fair die is tossed ve times, what is the 
probability that at least three 6s are thrown? 

(b) A multiple-choice test has four “answers” listed 
for each of the 10 questions; only one answer is 
correct. If a student who knows nothing about 
the material guesses at the answers, what is the 
probability that the student will get at least 9 
correct answers? 

(c) A weighted coin, when tossed, will land heads 
with probability ; and tails with probability 2. 
If this coin is tossed a total of six times, what is 


12. 


13. 


14. 


15. 


16. 
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the probability that it will land heads at most 
four times? 

(Hint: Pr[less than 5 successes] = 

1 — Pr[5 or more successes]. | 


A fair coin is tossed six times. Find the largest 
number n such that the probability of getting at least 
n heads is + or more. 


(a) Suppose that a gambler tosses three coins and 
receives $8 if three heads appear, $4 if two 
heads appear, $2 if 1 head appears, and $1 if no 
head appears. What is his mathematical 
expectation? 

(b) A game is devised in which a single die is 
thrown. A gambler agrees to pay out in dollars 
the amount shown on the die if the number is 
odd, and to collect the amount shown on the 
die if the number is even. What is his 
mathematical expectation? 


(a) A player randomly draws a single card from an 
ordinary deck of 52 cards. If the card is an ace, 
the player receives $6; if the card is a king, a 
queen, or a jack, the player receives $1. If any 
other card is drawn, the player loses $1. Is this 
game fair? 

(b) Suppose that the rules of the game in part (a) 
are modi ed so that the player wins $3 for an 
ace, $2 for a king, and 50¢ for a queen or a 
jack, but loses 50¢ if the card has a face value 
of 10,9, or 8 and loses 75¢ if it is a 7 or less. 
Is this game fair? 


Nicholas Bernoulli discussed the following problem: 
Each of two players A and B has a deck with the 
same cards (or tickets). They draw the cards singly 
until each draws the same, in which case 4 wins. If 
such a match does not occur, then B wins. The 
problem is to nd_ the probability of a win for either 
player. Suppose that the players have four cards each, 
numbered 1, 2, 3, and 4, and that player A draws 
them in their natural order—1, 2, 3,4. Compute for 
each player the probability of winning. [Hint: The 
number of permutations of the n integers 1 to n in 
which every number is in the wrong place is 

1 1 1 


vam(1 1+ 


‘a 
wy 3) 


The idea behind moral expectation was that it was 
necessary to represent the relative value of a given 


Bibliography 


sum of money not by its face value, but in relation to 
the capital of the person under consideration. Gabriel 
Cramer (1704-1752) suggested that the value derived 
from a sum of money varied with the square root of 
the sum. Thus, the moral expectation in his solution 

to the St. Petersburg paradox is 


A) 20) 48) = 
WI (s) ee 


What nite value does this series give? 
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10.1 


CHAPTER 10 
The Revival of Number Theory: 


Fermat, Euler, and Gauss 


Where there is number there is beauty. 


PROCLUS 
; Very few people in the sixteenth century were seri- 
Marin Mersenne and the Search ously interested in mathematics or science. Probably 
for Perfect Numbers there were only a few thousand of them—and most 


of those interested but not active. Many who worked 
Scientific Societies in science or mathematics did So in isolation, for even 

within the university communities their interests were 
seen as only peripherally important. Science, still tainted with the practice of magic, was 
not yet a respectable intellectual occupation, so that recognition came to those who ex- 
celled in more central elds of scholarship. Outside of books, there was no regular way 
for learned individuals to keep in contact with one another and publication was at times 
dif cult owing to ecclesiastical censorship and condemnation. 

As more people engaged in scientic pursuits, however, it was increasingly seen 
that advancement in knowledge was powerfully affected by the extent and speed of 
the communication of discoveries. Continual testing of ideas keeps science sound as it 
leads to the abandonment of disproved hypotheses. In a remarkable development from 
Renaissance times, when almost all scientic discussion—what little there was—took 
place within the framework of the university curriculum, there emerged in the 1600s a 
new pattern of social organization, the learned societies and academies independent of the 
universities. Beginning with the establishment of small and relatively informal amateur 
circles, this period culminated in the founding of the Royal Society of London (1660) 
and L’Académie Royale des Sciences (1666). Both are still active and distinguished 
professional societies. These organizations worked chie y to share scientic information 
within their own memberships and secondarily to make their collective ndings generally 
available to other interested groups. 

Contrasted with the groups that gathered about the famous philosophers of antiquity, 
this form of academy, organized and run by its members, was a product of sixteenth- 
century Italy. The numerous societies and academies that sprang up at this time (some 
claim more than 700) may be viewed as an attempt by groups of intellectuals to create for 
themselves a more congenial institution than the established university. Scholars sought 
to enrich their understanding by meeting in company to discuss subjects of common 
interest. The earliest of these modern academies tended to embrace almost the entire range 
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of intellectual activity. After the mid-1500s, academies devoted to specic disciplines 
were founded; more than half were literary societies, and the rest were divided among 
theatrical, medical, legal, and (more rarely) scientic pursuits. Almost always they were 
looked on with disfavor both by the civil authorities, who saw subversive teaching and 
political intrigue in their often secret meetings, and by the Church, which feared the 
propagation of heresies and the power of uncontrolled associations. 

The rst academy devoted chie y to science was probably the one Giambattista della 
Porta (1535-1615) established in Naples in 1560, called Accademia Secretorum Naturae. 
To become a member, a person must have made some original discovery or communi- 
cated some previously unknown fact in natural science. The members of della Porta’s 
academy called themselves the Otiosi, or Idlers, following the custom then prevalent in 
Italian societies of adopting humorously derogatory names. (Thus, one famous group of 
academicians was called the Umidi, or the damp ones [perhaps we could call them “all 
wet’’] and another bore the name Scomposti, which would mean “the confused, disorga- 
nized ones.”) The title Academia Secretorum Naturae made the Church suspicious, and 
della Porta was denounced as a practitioner of the black arts. He surely encouraged the 
charge by writing a book called Magia Naturalis (Natural Magic). The secrecy of his 
society and its members’ reputation for corresponding in cipher didn’t help matters. Al- 
though della Porta was personally exonerated of all charges, he was nonetheless ordered 
to close his academy. 

This short-lived attempt at formal scienti c organization anticipated the more impor- 
tant Accademia dei Lincei (Academy of the Lynx-Eyed), which was founded in Rome in 
1603 with the object of attempting new discoveries in physical science and publishing 
them to the world. Its emblem, a lynx clawing to death a Cerberus (a many-headed dog 
with a tail of snakes) was chosen to symbolize the struggle of scientic truth against 
ignorance. On any grounds of reasonable probability, the Accademia dei Lincei should 
have collapsed. For seven years, before it attracted any widely recognized scientist to 
membership, it consisted of exactly four members, all under 30 years of age. But in 
1610 the Accademia reorganized on a larger scale, with Galileo and the aging della Porta 
among its new members. The revived organization published several books its members 
had written, including two by Galileo. The great man seemed proud to use the title 
Lincean in his correspondence and on the title pages of most of his subsequent books. 

Galileo’s new optical instrument was given the name telescope in 1611 at a meeting 
of the Accademia dei Lincei. From time to time Galileo had used a variety of descrip- 
tive words such as eye-glass or perspective trunk; but this device was too signi cant an 
instrument not to have its place in the language. A member of the Accademia coined the 
name by combining two Greek words meaning “far” and “to look” to describe Galileo’s 
far-looking instrument. Galileo’s ideas were so disturbing that some men refused even 
to look into his telescope, for fear of seeing things that might contradict traditional phi- 
losophy or religion. The study of physics and astronomy became, in fact, too dangerous 
to pursue; and in 1630, when the Holy Of ce condemned Galileo, the Accademia dis- 
banded. It was reconstituted brie y in 1745, and permanently in 1801, so that the leading 
scienti c society in Italy is still the Accademia dei Lincei. 

The last of the important Italian scientic societies of this period is the Accademia 
del Cimento (Academy of Experiments), founded in Florence in 1657. The Medici fam- 
ily provided the necessary nancial support for assembling the nest collection of sci- 
entic equipment as was then available in Europe; but when a Medici was created a 
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Cardinal (1667), the activity of the Florentine academy was not deemed tting with his 
new dignity, and ceased. The end of the Accademia was not the end of scientic experi- 
mentation and discussion, however, as it had actually been only the formal organization 
for an already existing group of friends who met more or less regularly during the 1640s 
for experimentation and discussion of matters of scientic interest. A chief gure within 
this gathering was Galileo’s nal pupil, Evangelista Torricelli (1608-1647). Torricelli 
came to live with the blind and ailing Galileo in 1641, but they had little time to work 
together for the aged scholar passed away within three months. Torricelli was appointed 
mathematician to the Grand Duke of Tuscany, the post left vacant by Galileo, but Torri- 
celli’s career was cut short when he died suddenly, possibly of typhoid fever, ve years 
later at the age of 39. 

One of the experiments duplicated by the Accademia was Torricelli’s celebrated 
1643 demonstration of the weight of the air. His ingenious idea was to Il a four-foot- 
long tube, sealed at one end, with mercury and to invert it with its open end submerged 
in a basin of mercury. Observing that the level of mercury in the tube was not always 
the same, Torricelli hypothesized that the variations were due to the changing pressure 
of the air on the open surface of the mercury in the basin. Others attempted to deny 
Torricelli’s hypothesis by proposing that the vacant space in the tube above the mercury— 
subsequently known as the Torricelli vacuum—exerted a positive attraction. The crux of 
the simple and effective 1643 demonstration is that it refuted the traditional Aristotelian 
belief that “Nature abhors a vacuum.” 

Although his name is widely linked with the mercury barometer (measurer of weight, 
from the Greek baros, weight), Torricelli was a mathematician of considerable ability. 
He discovered something thought quite paradoxical in those days: that there exists a 
geometric solid that is in nitely long, yet has nite volume. This body, which he called 
an “acute hyperbolic solid,” is generated when the in nite plane region to the right of 
the line x = | and between the x-axis and the hyperbola y = 1/x is revolved around the 
x-axis. Torricelli communicated his discovery to the French geometers in letters written 
in 1644: the result seemed so counterintuitive and astonishing that at rst some of the 
leading European mathematicians thought it impossible. He was a virtual unknown when 
he succeeded to Galileo’s professorial chair in mathematics in Florence; two years later 
his name was acclaimed all over Europe. 


Marin Mersenne’s Mathematical Gathering 


The earliest-noted instance that has come down to us of a regular gathering of 
mathematicians is the circle held together by Father Marin Mersenne (1588-1648), an 
able mathematician and physicist, Jesuit-educated, and a friar of the Order of Minims. A 
man who made little contribution himself in the way of new knowledge, Mersenne had 
the capacity for understanding and appreciating the work of others. Arriving in Paris in 
1619, he found deplorable the lack of any sort of formal organization to which scholars 
might resort. He did his best to answer the need, partly by making his monastery a 
meeting place for those eager to discuss their results and to hear of similar work done 
elsewhere, and partly by acting as a link between scholars from different countries. At a 
time when publication of technical discoveries still lay in the future, Mersenne served as 
a personal and effective clearinghouse of scientic information. He made it his business 
to become acquainted with everyone of importance in the learned world through an 
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elaborate network of correspondence by which he transmitted news of the advancement 
of science in return for more news. He stimulated scienti c thought through the numerous 
questions he asked of anyone who might be able to contribute to nding an answer, and 
he communicated both questions and answers to others to elicit their reactions. 

Mersenne’s letter-writing ful lled many functions of a modern scientic journal. 
After his death, letters were found in his cell from 78 correspondents scattered over 
Europe, among them Fermat in France, Huygens in Holland, Pell and Hobbes in England, 
and Galileo and Torricelli in Italy. During a visit to Italy in 1645, Mersenne had met 
Torricelli and discussed the latter’s experimentation on the vacuum. Returning to Paris, 
Mersenne had the idea of taking a column of mercury to the top of a mountain to observe 
the effect of atmospheric pressure. But he missed his chance at scientic fame, because 
before Mersenne could carry out his plan, Pascal conducted the famous experiment 
demonstrating that the height of a mercury column varied with altitude. 

Mersenne’s most important contribution in the early days of the new science was 
maintaining more or less regular conferences for the exchange of ideas. At his monastery, 
he gathered around him an intellectual circle, composed mainly of the scientic com- 
munity of Paris but animated also by travelers passing through the city. These meetings 
seem to have taken place almost continually from 1635 until the end of Mersenne’s life 
in 1648. An entirely informal gathering with no publications or sets of laws to bind 
its members, the circle could scarcely be considered an academy at all. Its signi cance 
lay in the fame of those who attended the meetings, among them Desargues, Roberval, 
Descartes, and the Pascals, father and son. 

When any of the gathering had results to present, he would have them printed on 
loose sheets of paper, which would be distributed. At the age of 14, the precocious Blaise 
Pascal had been admitted to the circle of learned men with whom Mersenne surrounded 
himself, and at one of the weekly meetings, he presented a lea et that contained the 
“mystic hexagram” theorem. The one-page Essay pour les Coniques was in effect an 
announcement of a treatise that Pascal was preparing. It was never published and is now 
lost, though Leibniz saw it in Paris in 1676 and described its contents. 

Pro-Copernican as early as 1629, Mersenne was virtually Galileo’s representative in 
France. When he heard that Galileo was writing a book on the movement of the earth, 
he offered to get it published for him. However, Galileo was successful in having it 
published himself and copies were quickly sent to France. Not long after their arrival 
came word of Galileo’s trial and condemnation and the burning of his book. Mersenne 
was affected by the Church’s treatment of Galileo, but less so than one might imagine. He 
published, in 1634, under the title Les Mécaniques de Galilée, a version of Galileo’s early 
lectures (around 1592) on mechanics, and in 1639, a year after the original publication, 
he translated the Discorsi into French. He did not translate Galileo’s Dialogo when it 
appeared in 1632, possibly because he felt that Galileo had broken his promise to the 
pope not to come down heavily in favor of the Copernican theory; but he did bring out 
a summary account of parts of it. As Italian was little understood abroad, Mersenne was 
instrumental in popularizing Galileo’s investigations on the Continent. It is worthy of 
notice that this was done by a faithful member of a Catholic religious order, at the height 
of the Church’s hostility to science. 

After Mersenne’s death, the conferences continued to be held at regular intervals in 
various homes in and around Paris, including Pascal’s. It is usual to regard the Académie 
Royale des Sciences, which was formed in 1666, as the more or less direct successor of 
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these numerous private gatherings. The new academy was composed of two sections: a 
mathematical one including all “exact sciences,” and a physical one concerned with the 
more “experimental sciences” such as physics, chemistry, botany, and anatomy. As its 
name indicates, this society was a royal institution as well as a scientic academy; its 
members acknowledged two masters, science and the crown. The state was willing to 
commit considerable funds to the support of the Académie so that it could reap the bene ts 
of its scientic inquiry. Whereas the Royal Society in England was forever struggling 
with nancial dif culties, the members of the French body drew xed pensions and were 
to give all their time to the Académie. The resources of the royal treasury attracted 
scholars from all quarters, so that at times the Académie Royale des Sciences seemed 
more a European than a French gathering. At rst its sessions were closed to the public, 
and even the minutes of the meetings remained the private property of the Académie 
until extracts were printed. After several years, the Académie began to share its ndings 
through the newly founded Journal des savants. 

The Journal des savants, not originally af liated with the Académie, was founded 
in 1665 to deal with arts, sciences, and theology. Its appearance met with such violent 
opposition from religious circles that its publication was temporarily suspended, its editor 
removed, and its sphere of activities seriously curtailed before it could resume publication. 
When it returned to publication, it did so as a quasi-of cial periodical of the Académie. 
Although in theory the Académie chose in favor of anonymous publication, seventeenth- 
century scientists were no less concerned with establishing their academic reputations 
than their modern counterparts are. Thus, after announcing a new idea or discovery in 
a closed meeting of the Académie, an individual would often send his ndings to the 
Journal des savants. 

The development of science in the 1600s was signally indebted to three associations: 
the Accademia dei Lincei in Rome, which existed for but a short time; the Académie 
Royale des Sciences in Paris; and the Royal Society of London. Paralleling events on 
the Continent, the founding of the Royal Society was an outgrowth of informal meetings 
of friends, more or less learned but all deeply interested in experimental knowledge. 
Several English scientists had visited the academies in Italy as well as the conferences 
of Mersennes and others in Paris and advocated forming a comparable organization in 
Great Britain. This led at rst to the gatherings for amateurs in science, one centering 
in Oxford around Robert Boyle and the other in London at Gresham College, where the 
young Christopher Wren (remembered mainly for his architectural achievements) was 
the leading spirit in the discussion of matters of science. 

It was to this group at London that the immediate rise of the Royal Society can 
be traced. The event that inaugurated the society as an organized body occurred on 
November 28, 1660, when 12 friends at Gresham College decided to initiate formal 
meetings each week—if possible, under the patronage of Charles II himself. Temporary 
ofcers were elected, rules drawn up, and a tentative list of members determined. An 
admission fee of one shilling and a further weekly fee were to be charged to defray 
the expenses of experimentation. The members agreed to refrain from any discussion 
of current news or questions of the day; science was to be the group’s sole topic. In 
1662 the Royal Society of London for the Promotion of Natural Knowledge received its 
formal charter and mace from Charles I, but unlike its French counterpart, it received 
little else. At rst the number of members was limited to 55, but it was afterward 
extended, and nally admission was left open to every proper candidate. The famous 
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A visit by Louis XIV in 1671 to the Académie des Sciences. (By courtesy of the 
University of California, Berkeley Library.) 


Philosophical Transactions of the Royal Society, which began to appear in 1665, and the 
Journal des savants were the rst journals to include mathematical and scienti c articles. 


Numbers, Perfect and Not So Perfect 


Mersenne’s name is connected with one of the oldest problems in the theory of num- 
bers, nding all perfect numbers. Early Greek mathematicians were especially intrigued 
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by the relations between a number and its aliquot divisors (positive divisors less than the 
number itself). To an age for which numbers held mystical properties, it was considered 
remarkable that 6 is the sum of its aliquot divisors: 


6=1+4+2+3. 


The next number after 6 enjoying this feature is 28, since the positive divisors of 28 are 
1, 2, 4, 7, 14, and 28, and 


28 =1424447414. 


The Pythagoreans, in line with their philosophy of attributing certain social qualities, and 
later even ethical import, to numbers, called such number “perfect.” A precise de nition 
is as follows, 


Definition 


A positive integer n is said to be perfect if n is equal to the sum of all its positive 
divisors, not including n itself: 


If we let o(m) denote the sum of all the positive divisors of n, then the sum of 
the positive divisors less than n is given by o(n) — n. Thus the condition “x is perfect” 
amounts to the requirement o(n) — n = n, or equivalently, 


o(n) = 2n. 
As examples, we have 
o(6)=1+2+4+3+6=2-6. 
and 
o0(28)=1+2+4+74+144 28 =2. 28. 


For many centuries, philosophers were more concerned with the ethical and religious 
signi cance of perfect numbers than with their mathematical properties. In his De Civitate 
Dei, Saint Augustine (354-430) explained that although God could have created the world 
all at once, He preferred to take six days, because the perfection of the work is symbolized 
by the (perfect) number six. To quote Saint Augustine: 


Six is a number perfect in itself, and not because God created all things in six days; rather, 
the converse is true. God created all things in six days because this number is perfect, and 
it would have been perfect even if the work of six days did not exist. 


Early commentators on the Old Testament argued that the perfection of the universe is 
represented by 28, the number of days the moon takes to circle the earth. In a similar 
vein, Alcuin of York (735-804), Charlemagne’s educational advisor, observed that the 
whole human race is descended from the eight souls on Noah’s ark and that this second 
creation was less perfect than the rst, 8 being an imperfect number. 

Only four perfect numbers were known to the ancient Greeks. Nicomachus of Gerasa, 
who summarized the existing knowledge of the theory of numbers in his /ntroductio 
Arithmetica (circa A.D. 100), lists 


P, =6, Py) = 28, P3; = 496, Py = 8128. 
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He said “the perfect numbers are easily enumerated and arranged in suitable order’—one 
found among the units, one among the tens, one among the hundreds, and one among the 
thousands (that is, below 10,000). Using this meager evidence, later writers conjectured 
that 


1. The nth perfect number P, contains exactly n digits. 
2. The even perfect numbers end alternately in the digits 6 or 8. 


Both assertions are wrong. There is no perfect number with ve digits; the next 
perfect number (rst given correctly in an anonymous fteenth-century manuscript) is 


Ps = 33,550,336. 
Although the nal digit of Ps is 6, the succeeding perfect number 
Po = 8,589,869,056 


ends in 6 also, not 8 as conjectured. To salvage something in the positive direction, we 
shall show later that the even perfect numbers always do end in 6 or 8, but not alternately. 

If nothing else, the magnitude of Ps should convince the reader of the rarity of 
perfect numbers. It is not yet known whether there are nitely or innitely many of 
them. 

Determining the general form of all perfect numbers dates to ancient times. It was 
partially accomplished by Euclid when in Proposition 36 of Book IX of the Elements, 
he proved that if the sum 


14+2427+2?+...42%~!=p 


is a prime number, then 2‘ ~ ! 


own words: 


p is a perfect number (of necessity even). To quote Euclid’s 


If as many numbers as we please beginning from a unit be set out continuously in double 
proportion, until the sum of all becomes prime, and if the sum multiplied into the last makes 
some number, the product will be perfect. 


For example, 1 + 2 + 4 = 7 isa prime, hence 4 - 7 = 28 is a perfect number. Euclid’s 
argument makes use of the formula for the sum of a geometric progression, 


a? ee Oe aS, 


that is found in various Pythagorean texts. In this notation, the result reads as follows. 


If 2 —1 is prime (k > 1), then n = 2* ~ '(2* — 1) is a perfect number. 


To demonstrate the theorem, it is necessary to know all the divisors of the integer 
2* — 1(2* — 1). Let us therefore take a detour and consider the problem of nding the 
divisors of an arbitrary positive integer. For this problem, we have a simple, concise 
answer. 


Ifn= pi pe ... p* is the prime factorization of the integer n > 1, then the positive 
divisors of n are precisely those integers d of the form 


=p pp, , whee sap = & -@— 1, 2.44.57). 
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An example should clarify matters. For n = 180, we have the prime factorization 
180 = 27.3? . 5. 


The lemma asserts that the positive divisors of 180 are integers of the form 


27) . 3% . 5%. 
where a, = 0, 1, 2;a. = 0, 1, 2;a3 = 0, 1. These are the integers 
1-1-1, 1-1-5, 1-3-1, 1-3-5, 1-32-1, 1.32.5, 
Qe 11 Qs 125, 2-3-1, 2-3-5, 2-37-1 2.37.5, 


22.161, 2.13, 23321, 27.3.5, 321, 2° 3725, 
or put in sequential order, 
1,2, 3,4, 5, 6,9, 10, 12, 15, 18, 20, 30, 36, 45, 60, 90, 180. 
This last lemma enables us to prove Euclid’s theorem. It tells us that if 2" — 1 is 
prime, then each of 
ee ing = ba 1 OH hy OO 
is a divisor of n = 2—'(2* — 1), and that these exhaust the possibilities for divisors. Thus 
there are two sums to be added: 
SH 14247 42.491 
and 
Sp = (2* — 1) + 2(2* — 1) + 27(2* — 1) +--- + 24-102" — 1) 
= (14242? haved FOF 1) 
= §,(2* — 1). 
Added, these yield 
o(n) = S, + S,(2* — 1) = 2* 8). 


But the formula for the sum of a geometric progression gives S; = 2* — 1, so that our 
total is 


o(n) = 242k — 1) = 2-2" 12 — 1) = 2n 


the effect of which is to make n a perfect number. 

In the eighteenth century, the great Swiss mathematician Leonhard Euler (1707— 
1783) showed that all even perfect numbers must be of the form expressed by Euclid’s 
theorem. That is, an even number n is perfect if and only if it satis es the formulation 


n = 2*1* — 1), 


where 2‘ — 1 is prime. No odd perfect number has ever been found, yet it has never 
been proved that such a number cannot exist. 

Although no one knows whether any exist, many interesting results have been ob- 
tained concerning the structure of odd perfect numbers. The bulk of present-day research 
concerns itself primarily with the determination of bounds on the number of prime fac- 
tors of such a number, and on its size. What is currently known in this direction is that 
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no odd perfect number can have fewer than eight different prime factors or be less than 
10°°°, Although all this lends support to the belief that there are no odd perfect numbers, 
only a proof of their nonexistence would be conclusive. “It must always stand to the 
credit of the Greek geometers,” wrote the mathematician James Sylvester in 1888, “that 
they succeeded in discovering a class of perfect numbers which in all probability are the 
only numbers which are perfect.” 

Euclid’s theorem immediately raises a new question: For what values of k is the 
integer 2* — 1 prime? One step toward a solution can be made at once; namely, if 2" — 1 
is prime, then the exponent & must itself be a prime. Suppose to the contrary that k is a 
composite number, say, k = rs, where | <r <s <k. Using the formula 


PeleG@ehe 429" 42-4274 fy, 
we could then write 


2*—1=2-1=(Q’y-1 
= (2” = 1(2"6-) ae 9r(s—2) oe or + 1). 


Because each factor on the right is plainly greater than 1, this violates the primality of 
2* — 1; hence k must be a prime number. 

For the primes p = 2,3,5, and 7, the values 3, 7,31, and 127 of 2? — 1 are prime 
numbers, so that by Euclid’s formula 


2(22 — 1) =2-3=6, 
24(23 — 1) =4-7=28, 
24(25 — 1) = 16-31 = 496, 
26(27 — 1) = 64-127 = 8128 


are all perfect numbers. 

Many early writers erroneously believed that 2? — | is prime for every choice of 
the prime number p. But in 1536, Hudalrichus Regius in a work entitled Utriusque 
Arithmetices exhibited the correct factorization 


2! _ 1 = 2047 = 23 - 89. 


If this seems a small accomplishment, it should be realized that his calculations were in 
all likelihood carried out in Roman numerals, with the aid of an abacus (not until the late 
1500s did the Arabic numeral system win complete ascendancy over the Roman one). 
Regius also gave p = 13 as the next value of p for which the expression 2? — | is a 
prime. From this, one obtains the fth perfect number 


Ps = 2'2(23 — 1) = 33,550,336. 


In 1603 Pietro Cataldi, who is chie y remembered for his work on extracting roots 
by means of continued fractions, published a table of factors of all numbers up to 800, 
with a separate list of primes up to 750. By the laborious procedure of dividing by all 
primes not exceeding their respective square roots, Cataldi determined that 2!’ — 1 and 
2!° — 1 were both primes, and in consequence, that 


Ps = 2'6(2!7 — 1) = 8,589,869,056 
and 


P, = 2'8(2!° — 1) = 137,438,691,328 
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were the sixth and seventh perfect numbers. He also stated that 2” — 1 was prime for 
n = 2,3,5,7, 13,17, 19, 23,29,31, and 37. In 1640, however, Fermat disproved 
Cataldi’s claim by nding factors of 27? — 1 and 23” — 1, while Euler (1738) showed 
that 2” — 1 was composite. 

One of the dif culties in discovering further perfect numbers was the lack of tables 
of primes and composites. The rst fair-sized table, one giving the least prime factors 
of all numbers not divisible by 2 or 5, up to 24,000, was published by J. H. Rahn 
as an appendix to his Teusche Algebra (1659). In 1668, John Pell of England extended 
this table to include numbers up to 100,000. The table constructed by the Viennese 
schoolmaster Anton Felkel is interesting because of its singular fate. The rst volume 
of Felkel’s computations, giving factors of numbers not divisible by 2, 3, or 5 up to 
408,000, was published in 1776 by the Imperial Treasury of Austria; when only a few 
subscriptions were received, the remaining copies were con scated, and their pages were 
used as paper in the manufacture of cartridges for a war against the Turks. Even more 
ill-starred was the table calculated by J. P. Kulik (1773-1863), which was deposited in 
the library of the Vienna Royal Academy in 1867 but was never published at all. Kulik, 
a professor of mathematics at the University of Prague, devoted 20 years of his life to 
preparing, unassisted and alone, the factors of numbers up to 100,000,000. 

It has become traditional to call numbers of the form 


M, = 2” -1, n> 1, 


Mersenne numbers, in honor of an incorrect but provocative assertion Marin Mersenne 
made about their primality. Those Mersenne numbers that happen to be prime are said 
to be Mersenne primes. As we have seen, the determination of Mersenne primes—and 
in turn, the quest for more even perfect numbers—can be narrowed to the case in which 
the exponent 7 is itself a prime. If it were known that there were in nitely many primes 
p for which M,, is prime, then there would exist an in nitude of (even) perfect numbers. 
Unfortunately, this is another one of those famous unresolved problems. 

In the preface of his Cogitata Physico-Mathematica (1644), Mersenne stated that M, 
was prime for p = 2, 3,5, 7, 13, 17, 19, 31, 67, 127, and 257, and composite for all other 
primes p < 257. It was obvious to other mathematicians that Mersenne could not have 
tested for primality all the numbers he had announced; but neither could they. Whether 
Mersenne had available some theorem not yet rediscovered or was relying mainly on 
personal guessing is conjectural. He did admit that “to tell if a given number of 15 or 
20 gures is prime or not, all time would not suf ce for the test.” The number Mj 7 
contains 39 digits, and M57 has 78. 

For nearly 150 years, Cataldi’s Mj 9 remained the largest Mersenne prime. Then 
Euler (1732) veri ed that M3, was prime by examining all prime numbers up to 46,339 
as possible divisors: but Mg7, M27, and M257 were beyond his technique. In any event, 
M3, was the largest known prime for the next century, and it gave rise to the ninth 
perfect number. 


Po = 279(23! — 1) = 2,305,843,008, 139,952, 128. 


Euler, one of the greatest mathematicians of all ages, was not immune from making false 
claims about the occurrence of perfect numbers. He wrote: 


I venture to assert that aside from the cases noted [Euler earlier mentioned 11, 23, 29, 37, 
43, 73, 83], every prime less than 50, and indeed than 100, makes 2”~!(2” — 1) a perfect 
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» Marin Mersenne 
, (1588-1648) 


(By courtesy of Columbia University, David Eugene 
Smith Collection.) 


number, whence the eleven values 1, 2, 3, 5, 7, 13, 17, 19, 31, 41, 47 of 1 yield perfect 
numbers. 


Euler eventually (1753) detected his own error as to n = 41 and 47. 

Not until 1947, after labor that was tremendous because of the unreliability of the 

desk calculators used, was examination completed on the prime or composite character of 
M, for the last of the 55 primes in the range p < 257. We know now that Mersenne made 
ve mistakes, although it is astonishing that it took over 300 years to set the good friar 
straight. The rst error in Mersenne’s statement was found when Pervusin (1883) and 
Seelhoff (1886) proved independently that Me; was prime. After Cole (1903) discovered 
factors for M67, some of Mersenne’s defenders suggested that 67 was merely a misprint 
for 61. Subsequent investigations revealed, however, three other errors in the list: Mgo 
is prime (Powers in 1911), Mjo7 is prime (Fauquembergue and Powers, independently, 
1914-1917), and M57 is composite (Kraitchik in 1922). 

A historical curiosity is that in 1876 Edouard Lucas worked a test whereby he was 
able to prove that the Mersenne number M67 was composite; but he could not produce the 
actual factors. At the October 1903 meeting of the American Mathematical Society, the 
American mathematician Frank Nelson Cole had a paper on the program with the modest 
title On the Factorization of Large Numbers. When called on to speak, Cole walked to 
the blackboard, and saying nothing, raised the integer 2 to the sixty-seventh power; then 
he carefully subtracted 1 from the resulting number and let the answer stand. Silently he 
moved to a clean part of the board and multiplied, longhand, the product 


193,707,721 x 761,838,257,287. 


The two calculations agreed. The story goes that for the rst and only time on record, 
this august body gave the presenter of a paper a standing ovation. Cole took his seat 
without having uttered a word; no one asked him a question. (Later he con ded to a 
friend that it took him “three years of Sundays” to nd the factors of M67.) 
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A question allied to the existence of Mersenne primes is whether there are in nitely 
many primes p such that the numbers M, are composite. Even the answer to this is 
unresolved. However, much larger values of p are known for which M, is composite 
than for which M, is prime. For example, 


M32376604223 


is composite. Consider the fantastic size of this number: if written in decimal form, its 
digits would more than lI the telephone books of all ve boroughs of New York City. 

After giving the eighth perfect number 23°(23! — 1), Peter Barlow in his book Theory 
of Numbers (published in 1811) concluded from its size that it “is the greatest that ever 
will be discovered; for as they are merely curious, without being useful, it is not likely 
that any person will ever attempt to nd one beyond it.” The very least that can be said 
is that Barlow underestimated the obstinacy of human curiosity. Although the subsequent 
search for larger perfect numbers provides us with one of the fascinating chapters in the 
history of mathematics, an extended discussion would be out of place here. 

It is worth remarking, however, that the rst 12 Mersenne primes (hence, 12 perfect 
numbers) have been known since 1914. The twelfth in order of discovery, Mgo, was the 
last Mersenne prime disclosed by hand calculation. Its primality was veri ed by both 
Powers and Cunningham in 1914, working independently and by different techniques. 
The prime M27 was found by Lucas in 1876 and for the next 75 years was the largest 
number actually known to be prime. 

Calculations whose mere size and tedium repel the mathematician are grist for the 
computer’s mill. Starting in 1952, 29 additional Mersenne primes, all huge, have come 
to light. The Mersenne prime M4936583, found in 2004, is the largest of the known prime 
numbers. It in turn gives rise to the largest known even perfect number, the forty- rst, 


Pay = 274036582 (924036583 _ 1), 
an immense number of 14,591,877 digits. 

Another type of number that has had a continuous history extending from the early 
Greeks to the present time constitutes the amicable numbers. Two numbers such as 220 
and 284 are called amicable, or friendly, because they have the remarkable property that 
within each number is “contained” the other, in the sense that each number is equal to 


the sum of all the positive divisors of the other, not counting the number itself. Thus, as 
regards the divisors of 220, 


14+2+44+54+104+114+ 20+ 22+ 44+4+55+ 110 = 284, 
while for 284, 
14+2+4+471+4 142 = 220. 


In terms of the o function, amicable numbers m and n (or an amicable pair) are de ned 
by the equations 


o(m)—-m=n, o(n)—-n=m, 
or what amounts to the same thing, 


o(m)=m+n=o(n). 
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Down through their quaint history, amicable numbers have been important in magic 
and astrology, and in casting horoscopes, making talismans, and concocting love po- 
tions. The Greeks believed that these numbers had a particular in uence in establish- 
ing friendship between two individuals. The philosopher Iamblichus of Chalcis (circa 
A.D. 250-330) ascribed a knowledge of the pair 220 and 284 to the Pythagoreans. 
He wrote: 


They [the Pythagoreans] call certain numbers amicable numbers, adopting virtues and social 
qualities to numbers, as 284 and 220; for the parts of each have the power to generate the 
other, according to the rule of friendship, as Pythagoras afrmed. When asked what is a 
friend, he replied, “Another I,” which is shown in these numbers. 


Biblical commentators spotted 220, the lesser of the classical pair, in Genesis 32:14 
as numbering Jacob’s present to Esau of 200 she-goats and 20 he-goats. According to 
one commentator, Jacob wisely counted out his gift (a “hidden secret arrangement’’) 
to secure the friendship of Esau. An Arab of the eleventh century, El Madschriti of 
Madrid, related that he had put to the test the erotic effect of these numbers by giv- 
ing someone a confection in the shape of the smaller number, 220, to eat, while he 
himself ate the larger, 284. He failed, however, to describe what success that ceremony 
brought. 

It is a mark of the slow development of number theory that until the 1630s no one 
had been able to add to the original pair of amicable numbers discovered by the Greeks. 
The rst explicit rule described for nding certain types of amicable pairs is due to Thabit 
ibn Kurrah, an Arabian mathematician of the ninth century. In a manuscript composed 
at that time, he indicated: 


If the three numbers p = 3-2”-! —1,q =3-2”—1, and r =9-2”"-'— | are all prime 
and n > 2, then 2” pq and 2”r are amicable numbers. 


It was not until its rediscovery centuries later by Fermat and Descartes that Thabit’s rule 
produced the second and third pairs of amicable numbers. In a letter to Mersenne in 1636, 
Fermat announced that 17,296 and 18,416 were an amicable pair, and Descartes wrote 
to Mersenne in 1638 that he had found the pair 9,363,584 and 9,437,056. Fermat’s pair 
resulted from taking n = 4 in Thabit’s rule (p = 23, g = 47, r = 1151 are all prime) and 
Descartes’s from n = 7 (p = 191, g = 383, r = 73,727 are all prime). 

In the 1700s, Euler drew up at one clip a list of 64 amicable pairs; two of these 
new pairs were later found to be “unfriendly,” one in 1909 and another in 1914. Adrien- 
Marie Legendre in 1830 found another pair, 2,172,649,216 and 8,520,191. Then in 
1866, a 16-year-old Italian schoolboy, Niccolo Paganini, startled the mathematical world 
by announcing that 1184 and 1210 were amicable. This was the second lowest pair and 
had been completely overlooked by all previous investigators. Although the boy did not 
describe how he found the pair (probably by trial and error), the discovery links his name 
forever with the history of number theory. 

Today, more than 50,000 amicable pairs are known, some of them running to 
320 digits; these pairs include all those with values less than 10!!. It has not yet been 
established whether the number of amicable pairs is nite or innite, nor has a pair 
been produced in which the numbers are relatively prime. Part of the problem is that 
in contrast with the single formula for generating perfect numbers (even), there is no 
known rule for nding all amicable pairs of numbers. 
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10.1 Problems 


. Show that the integer 496 is a perfect number by 
showing that 0 (496) = 2 - 496; in other words, sum 
up all the positive divisors of 496. 

. Verify that the integer n = 2'°(2'! — 1) is not a 
perfect number, by showing that o(n) 4 2n. [Hint: 
2! _ 1 = 23-891] 


. Verify that: 


(a) No power of a prime can be a perfect number. 


(b) The product of two odd primes is never a 
perfect number. [Hint: Expand the inequality 
(p —1)(q — 1) > 2 to get pq > pt+q+tl1.] 


. Prove that every even perfect number is also a 
triangular number. [Hint: Given the perfect number 
p = 2*!(2* — 1), show that p = f,, where 
n=2_-1] 


. It has been observed that 


6=242?, 
28 = 2? +23 +24, 

496 = 24 4+ 25 4 2°4 274 28. 
Can this procedure of representing even perfect 
numbers as sums of successive powers of 2 go on 
forever? [Hint: Establish the identity 

2k-1(2k _ 1) = 9k-1 + Ok a Qk+1 

a ee pha 

. Verify that when the reciprocals of all the positive 
divisors of a perfect number are added together, the 


10.2 From Fermat to Euler 


Fermat’s Arithmetica 


10. 


11. 


12. 
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total is 2. For example, the positive divisors of 6 are 
1, 2, 3, and 6; and 


1 1 1 1 
f+54+54+7=2. 


[Hint: Note if d|n, then n = dd’ for some d’; hence, 
the sum 


7 i / 
dil/d = Sidi) = — 941 


d\n d'\n d'\n 
From the facts 


28 = 1433 and 
496 = 134374547 


conjecture a theorem and show that this guess holds 
for the next even perfect number. 


For any even perfect number n = 2*—!(2* — 1), prove 
that 2*|o(n?) + 1. 


Prove that an integer n is one member of an amicable 
pair of numbers if and only if s(s(”)) = 1, where 
s(n) denotes the sum of the aliquot divisors of n. 


Conrm that the pair of numbers found by the Italian 
schoolboy in 1866, namely 1184 = 2° - 37 and 
1210 =2-5- 11’, actually form an amicable pair. 


For a prime p, show that neither p nor p” can be 
one of an amicable pair. 


Verify that the amicable pair 220 and 284 occurs on 
taking n = 2 in Thabit’s rule; this rule yields 
amicable pairs for n = 2,4, and 7, but for no other 
value n < 200. 


Few periods were so fruitful for mathematics as the 
seventeenth century. Northern Europe alone produced as 
many individuals of outstanding ability as had appeared 
during the preceding millennium. At a time when such 
names as Desargues, Descartes, Pascal, Wallis, Bernoulli, 
Leibniz, and Newton were becoming famous, a certain French civil servant, Pierre de 
Fermat (1601-1665), stood as an equal among these brilliant scholars. Fermat, the “Prince 
of Amateurs,” was the last great mathematician to pursue the subject as a sideline to a 
nonscientic career. By profession a lawyer and magistrate attached to the provincial 
parliament at Toulouse, he sought refuge from controversy in the abstraction of math- 
ematics. Fermat evidently had no particular mathematical training, and evidenced no 
interest in its study until he was past 30; to him it was merely a hobby to be cultivated in 
leisure time. Yet no practitioner of his day made greater discoveries or contributed more 
to the advancement of the discipline. He was one of the inventors of analytic geometry 
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Pierre de Fermat 


(1601-1665) 


(Reprinted with permission of Unwin Hyman Ltd., from 
. A History of Science, Technology and Philosophy in 
ES eae, ee the 16th and 17th Centuries, by A. Wolf:) 


(the actual term was coined in the early 1800s), he laid the technical foundations of 
differential and integral calculus, and with Pascal he established the conceptual guide- 
lines of the theory of probability. Fermat’s real love in mathematics was undoubtedly 
number theory, which he rescued from the realm of superstition and occultism, where it 
had long been imprisoned. His contributions there overshadowed all else. It may well be 
said that the revival of interest in the abstract side of number theory began with Fermat; 
for by his refusal to accept rational solutions to diophantine problems, insisting rather 
on solutions in the integers, he represented a break with the classical tradition of the 
Arithmetica. 

With the fall of Constantinople to the Turks in 1453, the Byzantine scholars who 
had been the chief custodians of mathematics brought the ancient masterpieces of Greek 
learning to the West. It is reported that the remnants of the Greek text of Diophantus’s 
Arithmetica were found in Padua about 1462 by Regiomontanus. Presumably, it too 
had been brought to Italy by the refugees from Byzantium. Regiomontanus observed, 
“In these books the very ower of the whole of arithmetic lies hid,” and he tried to 
interest others in translating it. Notwithstanding the attention that was called to the 
work, it remained practically a closed book until 1572, when the rst translation and 
printed edition was brought out by the German professor Wilhelm Holzmann, who 
wrote under the Grecian form of his name, Xylander. (Bombelli had translated ve of 
the seven books of the Arithmetica in 1570, using a Vatican manuscript, but this version 
of Diophantus has never come to light.) The Arithmetica became fully accessible to 
European mathematicians when Claude Bachet, borrowing liberally from Xylander, 
published (1621) the original Greek text, along with a Latin translation containing notes 
and comments. The Bachet edition probably has the distinction of being the work that 
rst directed the attention of Fermat to the problems of number theory. 

Fermat, the son of a prosperous leather merchant, went from the University of 
Toulouse to the University of Orleans where he received a law degree in 1631. 
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Because neither university boasted a mathematician or a scientist of particular note, 
his mathematical education probably did not extend beyond the rst six books of Eu- 
clid’s Elements and some symbolic algebra from Viéta’s new treatise, /ntroduction to the 
Analytic Art (1591). But by the spring of 1636, Fermat had set down a theory of what 
we should now call analytic geometry almost identical with what René Descartes had 
developed in his La Géométrie of 1637. Despite their simultaneous appearance on the 
mathematical scene, the two systems were entirely independent innovations. Descartes’s 
Géométrie circulated widely in print: Fermat’s [Introduction to Plane and Surface Loci 
remained in manuscript form until 1679. The /ntroduction at least ended Fermat’s iso- 
lation from the mathematical community at large, for it soon brought a letter from that 
connecting link between men of science, Father Marin Mersenne, inviting him to share 
his future ndings with the Parisian circle. 

Fermat preferred the pleasure he derived from mathematical research itself to any 
reputation that it might bring him. Indeed, he published only one important manuscript 
during his lifetime, and that just ve years before his death, using the concealing 
initials M.P.E.A.S. Adamantly refusing to bring his work to the state of perfection that 
publication would demand, he thwarted the several efforts of others to make the results 
available in print under his name. Roberval, as early as 1637, offered to edit and publish 
some of Fermat’s papers, but he was told, “Whatever of my works is judged worthy 
of publication, I do not want my name to appear there.” Yet the man who shunned 
formal publication still took pride in his achievements and did not want to remain 
entirely unknown. He carried on a voluminous correspondence with contemporary 
mathematicians in which he circulated his latest discoveries, offering to Il in gaps 
when leisure permitted. Most of what little we know about his investigations is found 
in the letters to friends with whom he exchanged problems and to whom he reported 
his successes. They did their best to publicize Fermat’s talents by passing these letters 
from hand to hand or by making copies, which were dispatched over the Continent. 

Fermat seems to have been one of those rare persons who was modest enough 
to believe that a few jottings of proof or even a statement of a theorem was enough 
for all the world to understand. This habit of communicating results piecemeal, usually 
as challenges, was particularly annoying to the Parisian mathematicians. At one point 
they angrily accused Fermat of posing impossible problems and threatened to break off 
correspondence unless more details were forthcoming. 

Because his parliamentary duties demanded an ever greater portion of his time, 
Fermat was given to inserting notes in the margin of whatever book he happened to be 
using. Fermat’s personal copy of the Bachet edition of Diophantus held in its margin many 
of his famous theorems in number theory. These were discovered ve years after Fermat’s 
death by his son Samuel, who brought out a new edition of the Arithmetica incorporating 
his father’s celebrated marginalia. Because there was little space available, Fermat’s habit 
had been to jot down some result and omit all steps leading to its conclusion. Posterity 
has often wished that Fermat had been a little less secretive about his methods, or at 
least that the margins of the Arithmetica had been wider. 

Early in his career, Fermat paid much attention to classical divisibility questions 
concerning the function o(7), the sum of the divisors of m, such as_ nding the solutions of 
o(n) = 2n (the perfect numbers) or the solutions to o(n) = n + m = o(m) (the amicable 
pairs). The crux of the problem of perfect numbers is to ascertain which numbers 2? — 1 
are prime. While investigating this question, Fermat wrote to Mersenne, probably in June 
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1640, that he had discovered that if p were a prime, then 2p~ would divide the number 
(2? — 1)—1. Phrased differently, this says that p always divides 2?~' — 1. This was 
only a taste of more to come, for shortly thereafter Fermat announced that he was able 
to derive a far more general result. In a letter to his friend, Bernard Frénicle de Bessy, 
dated October 18, 1640, he stated essentially the following: 


If p is a prime and a is any integer not divisible by p, then a?~' — | is divisible 
by p. 


For example, since 11 is prime, 3'° — 1 must be divisible by 11; indeed, a simple calcu- 
lation shows that 


310 _ 1 = 59,048 = 11 - 5368. 
A variation in which there is no longer a divisibility restriction involving a and p, is: 
For any number a and prime p, p divides a? — a. 


This result is now known as “Fermat’s theorem,” although some writers call it Fermat’s 
little theorem, to distinguish it from the more famous last theorem. 

As usual, Fermat did not say how he arrived at his theorem, writing to Frénicle 
merely, “I would send you the demonstration, if I did not fear its being too long.” 
Nowhere among his papers did he leave a hint of a proof, a trace of his “demonstration.” 
The rst published proof was given by Euler in the Proceedings of the St. Petersburg 
Academy in 1736; however, manuscripts of Leibniz in the Hanover Library show that 
he had proved Fermat’s theorem sometime before 1683. Euler’s argument, like the one 
of Leibniz, is based on the use of the binomial expansion, and it is hardly possible to 
doubt that this was the proof that Fermat had in mind. 

The proof rests on the almost evident point that if p is prime, then the binomial 
coef cients (@) are divisible by p for k = 1,2,..., p — 1, where 


k 
(7 )=mee 
k) 1-2-3---k : 


From the de nition of (2) we have 


k 


Here, p divides the right-hand side, so that it must also divide the left-hand sides. But 
the product 1-2-3---k, consisting of factors less than p, cannot be divisible by p; the 
implication is that p divides the coef cient cay as asserted. 

We now turn to Fermat’s theorem itself. 


1-2:3--04(2) = pip = Dp =2)---(p— kD 


If p is a prime, then a? — a is divisible by p for any integer a. 


Proof: When p = 2, the theorem can be checked easily. Here, we have 
a? —a=a* —a=a(a— 1), where the factors a and a — | are consecutive integers; 
thus, one of them is even and their product is divisible by 2. 

The argument showing that a? — a is divisible by any odd prime p involves 
induction on the integer a. If a = 0 or 1, the value of a? — a is zero, which is 
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divisible by p; so let us consider a > 1. Assuming that the result holds for a, we 
must conrm its validity for the case a + 1. In light of the binomial theorem, 


wren (er (ereot (yt jr 


Transposing gives 


Grae a1" rte je seal * le 
1 2 p=1 


Because p divides every binomial coef cient on the right-hand side of this equation, 
it must divide the entire right-hand side, which is simply to say that p divides 

(a + 1)? — a? — 1. Combining this observation with the induction assumption that p 
divides a? — a, one concludes that p divides the sum 


[(a + 1)? — a? —1]4+ (a’ —a)=(a4+ 1)? —(a+ 1). 


Thus, the desired conclusion holds for a + 1 and consequently for all a > 0. 
If a is a negative integer, there is no dif culty. For a = —b, where b > 0, so that 


a? — a = (—b)? —(—b) = —(b? —b). 


Because b is positive, we know already that p divides b? — b. 


| 


The main device by which Fermat proved dif cult theorems was his method of 
in nite descent. He referred to this technique often in announcing to the world theorems 
“which I have shown by the method of in nite descent.” Unfortunately, Fermat recorded 
most of his mathematics in his mind and not on paper, so he seldom explained how 
the descent was achieved. Perhaps his “proof” lay more in his faith in the applicability 
of the method than in actually having carried out the details in full. The method may 
be characterized in broad terms as follows. One assumes that there is a positive integer 
n having a certain property P. It may be possible to deduce from this assumption that 
there exists another positive integer n; <n that also has the property P. Then, by a 
reapplication of the process, the property P will be found to hold for another, still 
smaller positive integer m2 < n,—and so on inde nitely. But this is impossible, for there 
are only a nite number of positive integers less than the given integer n. Hence there 
can be no integer at all possessing property P. 

To clarify Fermat’s method of in nite descent, let us use this technique to prove that 
./2 is irrational. Suppose, on the contrary, that /2 = a/b, where a and b are positive 
integers. Now, the identity 


1 
oe a eae 
etl 


implies that 
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and so 


a b 2b-a ay 
2=-= -l= =— 
v2 b a-b a—b by’ 


with a, = 2b —a and b; =a — b. Because 1 < V2 < 2, or equivalently, 1 < a/b < 2, 
we can multiply through by 5 to get the inequality b < a < 2b. The implication is that 
0 <2b—a=a, while 2b < 2a gives 


a, = 2b-a <a. 


At rst glance this may seem protless, but much has been gained; starting with 
/2 = a/b, we have ended with /2 = a,/b,, where 0 < a, <a. If the entire process 
is then repeated, it is found that J/2 =a /b2, where ap is a positive integer less than a. 
Continuing in this way, we eventually obtain a sequence aj, a2, a3, ... such that 


a>a,>a>a3>-::->0. 


But this is impossible: positive integers cannot be decreased in magnitude inde nitely. It 
follows that our original assumption that 2 = a/b, where a and b are positive integers, 
is untenable and /2 must therefore be irrational. 


The Famous Last Theorem of Fermat 


We have seen that Fermat published very little personally, preferring to communicate 
his discoveries in letters to friends (usually with no more than the terse statement that he 
possessed a proof) or to keep them to himself in notes in the margin of his Arithmetica. 
By far the most famous is the one, written in Latin about 1637, which states: 


It is impossible to write a cube as a sum of two cubes, a fourth power as a sum of two fourth 
powers, and, in general, any power beyond the second as a sum of two similar powers. For 
this, I have discovered a truly wonderful proof, but the margin is too small to contain it. 


In this tantalizing aside, Fermat asserted that if n > 2, then the diophantine equation 
x” + y" = zl 


had no solution in the integers except for the trivial solutions in which z = x or z= y, 
with the remaining variable being zero. 

The quotation just cited has come to be known as “Fermat’s Last Theorem,” or more 
accurately, Fermat’s conjecture. All the results enunciated in the margin of his Arithmetica 
were found to be true, although the last of them, the famous last theorem, awaited proof 
until 1994. Fermat mentioned the cases of cubes and fourth powers repeatedly in his 
correspondence, which suggests that the success he had in proving these two instances 
led him to assume that the method of in nite descent would work for all exponents; but in 
no place is there any trace of this “truly wonderful proof” that he had purportedly found. 

Most experts now feel that Fermat simply believed that he had a proof, when he 
did not. For 350 years, the last theorem resisted all attempts at solution by many great 
mathematicians and an even larger number of amateurs. Then, in the late decades of 
the twentieth century, rapid and substantial progress was made. Some mathematicians 
tried to subdue the conjecture by brute force, using computers to verify the conjecture 
for ever larger exponents. By 1976, it was shown to be true for all exponents up to 
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125,000. A major step forward took place in 1983 when a young West German, Gerd 
Faltings, proved that the Fermat equation x” + y” = z” has only a nite—as opposed to 
in nite—number of integral solutions for any given n > 3. If it could be subsequently 
demonstrated that the nite number of solutions was actually zero in each case, Fermat’s 
claim would be settled once and for all. Another striking result, established in 1987, was 
that the last theorem is true for “almost all” values of the exponent n; that is, as n tends 
to in nity, the proportion of values for which the theorem is true approaches 100 percent. 
The rst indication that the problem might have been nally solved came in the summer 
of 1993, when Andrew Wiles of Princeton University sketched a proof during a series 
of three lectures in Cambridge, England. 

The overall structure and strategy of Wiles’s argument was so compelling that math- 
ematicians hailed it as almost certainly correct. But when the immensely complicated 
200-page manuscript was carefully scrutinized for hidden errors, it revealed a subtle 
snag. No one claimed that the aw was fatal, and bridging the gap was felt to be feasi- 
ble. Over a year later Wiles provided a corrected, re ned, and shorter (125 pages) version 
of his original proof to the enthusiastic reviewers. The revised argument was seen to be 
sound, and Fermat’s seemingly simple claim was __ nally settled. 

Resolving the problem of Fermat’s last theorem deprived it of a claim to attention 
but did not lessen its historical importance. The new techniques brought to bear on the 
claim and the new areas of mathematics spawned as a by-product may be more important 
to mathematics than the last theorem itself. In particular, Kummer was led to invent his 
ideal numbers out of which grew the theory of algebraic numbers. It will be of interest 
to trace this development. 

When 1 = pk, where p is a prime, the Fermat equation is the same as 


(x*)? +)? = 2"). 


If it could be shown that the equation uw? + v? = w? cannot have any integral solutions 
different from zero, then in particular there would be no solution of the form uv = x v= 
y*,w=z*, and hence x” + y” = z” would not be solvable. Thus it suf ces to prove 
Fermat’s last theorem for n = 4, and for n = p, where p is an odd prime. 

By 1839, the cases n = 3,4,5, and 7 had been eliminated. Fermat himself left a 
proof of the impossibility of satisfying the equation x* + y+ = z? in integers x, y, and z 
all different from zero; from this it follows immediately that the equation x* + y*+ = z* 
is equally impossible. Fermat’s proof is important because it illustrates his method of 
in nite descent, and it represents the only instance of a detailed proof left by him. Leon- 
hard Euler was the rst to settle the case for cubes, also using an argument by in nite 
descent. The proof, which appears in his Algebra (1770), rests on nothing more than a 
good understanding of properties of numbers of the form a* + 3b”, and there is no reason 
to doubt that Fermat had also found it. The next important advance came in 1825 when 
Lejeune Dirichlet read a paper at the Paris Academy in which he proved the theorem for 
the exponent n = 5. (The Revolutionary government had suppressed the Académie des 
Sciences in 1793, replacing it two years later with the Institut National.) His demonstra- 
tion was incomplete, as Adrien-Marie Legendre pointed out. By techniques developed 
in Dirichlet’s paper, Legendre provided an independent proof several weeks later. Each 
man acknowledged the contribution of the other, except that Legendre habitually referred 
to Dirichlet as M. Dieterich. Finally, in 1839, Gabriel Lamé was able to present a proof 
of the impossibility of solving the equation x’ + y’ = z’ in nonzero integers. 
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With the increasing complexity of the arguments came a realization that successful 
resolution of the general case called for different techniques. The best hope seemed to 
lie in extending the meaning of “integer” to include a wider class of numbers. Then it 
would be possible, by attacking the problem within this enlarged system, to get more 
information than by using ordinary integers alone. The German mathematician Ernst 
Eduard Kummer (1810-1893) made the breakthrough. 

At the March 1, 1847, meeting of the Paris Academy, Lamé, the author of the proof 
for the case n = 7, announced that he had discovered a proof of Fermat’s last theorem 
for an arbitrary exponent and credited his colleague J. Liouville with having suggested 
the basic method. After Lamé had nished, however, Liouville addressed the meeting to 
decline the compliment. He pointed out several gaps in Lamé’s proposed proof, the most 
serious of which was the tacit assumption that the usual rules of factorization of integers 
also would apply to “cyclotomic integers,” those complex numbers of the form 

ay tayr + agr? + +++ + ay_—yr""', 
where r is the complex nth root of unity (r” = 1) and ao, aj, ..., G,—1 are integers. Lamé 
admitted the shortcomings of his argument but believed (on the basis of extensive tables 
of factorizations) that the gap could be lled, afrming that the ordinary laws of integers 
held for cyclotomic integers when n = 5. So Lamé’s proof was recognized as invalid, 
and the focus of activity shifted to answering Liouville’s objection regarding the unique 
factorization of cyclotomic integers. 

If the Parisians had been better acquainted with the published notices of the Berlin 
Academy, they would have realized that the problem had been settled already. In May 
of 1847, Liouville received a letter from Kummer in which he asserted: 


Concerning the elementary proposition for these complex numbers, that a composite complex 
number may be decomposed into prime factors in only one way, which you so correctly cite 
as lacking in this proof—a proof defective in other ways as well—I can assure you that it 
does not hold in general for complex numbers of the form 


Qo air ar? feee a,1r"—!, 


but it is possible to rescue it, by introducing a new kind of complex number, which I have 
called an ideal complex number. 


Kummer enclosed a copy of the paper in which, three years earlier (1844), he had demon- 
strated the failure of unique factorization of cyclotomic integers; it had received little 
attention at the time, having been published in an obscure place (the University of Bres- 
lau, where Kummer was professor). His later attempts to nd a way of restoring unique 
factorization had led him to his entirely new theory of ideal complex numbers. Kummer 
indicated that the outlines of his research were contained in a paper, “On the Theory 
of Complex Numbers,” which had appeared in the Proceedings of the Berlin Academy 
in 1846; this work opened the way to the creation of the general theory of algebraic 
numbers. He went on to say, “I have considered already long ago the applications of this 
theory to the proof of Fermat’s theorem and I succeeded in deriving the impossibility of 
the equation x” + y” = z” from two properties of the prime number 2, so that it remains 
only to nd out whether these properties are shared by all prime numbers.” 

A few weeks later, Liouville published both Kummer’s letter and the little-known 
paper of 1844 in a privately established mathematical periodical called Journal de 
mathématiques pures et appliquées. Lamé allowed Liouville to print his ill-fated “proof” 
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in the same issue, somewhat revised but with the disproved assumption of unique fac- 
torization of cyclotomic integers glaringly evident. 

In the October 1847 issue of the Proceedings of the Berlin Academy, only a few 
months after Lamé’s attempted proof opened the subject, there appeared Kummer’s 
“Proof of Fermat’s Theorem on the Impossibility of x? + y? = z? for an Innite Num- 
ber of Primes p.” Kummer showed that Fermat’s last theorem held for a certain class of 
prime exponents, which are too elaborate to be de ned here, but which are now called 
regular primes. Moreover he reduced the problem of determining whether a given prime 
is regular to a simple procedure involving ordinary arithmetic, and calculated that 37 is 
the smallest irregular prime. He had not yet noticed that 59 is irregular or in fact that the 
only irregular primes below 100 are 37, 59, and 67. Finally, in 1857, Kummer published 
new criteria for Fermat’s last theorem to hold, conditions that would be satis ed when 
the prime exponent p is 37, 59, or 67, the values less than 100 for which he had not 
previously proved the theorem. 

In 1850, the Paris Academy offered a prize of a gold medal valued at 3000 francs 
for a complete solution of Fermat’s last theorem. When no proof was forthcoming, even 
on extension of the terminal date, the medal was awarded to Kummer as the author 
whose research most merited the prize, even though he had not submitted an entry in 
the competition. The use of the work infinite in the title of Kummer’s monumental 
paper of 1847 is still unjusti ed; it is suspected that there exist in nitely many regular 
primes, but this has never been established. In the other direction, K. L. Jensen proved 
in 1915 that there exist in nitely many irregular primes of the form 4n + 3, and in 1954 
L. Carlitz gave a simpler proof of the weaker result that the number of irregular primes is 
in nite. 

We might mention as a sidelight that in 1908 the mathematician P. Wolfskehl be- 
queathed 100,000 marks to the Academy of Science at Géttingen, to be paid for the 
rst complete proof of Fermat’s last theorem. The immediate result was a deluge of in- 
correct demonstrations by amateur mathematicians. Because only printed solutions were 
eligible, Fermat’s conjecture is reputed to be the mathematical problem for which the 
greatest number of false proofs have been published; indeed, between 1908 and 1912 
over 1000 alleged proofs appeared, mostly printed as private pamphlets. Suf ce it to say, 
interest declined as the German in ation of the 1920s wiped out the monetary value of 
the prize. (With the introduction of the Reichsmark and Deutsche Mark [DM] and after 
various currency revaluations, the award, worth about DM 75,000 or $40,000, was paid 
to Wiles.) 

The importance of Fermat’s work resides not so much in any contribution to the 
mathematics of his own day as in its animating effect on later generations. Fermat com- 
plained that his interest in the new number theory found no echo among contemporary 
mathematicians. This perhaps explains some of his reluctance to write up his vaunted 
proofs. Whatever the reason, his refusal to publish robbed him of recognition for many 
striking discoveries, and his almost exclusive interest in number theory during the last 
15 years of his life led to a growing isolation from the mainstream of research. He tried 
to persuade Pascal to collaborate on the subject, but Pascal was not a number theorist 
by temperament, and after a certain time became much more interested in religion than 
in mathematics. Fermat’s name slipped into relative obscurity and a century was to pass 
before a_rst-class mathematician, Leonhard Euler, either understood or appreciated the 
signi cance of what Fermat had been doing. 
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Because of the heritage of the scientic revolution of the 1600s, one might have 
expected a whole new burst of discoveries to follow. But the hour had passed. Eu- 
rope during the 1700s produced no gures of the epoch-making stature of Newton or 
Galileo. Instead there followed a long, slightly stunned period of assimilation during 
which a whole army of writers worked to make knowledge more widely available and 
intelligible. 

The phenomenon characteristic of all levels of scientic activity was expansion. 
There was growth in the size of the profession, in the public it addressed, in the quantity 
of its publications, in the scope of its activities, and in the range of applications. Growth 
in the amount of printed matter available made possible the diffusion of scientic in- 
formation with unheard-of speed and effectiveness. The production of books (some two 
million during the century), periodicals, and newspapers began at an unprecedented rate. 
Latin went out of use as the language of communication on weighty subjects. Thus, an 
intelligent person had available, in his or her own language, a wide range of sources 
from which might be extracted virtually unlimited amounts of information and even an 
occasional idea. At no other time in history have scientic principles and inventions held 
such a predominant share of public attention. 


The Eighteenth-Century Enlightenment 


The eighteenth century has been variously described as the Age of Enlightenment, 
the Age of Reason, and the Age of Inquiry. The term “Enlightenment” is applied to a 
great movement of liberal and often iconoclastic ideas that sought to improve the practi- 
cal conditions of human life through the power of reason. Although the same ideas were 
spread widely in Europe, they found their intellectual center in France in the works of 
Montesquieu, Diderot, Rousseau, and Voltaire. With few exceptions, these philosophes 
of the Enlightenment were not scientists themselves, nor were they philosophers, except 
in the sense that they made it their mission to re ne, elaborate, and transmit the scienti c 
ideas built up by the giants of the preceding century, especially Newton. The popular- 
ization of Newton’s ideas on the Continent was largely the work of Voltaire. He spent 
nearly three years (1726-1729) in England, where he attended Newton’s funeral, and 
from which he returned with a copy of the Principia in his bags. With the publication 
of Voltaire’s popular Elements de la philosophie de Newton in 1738, the whole world 
of English empiricism and its implications became widespread knowledge among the 
literate public of Europe. Popular interest even called forth a special Newtonianism for 
the Ladies in Italian. 

In the eyes of the eighteenth century, Newton’s great achievement lay in showing 
that there was order to the universe and that it was to be described in the abstract terms 
of mathematics. Whereas Newton’s laws of universal gravitation implied a certain math- 
ematical regularity and uniformity of nature in the physical world, the philosophes set 
themselves the task of nding similar general laws throughout the whole range of human 
experience. Exaltation of science and an almost boundless faith in the powers of “rea- 
son” and in unfettered intelligence led them to reevaluate matters that had formerly been 
excluded from rational scrutiny: social conditions, religious dogma and authority, govern- 
mental policies. (To the philosophes, reason meant the inductive method of Newton—the 
collection and classi cation of facts, and the testing of conclusions by observation and 
experiment.) The underlying assumption was that there was a right answer to every 
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question, that all problems could be answered with the same certain assurance as prob- 
lems in mathematics. 

The net effect of the Enlightenment was to produce a generation of people who were 
critical of existing institutions. This critical spirit was directed particularly at church and 
nobility, whose dogmatic claims to authority were alien to the new spirit of inquiry that 
demanded some sort of rational justi cation for the order of society. There arose an 
expectation, not so much of revolution but of a civilization of the future soon to be 
realized, more progressive and more equitable than the existing order. (The intellectual 
climate can, however, be numbered among the causes of the French Revolution of 1789 
as well as the American Revolution of 1776.) 

As science became fashionable at all levels of society, it grew in externals such as 
learned societies and periodicals. The beginnings of scientic journalism are generally 
placed at the year 1665 with the introduction of both the Philosophical Transactions and 
the Journal des savants, two publications that have persisted to this day. It has been 
estimated that in the period 1750-1789 close to 900 scientic periodicals were founded, 
many of them short-lived, as against a mere 35 before 1700. These early journals were 
predominantly vehicles for communicating scientic information—news, notes, and re- 
ports picked up from various sources—rather than repositories of scientic knowledge. 
They can hardly be said to have contained scienti c contributions in the form of original 
papers as we know them today. Many of the communications were letters, either sent 
directly to the editor or conveyed to him by the recipient of the letter, who was then 
acknowledged in the publication as often as the writer himself. Despite their numbers, 
the journals did not immediately supplant books as the preferred mode of scienti c pub- 
lication; throughout the century the publication of short articles in the form of pamphlets 
continued. 

Although the periodical literature of the 1600s consisted almost wholly of the journals 
of learned societies, a more popular type of periodical began to develop in the 1700s. The 
English took the lead with the publication of The Tatler (1709), The Guardian (1710), 
and The Spectator (1711). Essentially literary journals, they often printed articles of 
scienti c and technical interest, culled from the publications of various scienti c societies. 
The success of The Spectator (20,000 copies of some issues are said to have been 
sold) prompted the publication of a similar periodical in Paris, the Spectateur Frangais 
(1722). The end of the century saw the appearance of a superior kind of periodical, 
the Philosophical Magazine (1798), with its stated objective “to diffuse philosophical 
[scientic] knowledge among every class of society, and to give the public as early 
account as possible of everything new or curious in the scientic world.” 

Antecedents of the modern newspaper, brief, intermittent, and usually short-lived 
news sheets, had been published from quite early in the seventeenth century. Britain 
pioneered the rst daily newspaper, the Daily Courant, in 1702. Three more were being 
published in London by 1724, and toward the end of the century some 42 papers were 
appearing. France had no daily paper until 1777, when the Journal de Paris was founded. 
Whereas the scholarly formed their academies, more ordinary people began to establish 
reading clubs. A group of people would rent a room or two, subscribe collectively to 
various periodicals and newspapers, and meet from time to time to discuss the contents 
of these publications. 

Compendia of knowledge were known in European antiquity and medieval times, 
but most suffered from being the works of individual authors. The notion that everything 
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Jean D’Alembert 
(1717-1783) 


(From A Concise History of Mathematics by Dirk 
Struik, 1967, Dover Publications, Inc., N.Y.) 


could still be “discussed and analyzed, or at least mentioned” led a team of French schol- 
ars of the eighteenth century to try to gather all existing knowledge into an orderly whole, 
alphabetically arranged and cross-referenced. The monumental work that bore the title 
Encyclopédie ou dictionnaire raisonnée des sciences, des arts, et des métiers began to ap- 
pear in 1751 with the publication of the rst of its 17 volumes of articles. The editors were 
Denis Diderot (1713-1784) and Jean d’Alembert (1717-1783), of whom the rst wrote 
articles on particular arts and trades, and d’Alembert oversaw the mathematical subjects. 
Almost everybody who was anybody in France at this period took a hand in the gigantic 
publication, and distinguished contributors often wrote on unexpected subjects; Voltaire 
was the author of a piece on hemstitching, and Rousseau supplied the section on music. 

More than a repository of knowledge, the Encyclopédie was the manifesto of the 
Enlightenment, a brilliant piece of propaganda in support of skepticism and exalting 
the scientic method as the supreme tool of reasoning. As such, it was subject to strict 
censorship and periodic suppression by the authorities, which helped to delay and maim 
the enterprise. After the rst two volumes had appeared, any further sale or publication 
of the work was prohibited by decree of King’s Council on the grounds that it tended to 
“lay a foundation for error, corruption, irreligion, and incredulity.” (After the historical 
and religious articles had left Diderot’s desk, they were surreptitiously censored by the 
pious printer.) The 11 volumes of engraved illustrations, making a total of 28 in all, were 
nally completed in 1772, one year after the Encyclopedia Britannica made its debut 
with a mere 3 volumes. The effect of the 1000 or so engravings was to exhibit the state 
of manufacturing at that time by giving something like an anatomy of machines—overall 
views, cutaway drawings, and representations of individual parts, pieces, and tools— 
and industrial processes. There were, for instance, 55 plates on the textile industry, 11 
on mining, and 3 on the manufacture of pins. The idea of presenting uniform industrial 
methods that any producer could adopt was neither easily achieved nor entirely welcomed. 
Many artisans, determined to guard their techniques in secret, resisted inquiries by Diderot 
and his agents or deliberately provided inaccurate descriptions of their procedures. For all 
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the errors in fact, careless repetitions, and agrant omissions, however, the Encyclopédie 
was the greatest achievement of its kind and the most in uential work published in the 
eighteenth century. 

The Enlightenment was certainly not a great period in the history of university ed- 
ucation. Universities throughout Europe had with but few exceptions fallen from their 
high estate as centers of intellectual life. Political and ecclesiastical despotism had de- 
stroyed their freedom; Catholic governments required professors to take oaths of loyalty 
to the Church and its dogmas. The classical curriculum, to which established schools 
were limited by statute or tradition, had lost the power of inspiration that it had had in 
the days when people really spoke and wrote in Latin, and was hopelessly behind the 
times. Except for a few astronomical instruments, and terrestrial and celestial globes, 
most schools lacked the laboratory equipment for the study of science. Contempt for 
classical studies was not the rule in England, however, where the only universities, Ox- 
ford and Cambridge, enjoyed a certain prestige among the well-to-do classes. Because 
a vigorous parliamentary life put a premium on oratory, these universities continued to 
provide young men with brilliant instruction in ancient languages and literature. 

Here and there on the Continent, there were universities that were genuine centers 
of research and at which rst-rate teaching, at least in certain subjects, was to be found. 
In Germany, the most famous university of the 1700s was Leipzig (where Leibniz had 
been a student), and Leiden had replaced Padua as the leading European medical school. 
These institutions turned from Latin to the German language as the medium of com- 
munication in class and textbooks. The opening of a modern university in Géttingen 
(1737), with its emphasis on physics and astronomy, was a landmark in the history of 
German science. Within a generation, its library had grown to include 60,000 books and 
100,000 pamphlets, making it the largest university library of the century. The absence 
of scienti cally equipped French universities was partially remedied by the appearance 
of technical colleges and advanced military academies, such as the Ecole Militaire in 
Paris, during the second half of the 1700s. 

On the whole, the Enlightenment in France went forward apart from that country’s 
universities and was even impeded in some ways by their rigidity and conservatism. 
Rousseau, obviously alluding to France, wrote in Emile (1762) of “those laughable places 
called Colleges.” The wit and intellect of eighteenth-century France were to be found 
in her salons, country houses, and academies. A “salon” was simply the drawing room 
of a private house, with the company who habitually met there for social and cultural 
discussions. These “little courts” provided forums where nobles and men of letters were 
in some sense socially equal, and indeed where women conversed on the same footing 
as men. Salons differed completely from the clubs, composed exclusively of men, which 
were so prominent in English urban life. Great stress was laid on propriety of demeanor 
and diction, so that the activities of the salon have been somewhat uncharitably described 
as “a method of wasting time in perfect French.” 

The best French society showed as keen an interest in the sciences as in arts and 
letters. (John Locke, the great English philosopher, had said as early as 1693 that a 
gentleman must “look into” natural philosophy “tot himself for conversation.”’) It was 
fashionable to attend the courses of public lectures on elementary science that were 
becoming increasingly common and to perform scientic experiments of the most su- 
per cial kind. Although these drawing-room experiments in physics and electricity were 
more a social accomplishment than a serious exploration of nature, they at least indicated 
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a widespread realization that the world was governed by laws that could be discerned by 
intelligent observation and careful measurement. 

Academies and societies of learned men had existed from the time of the Renais- 
sance. There were several reasons for scholars, or just interested amateurs of scholarship, 
to band together. The association conferred distinction and reputation on them, the publi- 
cations of the organization presented an opportunity for placing the results of their studies 
before the learned world, and they could exercise their intellect and wit safely without 
the intervention of government censors. By the mid-1700s, each French provincial cap- 
ital, like most of the larger cities of western Europe, had its scientic, literary, artistic, 
or musical academy. Because these were often short-lived, unaf liated, voluntary asso- 
ciations, only a few are still known, although they once were abundant. The members 
met regularly, read and discussed their dissertations, and encouraged merit by the offer 
of prizes. The Academy of Dijon, in 1749, made Rousseau famous by awarding him a 
prize for an essay in answer to the question: “Has the restoration of the arts and the 
sciences had a purifying effect on morals?” His Discourse on the Arts and Sciences was 
a tirade against all institutions of civilized life, and to that extent was in accord with the 
spirit of the philosophes. But he broke with them by attacking the cherished view that 
public enlightenment through the widespread use of human reason was the most fruitful 
of activities. According to Rousseau, the source of all social evils was to be found in the 
restless curiosity of which the arts and sciences were the nal product. The eloquence 
and philosophy of the Discourse took the French intellectual community by storm and 
established Rousseau’s European reputation. 

Whereas the 1600s had been an age of great amateurs, the success of scienti c 
enterprise in the 1700s was continually providing more scientists with a livelihood, 
either as university professors or members of government-af liated academies. Many of 
the reigning monarchs delighted in regarding themselves as patrons of learning, and the 
academies were the intellectual crown jewels of the royal courts. The motives of these 
rulers may not have been entirely philanthropic, but the fact remains that these bodies 
were in time to become important vehicles for raising the level of scientic learning. They 
provided salaries for distinguished scholars, published journals of research papers on a 
regular basis, and organized scientic expeditions. Beginning in 1721, the Académie des 
Sciences opened prize competitions in which monetary awards went to those offering 
the best solutions to problems the Académie posed. Academies of science sprang up 
in Germany (1700), Sweden (1710), Russia (1725), and Denmark (1742), but few of 
these institutions had an immediate effect on Europe’s stock of scientic ideas. (The 
oldest American academy for the advancement of science is the American Philosophical 
Society Held at Philadelphia for Promoting Useful Knowledge, proposed by Benjamin 
Franklin in 1743 and_ nally organized in 1769.) The Royal Academy in Berlin, founded 
at the instigation of Leibniz, rapidly gained distinction through the generous patronage 
of Frederick the Great of Prussia. With its membership not limited to Germans, it had 
more the appearance of an association of expatriate philosophes than of an educational 
establishment. 


Maclaurin’s Treatise on Fluxions 


Certainly there were no accomplishments in the eighteenth century comparable with 
the formulation of calculus in the seventeenth. Instead, the philosophical question of 
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the logical structure of calculus seemed to absorb the energies of the eighteenth-century 
mathematicians. As the subject left the hands of Newton and Leibniz, it rested on no 
solid foundation; few mathematicians, content with expanding its scope and effective- 
ness, paused to examine the basic contradictions in the theory. Such a state of affairs 
could not continue long unchallenged. A reaction set in against the total lack of clarity 
and agreement about the nature of the limit concept. Attacks came from all quarters; 
Voltaire called the calculus “the art of numbering and measuring exactly a thing whose 
existence cannot be conceived.” The shrewdest attack of all, however, was delivered by 
a nonmathematician, George Berkeley (1685-1753), the bishop of the Irish diocese of 
Cloyne. 

The most signi cant mathematical event in eighteenth-century England was the pub- 
lication (1734) of a tract entitled The Analyst: or a Discourse Addressed to an Infidel 
Mathematician, in which Berkeley tried to show that the mysteries of calculus—although 
they led to true results—were no more securely grounded than the mysteries of religion. 
The “in del mathematician” so addressed was supposed to be Newton’s friend, the as- 
tronomer Edmund Halley. It appears that an acquaintance, when on his deathbed, refused 
Berkeley’s spiritual consolation, because Halley had proved to the poor man the unten- 
able nature of the doctrines of Christianity. This induced Berkeley to attack “the modern 
analysis” at its weakest spot: the foundations of the in nitesimal calculus. He taunted the 
proponents of calculus with “submitting to authority, taking things on trust, and believing 
points inconceivable”—precisely the charges against the followers of religious tenets. At 
one point in The Analyst, he derided the idea of instantaneous rates of change, using an 
expression that has become almost classic: 


And what are these uxions? The velocities of evanescent increments. And what are these 
same evanescent increments? They are neither nite quantities, nor quantities in nite ly small, 
nor yet nothing. May we not call them ghosts of departed quantities? 


When denying the utility of the new devices or the validity of the results, Berkeley 
developed an ingenious theory of compensating for errors that was meant to explain the 
correctness of the results of calculus; thus, “by virtue of a twofold mistake you arrive, 
though not at science, yet at truth.” 

It is dif cult to understand why Berkeley thought that he could restore regard for 
religion by showing that mathematicians were as apt to rely on faith as theologians were. 
Be that as it may, Berkeley’s criticism was well grounded and bore good fruit. Attention 
was thereby focused on some of the great weaknesses of the calculus, particularly the 
obvious need for a logical clari cation. Almost every important mathematician of the 
period—Abraham De Moivre and Brook Taylor in England, Colin Maclaurin in Scotland, 
Joseph-Louis Lagrange in France, and the Bernoullis in Switzerland—made some effort 
to give the subject all the rigor of the demonstrations of the ancients. By far the ablest 
and most famous attempt to answer The Analyst was Maclaurin’s Treatise on Fluxions, 
which appeared in 1742. Maclaurin took Greek demonstrative rigor as a model, and with 
a verbosity of which the ancients were guilty (the work consisted of 763 pages) tried 
to provide a geometric framework for Newton’s doctrine of “ultimate ratios.” The effort 
was commendable, much praised, and much neglected; 59 years elapsed before a second 
edition appeared. 

The Treatise on Fluxions is noteworthy for its investigation of in nite series, includ- 
ing the derivation of a well-known expansion for a function of x in terms of its powers; 
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in today’s symbolism, 


(0)x (0)x2 "(0)x3 
fx) = f+ LO% , FOX, LOW 
1! 2! 3! 
To obtain what is now known as the “Maclaurin series of f(x), Maclaurin proceeds as 
follows: let f(x) be represented by a series of the form 


f= A+ Be 4 Cx? + Dx? + -, 


where A, B,C, D,... are constants whose values are to be determined. The foregoing 
expression is differentiated successively to give 


f'@)y= B+42Cx + 3Dx* +.+. 
fy SIC HOD See: 
f"\@)=3D-2+--- 


Substituting x = 0 in each equation yields the desired constants: 


A= f(0), B= f'(0),C = f"(0)/2, D= f"(0)/B + 2),.-. 


By current standards, this argument wouldn’t be called a proof. Maclaurin simply assumes 
that his power series expansion is always possible [without explicitly requiring that f(x) 
have derivatives of all orders at x = 0]. He also takes for granted that a function’s series 
can be differentiated term by term, just as if it were a polynomial, and that the result of 
differentiating a function’s series actually represents the derivative of the function. 

The gifted Scot seems to have been something of a mathematical prodigy. He entered 
the University of Glasgow when he was 11 years old and graduated with a master’s degree 
at the age of 15. On the basis of a competitive examination, Maclaurin was appointed 
to a chair at Aberdeen, although still only 19. He moved to the University of Edinburgh 
in 1725 at Newton’s recommendation, initially to assist and later to succeed the aging 
incumbent as professor of mathematics; Newton privately offered to provide part of the 
salary so as to enable the university to secure the promising young scholar. Maclaurin 
was twice awarded prizes by the Académie des Sciences, on the rst occasion (1724) 
for a memoir on the collision of bodies, and subsequently (1740) for his essay on the 
mathematical theory of tides. When the army of the Young Pretender, Charles Stuart, laid 
siege to Edinburgh in the uprising of 1745, Maclaurin planned and supervised the hastily 
erected forti cations. He escaped to England when the city was overrun, but exposure 
in the trenches and the rigors of ight had taken their toll. He died shortly thereafter at 
the age of 48. 

With the early deaths of two of Newton’s most ardent advocates, Cotes and Maclau- 
rin, the English mathematical school remained relatively undistinguished for the rest of 
the eighteenth century. It is reasonable to speculate that if Cotes and Maclaurin had 
lived longer the power of Continental mathematics might have been recognized sooner 
in England, so that the hiatus in the development of British mathematics might at least 
have been shortened. 

It should be said that Maclaurin’s power series is merely a particular instance of 
one obtained earlier by Brook Taylor (1685-1731) using a different approach; Maclaurin 
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acknowledges this fully in the Treatise on Fluxions. The oddity is that the general result 
and one of its special cases have different people’s names attached to them. 

Educated at Cambridge, Taylor was elected to the Royal Society in 1714 and became 
its secretary two years later. He was appointed to the committee set up by the society 
to settle the rival claims of priority concerning the discovery of the calculus. When 
only 34, Taylor’s ill health forced him to resign the post of secretary and abandon 
his mathematical activity. Taylor’s principal work, Methodus Incrementorum Directa et 
Inversa (1715), includes the “Taylor series” expansion for f(x): except for notation, it 
says that 


/ 11 2 dA 3 
foe tn= sot ren BE irks 


from which Maclaurin’s series is realized by setting x = 0. Taylor announced his dis- 
covery at least three years before it appeared in print. In the Newtonian tradition, his 
argument involved uxions and evanescent quantities, along with the cumbersome sym- 
bolism in which dots appeared above and below variables. His presentation was no more 
free of blemish than was Maclaurin’s, for little attention was paid to considerations of 
convergence. 

The in uence of Fluxions on the progress of British mathematics in the 1700s was on 
the whole unfortunate. Maclaurin’s skillful use of geometry persuaded Newton’s country- 
men to adhere rigidly to classical geometric methods at the moment when “analysis” (the 
study of in nite processes), not geometry, was on the rise in Continental Europe. Thus, 
England dropped behind. With little regard for rigor, Continental mathematicians pushed 
on with the purely formal development of several new branches of analysis: differential 
equations, calculus of variations, differential geometry, and the theory of functions of a 
complex variable. The center of activity shifted repeatedly across Europe, with no nation 
remaining the leader for any prolonged period. 


Euler’s Life and Contributions 


The key gure in eighteenth-century mathematics was Leonhard Euler (1707-1783), 
and the scene of his activity was chie y Germany and Russia. Euler was the son of 
a Lutheran pastor who lived near Basel, Switzerland. His father earnestly wished him 
to enter the ministry and sent his son, at the age of 13, to the University of Basel to 
study theology. There the young Euler met John Bernoulli, then one of Europe’s leading 
mathematicians, and befriended Bernoulli’s two sons, Nicolaus and Daniel. Within a 
short time, Euler broke off the theological studies that had been chosen for him to apply 
himself exclusively to mathematics. He received his master’s degree in 1723; in 1727, 
when he was only 19, he won a prize from the Académie des Sciences for a treatise on 
the most ef cient arrangement of ship masts. 

For mathematicians beginning their careers in Switzerland in the early 1700s, there 
were few professorships and thus little chance of nding employment. When it was 
learned that the newly organized (1724) Academy of St. Petersburg was looking for 
personnel, many leading mathematicians from all over Europe, including Nicolaus and 
Daniel Bernoulli, went to Russia. On their recommendation, an appointment was also 
secured for Euler. Because of his youth (he was not yet 20) he had recently been denied 
a professorship in physics at the University of Basel and was only too ready to accept 
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Leonhard Euler 
(1707-1783) 


(Smithsonian Institution.) 


the invitation of the academy. On the very day in 1727 that Euler set foot on Russian 
soil, the liberal Empress Catherine I died, an event that nearly led to the dissolution of 
the academy. Euler gave up all hope of an academic career, became a ship’s of cer, and 
almost accepted a lieutenancy in the Russian navy. When another change of ruler brought 
a more favorable regime, conditions for scienti c work improved, and Euler joined (1730) 
the academy’s staff as professor of physics. Three years later when Daniel Bernoulli 
returned to Basel, Euler became his friend’s successor as chief mathematician of the 
academy. Euler surprised the Russian mathematicians by computing in three days some 
astronomical tables whose construction was expected to take several months. Perhaps 
weakened by the exertion, however, he came down with a feverish illness that ultimately 
resulted in the loss of sight in one eye. In St. Petersburg, Euler met the versatile scholar 
Christian Goldbach, a man who subsequently rose from professor of mathematics to 
Russian minister of foreign affairs. With his interests, it seems likely that Goldbach was 
the one who rst drew Euler’s attention to the work of Fermat on the theory of numbers. 

Euler eventually sickened of the political repression then prevalent in Russia and, in 
1741, accepted the offer of Frederick the Great to direct the mathematical division of the 
Berlin Academy, which was then rising in fame. The queen mother received him kindly 
at court, and asked why so distinguished a scholar should be so reticent and timid: he 
replied, “Madame, it is because I have just come from a country where, when one speaks, 
one is hanged.” Despite a attering letter of welcome, relations between Frederick and 
Euler soon spoiled. The youthful monarch felt it his duty to encourage mathematics, yet 
had little patience with a subject that seemed impractical to him. At court he preferred the 
company of polished philosophers such as Voltaire to the unsophisticated Euler, whom he 
cruelly called a “mathematical Cyclops.” Euler’s 25 years in Berlin were not altogether 
happy ones. 

From his rst years in Berlin, Euler kept in regular contact with the St. Peters- 
burg Academy, so that he was in effect working simultaneously in two academies. His 
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reputation among the Russians was such that they continued his salary for a long time 
after his departure; and when, during the Seven Years’ War, Russian troops plundered 
Euler’s estate, the commanding general compensated him handsomely for the loss, and 
the empress sent an additional sum of 4000 crowns. Heartened by the warmth of Russian 
feeling toward him, and at last unendurably offended by the contrasting coolness of Fred- 
erick, Euler returned to St. Petersburg in 1766 at the request of Catherine II, Catherine 
the Great. 

This second Catherine was a keen patron of the intellectuals of the Enlightenment. In 
fact, it had been her nancial support that helped to keep the Encyclopédie going during 
Diderot’s darkest days. Diderot, short of funds, was about to sell his personal library to 
provide a dowry for his daughter when Catherine heard of his dif culties. She bought all 
Diderot’s books, and not only allowed him to retain possession of them but also paid him 
an annual salary to act as librarian-custodian. Catherine tried unsuccessfully to persuade 
Diderot to transfer the publication of the Encyclopédie to Russia. Although Diderot 
refused to move to St. Petersburg permanently, he spent several months at Catherine’s 
court in 1773, impudently arguing with her. Wearied by his insistent demands for reforms, 
she nally admonished him: “You only work on paper, while I, poor Empress, work on 
human skin, which is more ticklish.” 

There is a famous anecdote concerning Euler and Diderot that has circulated for so 
many years that historians tend to give it some credence. According to the tale told by 
Augustus De Morgan in his Budget of Paradoxes (1872), Catherine the Great persuaded 
Euler to join her in suppressing the atheistic views that the freethinking Diderot had been 
advancing among the younger members of her court. Diderot was informed that a learned 
mathematician possessed an algebraic proof of the existence of God and would give it to 
him before all the court, if he desired. When Diderot eagerly consented, Euler advanced 
toward him and said, gravely and in a tone of perfect conviction, “Sir, (a + b")/n = x; 
hence God exists. Can you answer that?” It sounded like sense to the French philosopher, 
to whom, the story goes, all mathematics was inscrutable. Humiliated by the peals of 
laughter that greeted his embarrassed silence, the poor man asked Catherine’s permission 
to return to France at once, permission that was graciously granted. 

Although Diderot’s alleged atheistic conversations with courtiers may possibly have 
become annoying to Catherine, the story runs counter to what we know of the charac- 
ters of both men. There is no reason to think that Diderot, the author of ve credible 
memoirs on mathematics, would have been nonplused by Euler’s supposed formula. If 
the incident has any basis in fact, it is more likely that Diderot saw through the practical 
joke immediately, but was annoyed at the prospect of similar silly performances in the 
future and therefore went back to his country—and that not even “at once.” 

Shortly after Euler moved to St. Petersburg, he suffered the loss of sight in his 
good eye because of a cataract. He would not, however, permit blindness to diminish 
the rate of output of his scientic work. Euler’s comment on his calamity was, “I'll 
have fewer distractions.” His memory was so phenomenal, and his speed of calculation 
so great, that he continued to produce almost as actively as before. Euler’s faculty with 
gures was so prodigious that he could carry out in his head numerical computations that 
competent mathematicians had dif culty doing on paper. It is recorded that when two of 
his pupils, working the sum of a series to 17 terms, disagreed in their results at the ftieth 
signicant gure, Euler arrived at the correct calculation in his own mind. For the next 
17 years, Euler found sel ess and devoted helpers in his sons and servants, to whom he 


530 


Chapter 10 The Revival of Number Theory: Fermat, Euler, and Gauss 


dictated a vast store of mathematical papers. One of the rst works dictated after the onset 
of his blindness, dictated to a valet who knew nothing of mathematics beyond simple 
computation, was his famous Vollstdndige Anleitung zur Algebra (Complete Instruction in 
Algebra; 1770). This book was subsequently published in many editions—English, Dutch, 
French, Italian, and Russian—and gave to algebra the form that it retains to our time. 
Euler continued his labors unabated until in 1783, while playing with his grandchildren 
and sipping tea, he suddenly suffered a brain hemorrhage. In the oft-quoted words of 
Condorcet, “He ceased to calculate and to live.” 

Without a doubt, Euler was the most versatile and prolic writer in the entire history 
of mathematics. Fifty pages were nally required in his eulogy merely to list the titles 
of his published works. He wrote or dictated over 700 books and papers in his lifetime, 
and left so much unpublished material that the St. Petersburg Academy did not nish 
printing all his manuscripts until 47 years after his death. The publication of Euler’s 
collected works was begun by the Swiss Society of Natural Sciences in 1911, and it is 
estimated that more than 75 large volumes will ultimately be required for the completion 
of this monumental project. The best testament to the quality of these papers may be 
that on 12 occasions they won the coveted biennial prize of the Academie des Sciences. 
The rst of these prizes, which Euler received at the age of 31, was for an essay on 
the theory of heat, and another involved a study of the tides, a subject that especially 
attracted scientists at that time. Euler also gained a share of the prize of £20,000 offered 
by the British parliament for a method of calculating longitude at sea to an accuracy of 
half a degree. 

During his stay in Berlin, Euler fell into the habit of writing memoir after memoir, 
placing each when nished at the top of a pile of manuscripts. Whenever material was 
needed to Il the academy’s journal, the printers would help themselves to a few papers 
from the top of the stack. As the height of the pile increased more rapidly than the demand 
made on it, memoirs at the bottom would tend to remain in place a long time. This 
explains how it happened that when various papers of Euler were published, extension 
and improvements of the material contained in them had previously appeared in print 
under his name. It should be added that the manner in which Euler made his work public 
contrasted sharply with the secrecy customary in Fermat’s time. 

Besides the Algebra, another great and rich textbook was Euler’s /ntroductio in 
Analysin Infinitorum (1748), a two-volume treatise covering a wide variety of subjects. 
The rst of its two volumes was devoted to functions in general, and to logarithmic, 
exponential, and trigonometric functions in particular; it also dealt with power series, 
continued fractions, and various problems in number theory. The second volume con- 
tained analytic geometry of the plane and space, including a discussion of algebraic and 
transcendental curves. The /ntroductio did for analysis what Euclid’s Elements had done 
for geometry and al-Khowarizm”’s Hisdb al-jabr w’al muqdbalah for algebra. It was a 
classic text from which whole generations were inspired to learn their analysis, especially 
their knowledge of in nite series. 

In a sense, the /ntroductio created order in the still uncertain world of mathematical 
notation, for after its publication, Euler’s conventions were used generally. He gave his 
approval for using the Greek letter z for the ratio between the circumference and diameter 
of a circle (7 appeared in Oughtred’s Clavis Mathematicae of 1647 to denote simply 
the circumference), and for e to represent the base of natural logarithms. (Euler had rst 
used the symbol e in a letter to Goldbach in 1731.) One also meets here the modern 
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abbreviations for the trigonometric functions, which he regarded as ratios, not lengths, 
and hence as dimensionless quantities. And there is also the famous “Euler identity” 
linking trigonometric and exponential functions: 


ev = cosx +/—I1sinx. 


This formula was not entirely new. Roger Cotes (1682-1716) in the Philosophical Trans- 
actions of London, published in 1714, showed that 


V—I1x = log(cosx + /—1sinx), 


although not in this particular notation. (Cotes, who helped Newton prepare the second 
edition of the Principia, was the most mathematically gifted of all Newton’s adherents. 
His death led Newton to lament, “If Cotes had lived, we might have learned something.”’) 
Near the end of Euler’s life, in a memoir presented in 1777 to the Academy at St. 
Petersburg, he introduced the symbol i for /—1. 

The Introductio was followed by Jnstitutiones Calculi Differentialis (1755), and 
the three-volume Jnstitutiones Calculi Integralis (1768-1770). These formed the most 
complete and systematic exposition of the calculus, including a theory of differential 
equations, to appear up to that time. The works that formed Euler’s trilogy on analysis 
were exciting to read and skillfully presented, containing many new results of great value. 
They remained standard textbooks until the end of the century. Generations of young 
mathematicians followed Laplace’s (1749-1827) advice: “Read Euler, he is our master 
in all.” 

Like most major mathematicians of the eighteenth century, Euler made signi cant 
contributions to the subject of in nite series. The /ntroductio in Analysin Infinitorum, for 
instance, contains a derivation of the now-familiar series expansion of e. As outlined 
in the next paragraph, Euler’s argument is typical of the age in its reliance on formal 
manipulation and careless handling of limits and convergence. 

Given a > 1, Euler rst writes a” = 1+ kw, where w is taken to be an in nitely 
small number (“so small that it is just not equal to zero”) and k is a constant depending 
only on a. For any real number x, he puts 7 = x/w, then 


a* =a” =(1+kwy = (1+ kx /y, 


which can be expanded by Newton’s binomial theorem into 


eo FLY IG HD (RY? JG -DU -2) (kx)? 
- =145()+4 (4) re (8) a 


Because w is innitely small, 7 will be innitely large; this allows Euler, in a loose 
notion of passing to the limit, to assume that 


Thus, he decides that 


— kx (kx (kx 
Bear cps ae ager 


’ 
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so that, when x = 1, 


1+— : + = ; at i + 
a= rT rT ; 
Because the number e is the value of a when k = 1, 
1 1 1 1 
e=!l1+— 7 + it 3y 31 tees 


(In effect, Euler has identi ed & with lim a=! ‘the latter limit being equal to log, a.) The 


w ? 
series obtained is well-suited for computing e, and Euler used it to obtain a numerical 
value to 23 decimal places: 


é = 2.71828182845904523536028.... 


Unburdened by modern standards of proof, Euler treated an in nite series as a long 
polynomial to which nite procedures could be safely extended. He placed great reliance 
on the power of mathematical symbols, unhesitatingly assuming that correct results would 
follow from a formally coherent argument. Even though the modern reader might quarrel 
with the shakily based details of Euler’s work with series, many of his striking ndings 
were subsequently given a strictly rigorous sense. His remarkable mathematical intuition 
usually kept him to the right path. 

Euler’s mastery of formal manipulation is seen again in the /ntroductio when he 
derives the in nite series expansions of the sine and cosine functions. (These series were 
rst obtained by Newton around 1670, using his newly developed calculus.) Euler begins 
with the famous relation 


(cosz +i sinz)" = cosnz +i sinnz, 
discovered by Abraham De Moivre in 1730. The two choices of signs allow him to solve 
for cosnz as 
(cosz +7 sinz)" + (cosz — i sinz)” 
5 : 


When the right-hand side is expanded according to the binomial theorem, it follows that 


cosnz = (cosz)” — cos z)"-?(sinz)? 


ge — 1)\(n — 2)(n — 3) 
1-2-3-4 
Euler now lets z be an in nitely small number and n be an in nitely large one. This 
permits him to assume that cosz = 1, sinz = z, and 


COSNZ = 


(cos z)"~4(sinz)* — --- 


n(n —l)=n’, n(n — 1)(n — 2)(n — 3) =n"— 


Substituting and simplifying, the series for cosnz becomes 


cosnz = 1— 


2! 4) 
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In light of the sizes of z and n, Euler concludes that nz will be a_ nite quantity, say 
nz = x. This transforms the series to the more familiar form 


Reasoning along similar lines, he presents the in nite series for sinx as 


a oe a 


eae TT 

Another achievement of Euler in this area deals with the summation of the reciprocals 
of the squares of the positive integers; that is, with the in nite series 1 + 1/2? + 1/37 + 
1/4° +--+. This series had provoked considerable discussion in England and France. 
Oldenburg, in a letter of 1673, had asked Leibniz for its sum and received no reply. In 
1689, James Bernoulli showed that the series had a_ nite sum, but as to nding its actual 
value he “confessed that all his zeal had been mocked.” Writing to Christian Goldbach 
in 1728, Daniel Bernoulli gave 8/5 as an approximate sum, whereas James Stirling later 
(1730) computed the sum to 16 decimal places, namely 1.6449340668482264 (the series 
converges slowly so that many terms have to be taken into account to achieve an accuracy 
of a few decimal digits). Neither mathematician appears to have recognized that their 
sums were about 27/6, which Euler determined in 1735 to be the correct sum. 

One approach taken by Euler, subsequently described in the /ntroductio, was to apply 
to in nite series the relations between roots and coef cients of polynomials. The starting 
point was a trigonometric series known to Newton: 


ee oe ee 
Se SE ae a ag 
He treated the right side as a polynomial of in nite degree, arguing that because sinx = 0 
has roots 0, £77, +27, +37,..., so will 
a ne 
aaa ake oe 


Dividing this series by x to eliminate 0 as a root, and replacing x* by y, gave him a new 
polynomial equation 


2 3 
=j-24%_*% 
Cela ta at 
with roots 27, (27), (37), .... Now it was known from the theory of equations that, 


for a polynomial with constant term equal to 1, the sum of the reciprocals of its roots is 
the negative of the coef cient of the linear term. Proceeding by analogy with the nite 
case, Euler therefore concluded that 


1 1 1 1 


31 x2 Gay Gay 


fees, 


or 
© a ed et 
6 12 22 32 , 


He was led by this reasoning to sum the series that had defeated his predecessors. 
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By paying insuf cient heed to questions of convergence, Euler was sometimes led 
into absurdities. There is the occasion when he used the binomial theorem to expand 
aes ~2 
(ax. — ad + Xx ) as 
1 


(ge 


and then set x = —1 to obtain 
co =14+24+34+4+.---. 


Since he regarded in nity as a number (the reciprocal of 0), this sum caused no wonder- 
ment. From the series — =1+x+x*?+2x3+4+.---, with x replaced by 2, Euler found 
the strange equation 


Mate 2tO4 84 cn, 


where positive integers are added to produce a negative value. Observing that the terms 
of this series exceed the corresponding ones of the earlier series, he argued that —1 is 
larger than in nity. In a further ight of fancy, Euler contended that in nity separates 
positive and negative numbers, much as 0 does. 

In 1736, Euler published what is believed to be the rst paper on graph theory. 
It settled a famous problem of his day, called the K6nigsberg Bridges Problem. The 
puzzle concerns the bridges spanning the river Pregal at the German university town 
of K6nigsberg (since renamed Kalingrad) in East Prussia. In the river were two islands, 
connected with each other and with the mainland by seven bridges. The two islands were 
joined by a bridge; one of the islands was linked to each river bank by a single bridge, 
while the other had two bridges crossing to each bank. The citizens of K6nigsberg wanted 
to know if a person, starting from home, could pass over every one of the seven bridges 
exactly once before returning. 

Euler gave an elegant proof that such a route was not possible—a fact that many 
of the residents had already concluded. His rst step was to strip away the irrelevant 
details. It is the way the bridges connect, not the shapes and sizes of the land areas 
and bridges, that is important. Euler collapsed each landmass to a point (vertex) and 
treated each bridge as an arc. The resulting geometric con guration of vertices and arcs 
furnishes an example of a graph. In terms of this graph, the problem is to discover 
whether there is a path (a continuous sequence of arcs) beginning and ending at the same 
vertex while following each arc precisely once. In honor of Euler, a route in which each 
arc is traversed only once is today called a Euler circuit. 


Bw 
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The crucial idea in the solution is to attach to each of the four vertices an integer that 
indicates the number of arcs meeting at that vertex. For an Euler circuit any pass through 
a given vertex contributes two to its count, one for the entering and one for the departing 
arc. Thus, every vertex in a traversable graph must have an even integer associated with 
it. But in the K6nigsberg Bridges graph, all four of its vertices are labeled as odd. There 
can be no walking tour of K6nigsberg that crosses each bridge exactly once. 

Euler was much carried away with the theory of numbers, and his mathematical 
research supplied many of the proofs that Fermat claimed to possess—by techniques, to 
be sure, that were not those of Fermat. Having established Fermat’s theorem [if p is a 
prime and gcd(a, p) = 1, then a?~! — 1 is divisible by p], Euler demonstrated a more 
general result by introducing the phi function @(m). It denotes the number of positive 
integers less than n and relatively prime to n, so that if n is prime, ¢(”) = n — 1. What 
Euler proved (1760) was: 


If gcd(a,n) = 1, then n divides the difference a®”) — 1. 


For a better understanding of Euler’s generalization of Fermat’s theorem, let us 
consider a numerical example. If, say, n = 16, then it is found that d(7) = 8; for among 
the integers between | and 15, there are eight that are relatively prime to 16, namely the 
integers 1, 3, 5, 7, 9, 11, 13, and 15. Taking a = 3 [note that ged (3, 16) = 1], Euler’s 
result asserts that 3° — 1 is divisible by 16. Indeed, one can easily check that 


38 — 1 = 6561 — 1 = 6560 = 435 - 16. 


In the tradition of Diophantus and Fermat, another group of Euler’s works in- 
volved the solution of indeterminate equations in integral values of the unknowns. In 
1767, Euler gave a method for calculating the smallest integral solution of the equation 
x? —ay* = 1, where a is a positive nonsquare. In an earlier paper of 1732-1733, he 
had erroneously named this Pell’s equation, and the misnomer has stuck. (John Pell, an 
English mathematician, did no more than copy the equation from Fermat’s letters.) Euler 
extended Fermat’s studies on the representation of integers by sums x? + y* by proving 
(1751) that every prime of the form 44+ 1 can be uniquely expressed as the sum of 
two squares. Although Euler could not dispose of Fermat’s last theorem, he did establish 
(1770) the impossibility of having nontrivial integral solutions of x” + y" =z” for the 
case n = 3. 

Fermat, whose mathematical intuition was usually reliable, had observed that the 
integers 


P45, 27 41=17, aes O57, 27 4.1 = 65,537 


were all primes and expressed his belief that 27" + 1 would be prime for each value 
of n. In writing to Mersenne, he con dently announced, “I have found that numbers of 
the form 27" + 1 are always prime numbers and have long since signied to analysts 
the truth of this theorem.” However, Fermat bemoaned his inability to come up with a 
proof, and in subsequent letters, his growing tone of exasperation suggested that he was 
continually trying to do so. Through his uncanny ability at computation, Euler showed 
(1732) that 2?' + 1 was not prime, one of its factors being 641: 


2? 4.1 = 2324 1 = 4,294,967,297 = 641 - 6,700,417. 
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Among Euler’s accomplishments in number theory was a posthumously published 
paper (1862) giving the converse of Euclid’s theorem on perfect numbers. The proof is 
so surprisingly elementary that we shall describe it here. 


If n is an even perfect number, then n is of the form 2*—1(2* — 1), where 2‘ — 1 is a 
Mersenne prime. 


Proof. To begin, we may write n = 2‘—'m, where m is an odd integer and k > 2. 
The divisors of 2‘~'m are 1,2,27,...,2*-', each multiplied by the divisors 
1,d,,d),...,m of m itself. Now these numbers are precisely the terms in the 
expansion of the product 


(i Oe ee 
But this is the value of o(n), the sum of the divisors of n: 


o(n)=(14+2+4+274--- +2!) +d) +a +--++m) 
=(14+242?4+---4+2'')o(m). 


Because the rst factor on the right-hand side is a geometric progression, the last 
equation can also be written in the form 


o(n) = (2° — 1)o(m). 
The requirement for n to be perfect entails that 
o(n) = 2n = 2*m. 
Together, these two expressions for o(n) yield 
2m = (2* — 1)a(m), 


which is simply to say that 2 — 1 divides the product 2m. But 2‘ — 1 and 2* are 
relatively prime integers, whence, as a consequence of Euclid’s lemma, 2‘ — 1 must 
divide m; say, m = (2* — 1)M. The result of substituting this value of m into the 
last-displayed equation and canceling the common factor 2‘ — 1 is that 

o(m) = 2*M. Because m and M are both divisors of m, with M < m, we have 


2*M =o(m)>m+M =(2*-1)M+M=2'M, 


leading to o(m) = m + M. The implication of this equality is that m has only two 
positive divisors, to wit, M/ and m itself. It must be that m is prime and the smaller 
divisor M = 1; in other words, 


m= (2*-I)M=2*-1 
is a prime number. The original perfect even number n therefore has the form 
n= plane = ges eg _ 1), 


with 2" — 1 prime, completing the present proof. 
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10.2 Problems 


Many problems in number theory remain unsolved, but for combined simplicity of 
statement and dif culty of attack none surpasses the one posed by Christian Goldbach 
(1690-1764). Goldbach asserted in a letter to Euler dated June 7, 1742, that every even 
number was the sum of two odd primes. This proposition has generally been described 
as “Goldbach’s conjecture.” We should observe at the outset that if the term “prime” is 
taken in its modern sense, then the statement is false. It fails in the case of the integer 
2, which is even, but not the sum of two odd primes. We do not nowadays call 1 a 
prime, for if we did, the factorization of an integer into a product of primes would not 
be unique. The phrase “greater than 2” must therefore be inserted in the enunciation of 
the statement: Every even number greater than 2 is the sum of two primes. A somewhat 
more general version is that every even integer greater than 4 is the sum of two odd 
primes. It is uncertain how much time Euler spent on the problem; but in replying to 
Goldbach, he wrote, “That every even number is a sum of two primes, I consider an 
entirely certain theorem in spite of the fact that I am not able to demonstrate it.” He then 
countered with a conjecture of his own: Any even number greater than 6 that is of the 
form 4n + 2 is the sum of two primes of the form 47 + 1 (taking 1 as a prime of this 
latter type, where necessary). For instance, because 30 is an even number of the form 
4n + 2, we can write 30 = 13 + 17 by Euler’s conjecture. 

There is no reasonable doubt that Goldbach’s conjecture is correct and that the 
number of representations is large when the even integer is large, but all attempts to 
obtain a proof have been completely unsuccessful. Probably the most important result 
obtained in modern times in regard to Goldbach’s conjecture is contained in a Russian 
paper by Schnirelman, published in 1930. Schnirelman proved that there exists a xed 
number k such that every large number can be written as the sum of k& or fewer primes. 
However, this was an existence theorem, which gave no clue about the actual magnitude 
of k. Since then various mathematicians have given values for k. The best-known result 
is attributed to I. M. Vinogradov (1937): 


Every suf c iently large odd integer can be written as the sum of at most three prime numbers, 
and thus every suf ciently large integer is the sum of at most four primes. 


One can use the Schnirelman method to determine that every even number is the sum 
of no more than six prime numbers. 


2. Carry out, as follows, the details in Pietro Mengoli’s 
(1625-1687) proof of the divergence of the harmonic 
series 


1. 1. 4 
1. Using the expansion l+-=4+-4+-4---. 
g p tatatat 
1 ap 72 byt Ss. (a) Foran > 1, show that 
1+ x2 : 


and termwise integration, obtain Leibniz’s celebrated 


series for 2/4: 


=f dx 
4 Jo 14x? 


1 ree” 1 3 
>. 
-l<x<l, n—1l n n+l n 


(b) By grouping the terms of the series by threes, 
verify the inequality 


11 1 
=1-1/34+1/5-1/7--+-. 1+(54+343)+ 
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l 1 l and use the fact that cosx = 0 has roots 
(s+3+5)+ > Mud 3m 5m ] 
Bg 
1+54(24+5+5)+ 2 2 
Ll = eS as 2 
3 6 9 12 (b) From Euler’s expansions of ~ and > 
9 27 formally obtain the equation 
l+s+-+—+4+-°-. 
3 9 27 1 1 1 1 1 
es Bi 


. 2 2 2° 32 @ 
3. Use Euler’s identity e’7 = cosz +i sinz to prove that 
i= e777? 7. (a) Show, in the manner of Euler, that 


4. Consider the polynomial equation sinx 


= x 
ax" tax” +++ +a, 1x +a, =0, ao #0, 


x? x? x? 
with (complex) roots 71,72, ...,7,, So that (1 =) (: ax) (1 a) see 


—1 
Ax" a ax" +++ + Ay-1X + ay 


(Hint: Interpret the right side of the expression 


= ay(x — ry)( = 12)--- Th). ae a 
By multiplying the linear factors on the right and x aa 3! = 5! * 
comparing the resulting coef cients, prove that poe y 
i ee 
es a a ae A 0 ; a? 
a as a polynomial of in nite degree and apply 
. ; : -  @® Problem 5(b).] 
oe aa aa aces sae a (b) Substitute x = § in the formula of part (a) to 
obtain 


: ay 
riers sty = (1, 2 (5) (4) (2) (4) 
" —-—{—_— —— —_ = Jers, 
4 2-2 4.4 6-6 8-8 


where the kth equation asserts that the sum of all 
possible products of the roots, taken & at a time, This famous in nite product for 7 was 
published by John Wallis in his Arithmetica 


equals (— 1)“, 
a Infinitorum (1665). 


5. With the notation of Problem 4, let a, 4 0, so that (c) From Wallis’s formula, conclude that 
the roots 71,72,...,7, are all nonzero. Establish the 
. (n 1°22” 
following: Ja = lim — : 
noo (2n)!./n 
1 1 Lo aya 
(a) rh a > ae 82 an ae [Hint: First prove that 


(b) — dox" +++ + G,-1X +a, 1-3\ /3-5\ (5-7 
2-9) \4-by 616) ~ 


=a, (1-4) (1-4)...(1-4). 
r| r2 ln = — DQn+ ~) 


6. (a) By Euler’s methods, derive the summation 2n-2n 
2 
ae ae see eae ee _ [2n)!P@n+1) 
8 12 32 52 P — (n!)424" 
[Hint: Begin with the trigonometric series 
x2 x4 x6 8. The following is an outline of a proof that e is 
cata lo ata a eis irrational: 


2! 4! 6! 
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(a) Use the estimate e < 3 to show that 
i. i 4 I ds i WH 
0 a" esi =: = i <= 
<e (i+ t+5+5+ +=) X 


Tetrahedron Hexahedron Octahedron 


3 
See] ont LN 
(+1! CSR eK 
for any positive integer n. on 29, SQ 
(b) Assume to the contrary that e is rational, say, Dodecahedron Icosahedron 
e = a/b, where a and b are positive integers. In 1751, Euler proved a formula relating the faces, 


Pick n > b and also n > 3. Substitute e = a/b 


edges, and vertices of any convex polyhedron, not 
into the inequality of part (a) and multiply the 


just a regular one; the result was already known to 


inequality by !. Prove that this leads to the Descartes a century earlier. For the solids pictured, 
existence of an integer N, which satis e s conrm Euler’s formula 
0<WN < 3/4. 

F+V=E+42, 


9. For the seven bridges in the K6nigsberg Bridge 


where F’, FE, and V denote the number of faces, 
Problem, can 


edges, and vertices of the solid, respectively. 


(a) one bridge be removed, 11. For a regular polyhedron, establish the following 
(b) an additional bridge be added assertions (as they are expressed in the notation of 
Problem 10). 


so that an Euler circuit is now possible? 
: (a) If each face is a regular polygon of n sides, 


10. A rectangular polyhedron is a geometric solid, all of then nF = 2E. 
whose lateral faces are identical regular polygons, (b) Ifm edges meet at each vertex, then mV = 2E. 
and in which the same number of polygons meet at (c) Euler’s formula implies that 1/m + 1/n > 1/2. 
each vertex. There are only ve regular polyhedrons, (d) Just ve pairs of values of m and n satisfy the 
or Platonic solids, named according to the number of inequality in part (c), and each pair is 
faces each has; as shown in the gure. associated with a Platonic solid. 


In number theory, Fermat was essentially alone for 


10.3 The Prince of Mathematicians: the whole of the seventeenth century, as Euler was 
Carl Friedrich Gauss for most of the century following, until he was joined 

by Lagrange. Joseph Louis Lagrange (1736-1813), 

The Period of the French Italian by birth, German by adoption, and French by 


. choice, was, next to Euler, the foremost mathemati- 
Revolution: Lagrange, cian of the eighteenth century. When he entered the 
Monge, and Carnot University of Turin, his great interest was in physics, 

but after chancing to read a tract by Halley on the 
merits of Newtonian calculus, he became excited about the new mathematics that was 
transforming celestial mechanics. He applied himself with such energy to mathematical 
studies that he was appointed, at age 18, professor of geometry at the Royal Artillery 
School in Turin. The French Académie des Sciences soon became accustomed to see- 
ing Lagrange among the competitors for its biennial prizes. Between 1764 and 1788 
he won ve of the coveted prizes for his applications of mathematics to problems in 
astronomy. 
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In 1766, when Euler left Berlin for St. Petersburg, Frederick the Great arranged for 
Lagrange to Il the vacated post, accompanying his invitation with a modest message 
saying, “It is necessary that the greatest geometer of Europe should live near the greatest 
of Kings.” (To d’Alembert, who had suggested Lagrange’s name, the king wrote, “To 
your care and recommendation am | indebted for having replaced a half-blind mathemati- 
cian with a mathematician with both eyes, which will especially please the anatomical 
members of my academy.”’) For the next 20 years, Lagrange was director of the math- 
ematics section of the Berlin Academy. There he produced work of high distinction, 
which culminated in his monumental treatise, the Mécanique analytique, published in 
1788 in four volumes. In this work he unied general mechanics and made of it, as 
the mathematician Hamilton was later to say, “a kind of scientic poem.” Holding that 
mechanics was really a branch of pure mathematics, Lagrange so completely banished 
geometric ideas from the Mécanique analytique that he could boast in the preface that 
not a single diagram appeared in its pages. 

Frederick died in 1787 and Lagrange, no longer nding a sympathetic atmosphere 
at the Prussian court, decided to accept the invitation of Louis XVI to settle in Paris, 
where he took French citizenship. But the years of constant activity had taken their toll. 
Lagrange fell into a deep mental depression that destroyed his interest in mathematics. 
So profound was his loathing for the subject that the rst printed copy of the Mécanique 
analytique, the work of a quarter-century, lay unexamined on his desk for more than two 
years. Strange to say, it was the turmoil of the French Revolution that helped to awaken 
him from his lethargy. 

The French Revolution was a watershed event, overthrowing the old order and lay- 
ing the groundwork for profound changes in Western society. One of its most dra- 
matic moments occurred on July 14, 1789, when a Parisian mob seized the Bastille, a 
prison-fortress, which was a hated symbol of royal tyranny. (The day of this act, July 
14, has become the major national holiday of the French people.) In January of 1793, 
the imprisoned Louis XVI was found guilty of treason and condemned to the guillo- 
tine. There followed two years in which thousands were put to death, often without 
adequate evidence, as perceived enemies of the Revolution—by some counts 30,000 
died, including the queen, Marie Antoinette. A coup d’etat in 1799 can be said to mark 
the end of the revolution for it opened the way to a resumption of absolutism, rst 
under Napoleon and then under the restored monarchy. Nearly 16 years passed be- 
fore the Bourbons returned to the throne in the person of Louis XVIII, brother of the 
executed king. 

Madame de Pompadour’s cynical prophecy, “After us, the deluge,” had barely hinted 
at the oods that were in fact released at the storming of the Bastille. In the national 
upheaval that followed, anything that was more or less identied with the injustices 
practiced by the Old Regime came under suspicion and was destroyed. The universities, as 
well as the Académie des Sciences, were abolished by the National Convention in August 
of 1793. Because it was the “academy” as an aristocratic anachronism in a democratic 
society that was the target of the revolution, and not learning itself, the solution was to 
start afresh. Two new institutions, bearing the humble titles Ecole Normale and Ecole 
Polytechnique, were to keep scientic education aoat in a sea of turmoil. The Ecole 
Polytechnique, created in 1794, was to produce the engineers so urgently required by a 
nation at war; technical instruction was to be rmly based on sound scientic training, 
and particularly on mathematics and physics. 
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The most famous embodiment of the revolutionary overhaul of French education was 
the Ecole Normale, originally intended to create, almost overnight, a cadre of educators 
for the nation. More than 1400 persons were selected by local of cials to come to Paris 
and study at government expense. Candidates had to be at least 25 years old and of 
good character and unquestioned patriotism. In roughly four months’ time, they were 
to learn the essential elements of a number of academic subjects, which on completing 
the course of training they in turn would teach to their own students. To guarantee the 
standards of excellence, the government found itself forced to recruit ex-academicians 
for the faculty. Lagrange, Laplace, Monge, and Vandermonde were in charge of lecturing 
on mathematics. Unfortunately missing were the victims of the Reign of Terror. Among 
those were the chemist Lavoisier, who had gone to the guillotine, the mathematician 
Condorcet, who had preferred to take poison in prison, and the astronomer de Saron, 
who before marching to the platform had spent his last hours calculating the orbit of 
a newly discovered planet. The outraged Lagrange, hearing that the Revolution had 
silenced yet another voice of progress, said, “It took them only a moment to cause his 
head [Lavoisier’s] to fall, and a hundred years will not suf ce to produce its like.” The 
fundamental weakness of the whole scheme was that in the absence of true universities, 
the Ecole Normale became the goal of all those who wanted to pursue scientic or 
literary studies at a higher level; professors gave lectures directed toward “the heights of 
science rather than towards the art of teaching.” The failure to perform the prescribed task 
brought increasing criticism, and after only three months and 11 days, the school closed 
its doors. In 1808, the Ecole Normale was reorganized by Napoleon as the Ecole Normale 
Superieure, with a view of securing a constant supply of instructors for secondary schools. 

Lagrange, with Laplace as his assistant, taught elementary mathematics at the Ecole 
Normale, and when the short-lived institution had closed, lectured on analysis at the 
Ecole Polytechnique. Although he had not lectured since his early days at Turin, having 
been under royal patronage in the interim, he seemed to welcome these appointments. The 
instructors, subject to constant surveillance, were pledged “neither to read nor repeat from 
memory,” and transcripts of their lectures as delivered were inspected by the authorities. 
Despite the petty harassments, Lagrange gained a reputation as an inspiring teacher. His 
lecture notes on differential calculus formed the basis of another classic in mathematics, 
the Théorie des fonctions analytiques (1797). 

Through his teaching at the Ecole Polytechnique, Lagrange became acquainted with 
Sophie Germain (1776-1831), a woman frequently called the Hypatia of the nineteenth 
century. Although the school did not accept women as students, she managed to secure 
Lagrange’s lecture notes on analysis. Following the new revolutionary practice that al- 
lowed students to hand in written observations, she communicated her own thoughts to 
Lagrange, writing under the male pseudonym M. Leblanc (the name of a student enrolled 
at the school). Lagrange was so struck by the quality of her work that on learning the 
author’s true identity, he praised Germain as one of the promising young mathematicians 
of the future. This accolade was vindicated when she proved (around the mid-1820s) 
that for each odd prime p < 100, Fermat’s equation x? + y? =z? has no solution in 
integers not divisible by p; in other words, if x, y, and z are nonzero integers satisfying 
xP + y? = z?, then p divides xyz. 

Lagrange’s research covered an extraordinarily wide spectrum, but he possessed, 
much like Fermat and Euler before him, a special talent for the theory of numbers. 
Some of his work there included the rst proof of a theorem stated by John Wilson 
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Joseph Louis Lagrange 
(1736-1813) 


(Smithsonian Institution.) 


in 1770 to the effect that n is prime if and only if n divides (n — 1)!+ 1; the nding 
of all integral solutions of the diophantine equation x? — ay? = 1; and the solution of 
numerous problems posed by Fermat on how certain primes could be represented in 
particular ways. (Typical of these last-mentioned problems was the result that every 
prime p of the form p = 8n + 1 could be written as p = a? + 2b? for suitable integers 
a and b.) The discovery for which Lagrange acquired his greatest renown in number 
theory is that every positive integer can be expressed as the sum of four integral squares. 

News of the fall of the Bastille was hailed everywhere on the Continent with sat- 
isfaction. But no other nation imitated France spontaneously; rather the French armies, 
with their goal of “natural frontiers” and their slogan, “War to the thrones, peace to the 
cottages,” spread the principles of the revolution. The con ict began in April 1792 and 
lasted, with brief interruptions and shifting alliances, until 1815. To repel the combined 
forces of all the old monarchies, it was necessary for France to return to absolutism—in 
this case, the dictatorship of a general, Napoleon Bonaparte. The statesmen of Europe 
held together a coalition of mutually hostile states just long enough to allow the weight 
of their numbers to ensure victory. On June 16, 1815, the British under Wellington and 
the Prussians under Bliicher delivered the nal blow to the Napoleonic epic at Waterloo, 
near Brussels. (Before the revolution, war had been “the sport of kings” and armies 
were small; but at Waterloo, the massive armies of the allies numbered some 800,000 
men—not all concentrated, to be sure—whereas Napoleon put together a force of 200,000 
soldiers, of whom 130,000 were available.) Germany was more permanently transformed 
than most other parts of Europe during the revolutionary-Napoleonic years, and under 
the domination of Prussia, it was to become the center of future development on the 
Continent. 

The geometer Gaspard Monge (1746-1818) passed safely through the days of the 
Terror, but suffered later for his close ties to the revolution. Monge was born in Beaune, 
Burgundy, the son of an itinerant peddler. He received his early education in schools 
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conducted by the Oratorian religious order. At the age of 16, he made an architectural 
sketch of his native town that so impressed military authorities that he was offered a 
place in the artillery school at Méziéres. Monge’s lowly birth precluded his obtaining a 
commission; instead, he was assigned to an annex of the school where surveying and 
drawing were taught. Asked to prepare a plan for a fortress, he replaced the tedious 
arithmetic calculations usually used with a rapid graphical approach. This was a begin- 
ning of a separate branch of geometry, which he referred to as “descriptive geometry.” 
Monge’s innovative idea was to represent a three-dimensional body by projecting it onto 
two mutually perpendicular planes, ordinarily the vertical and the horizontal, and then 
rotating the plane taken as vertical through 90° in order that both projections would 
appear in a single plane, or sheet of paper. Conversely, the size and shape of the body 
could be deduced from the projections. The practical advantages of Monge’s scheme led 
the authorities to keep it a military secret for years. 

Monge joined the staff at Mézieres upon graduation, and rose through the ranks to 
professor of mathematics by the age of 22. In 1780 he was elected to the Académie des 
Sciences. Three years later Monge moved to Paris, where he was to hold a number of 
important political and academic positions under successive regimes: royal, revolutionary, 
and imperial. He was named examiner of naval cadets in 1783; for the next six years he 
divided his time between mathematics and the inspection of maritime schools throughout 
the country. Following the overthrow of the monarchy, Monge brie y held the post of 
Minister of the Navy and thereafter was a member of the Committee of Arms responsible 
for supervising metallurgical foundries and gunpowder workshops. By the spring of 1794, 
three thousand workers were turning out seven hundred guns a day, with six thousand 
workshops busy producing powder. Yet his activities were not enough to shield Monge 
from being denounced at the height of the Terror, and he was forced to ee Paris in haste. 

On his return to the city, Monge helped organize and taught at the ephemeral Ecole 
Normale. The rst account of his new geometry appeared in 1795 as Géomeétrie de- 
scriptive, printed in the serial form of lectures given at the school. Monge was also 
among the original staff at the Ecole Polytechnique, becoming its director in 1808. His 
course on solid analytic geometry led to the textbook Feuilles d’analyse appliquées a 
la géométrie (1801) and in an expanded version entitled Application de l’analyse a la 
géomeétrie (1805). Under the in uence of Monge, the study of geometry began to ourish 
at the Ecole Polytechnique. He was considered an outstanding teacher, whose lectures in- 
spired a host of brilliant students to enter the eld, among them such men as Jean-Baptiste 
Biot, Charles-Julian Brianchon, and Jean-Victor Poncelet. 

Monge came to be an ardent supporter of Napoleon, whom he saw as a “man of 
destiny” capable of spreading the goals of the revolution to every part of Europe. He 
went to Italy in 1796 as one of the commissioners to determine which local art treasures 
the victorious French army should remove to Paris; and later he accompanied Napoleon 
on the Egyptian campaign. When Napoleon ascended to power, he rewarded Monge’s 
enthusiastic labors by bestowing the title of Count on him and making him a senator 
for life. 

As a deputy, Monge had not only voted in favor of the execution of Louis XVI, but 
in his further capacity as minister of the navy had signed the of cial record of the trial. It 
is little wonder that the aged professor was cruelly treated following the restoration of the 
Bourbon monarchy. Stripped of all titles and honors previously conferred upon him, he 
was expelled from the Institut de France (the renamed Académie). As a _ nal indignity, 
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the king denied to the students and faculty of the Ecole Polytechnique permission to 
attend Monge’s funeral. Instead, on the following day they marched to the cemetery in 
a body to place a wreath upon his grave. 

Lazare Carnot (1753-1823) was another geometer with an important role during the 
revolution. A lawyer’s son from Burgundy, he was admitted to the military school at 
Méziéres, where Monge was one of his teachers. Carnot graduated after two years with 
a commission as a lieutenant in the Corps of Engineers. During the next 18 years of 
routine garrison duty in France, Carnot was promoted to the rank of captain—the highest 
attainable by someone not of noble birth. He nonetheless had ample spare time to study 
and write on mathematics, mechanics, and defensive forti cation. 

Carnot entered public life in 1791 with his election to a seat in the newly formed 
Legislative Assembly, and in the following year to its successor, the National Convention. 
There he was among the deputies who voted for the execution of the king. In 1793 
Carnot was appointed to the powerful Committee of Public Safety, the 12 men who were 
to rule France during the years of the Terror. As the only member of the committee with 
military experience, he assumed direction of the nation’s war effort. The tremendous 
labor required to mold the ill-equipped /evée en masse into the armies that saved France 
from invasion by the still-existing European monarchies earned Carnot the admiration of 
his countrymen and the appellation “Organizer of Victory.” 

Carnot’s political fortunes changed when he refused to be associated with Napoleon’s 
coup d’etat of 1797. He fell under suspicion of being a royalist sympathizer, was pro- 
scribed, and ed to Switzerland in fear for his life. After Napoleon seized power for 
himself, Carnot felt safe to return. However, except for two brief periods (1800 and 
1815), he showed little interest in holding public of ce. He remained nonetheless an en- 
emy of all tyrants; as an elected member of the Tribunate, he was the only one with the 
courage to cast a vote against Napoleon’s being named emperor. With the restoration of 
Louix XVIII to the throne, Carnot’s reputation as an ardent revolutionary condemned him 
to perpetual exile. His remaining years were spent in the Prussian city of Magdenburg. 

Carnot’s numerous activities did not seem to limit his scientic and mathematical 
writings. His earliest contribution, a modest effort on engineering mechanics with the title 
Essai sur les machines en général (1793) failed to attract attention; but a revision brought 
out in 1803 as Princips fundamentaux de l’equilibrium et du mouvement was far more 
successful. Carnot’s best known publication, the Reflexions sur la métaphysique du calcul 
infinitésimal (1797) was completed while taking refuge in Geneva. Widely read, it ran 
through numerous printings (the last in 1921) and was translated into several languages, 
including English. The Reflexions sought to explain how the different approaches to 
the calculus—Newton’s_ uxions, Leibniz’s differentials, and d’Alembert’s limits—were 
essentially equivalent. In explaining how correct theorems could emerge from suspect 
assumptions within the process of calculation, Carnot echoed Bishop Berkeley in conclud- 
ing that “the true metaphysical principle in the calculus is the doctrine of compensating 
errors.” 

Carnot’s reputation as a mathematician rests largely on his geometric works. He was 
an advocate of pure geometric reasoning, which he felt was superior to Cartesian proofs 
with their “hieroglyphics of analysis.” He was one of the rst after Desargues to take 
up the ideas of projective geometry and, in doing so, to in uence the nineteenth century 
revival of the subject under John Victor Poncelet. His De la corrélation des figures de 
géomeétrie (1801) was followed by an enlarged extension, Géométrie de position (1803). 
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The latter discusses a variety of coordinate systems—such as rectangular, polar, and 
bipolar—for locating a point in space. While the equation of a curve would depend on 
the particular system employed, Carnot felt that the curve’s intrinsic properties should 
be independent of the frame of reference; he sought a system of “intrinsic coordinates” 
that would best reveal the curve’s properties. Carnot’s best known geometric publication, 
Essai sur la théorie des transversals (1806), was instrumental in bringing into prominence 
the study of transversals. In the Essai, he extended a number of classical results, one 
being Heron’s formula for the area of a triangle in terms of its sides. Carnot represented 
the volume of a tetrahedron in terms of its six edges. 

Napoleon was himself something of an amateur mathematician, with a penchant for 
geometry. On one occasion, he posed to the French scholars the problem of dividing the 
circumference of a circle into four arcs of equal length by means of a compass alone. 
This has become known as “Napoleon’s Problem,” although it was not original with him. 
Lorenzo Mascheroni (1750-1800), a professor of geometry at the University of Pavia, 
furnished a solution in his Geometria del Compasso (1797). The work was dedicated 
to Napoleon, whom the author greatly admired (not surprisingly, it was translated into 
French). Mascheroni went much further in his book by demonstrating that any plane 
Euclidean construction that can be carried out with a straightedge and compass can 
also be accomplished using the compass alone. Many years later it was discovered that 
Mascheroni’s solution to Napoleon’s challenge had been anticipated by an obscure Danish 
mathematician, Georg Mohr, in a brief treatise entitled Euclides Danicus (1672). The 
booklet seems to have gone entirely unnoticed until 1928 when a student happened upon 
it in a secondhand bookstore in Copenhagen. 

One of the lasting contributions of the French Revolution is the metric system of 
weights and measures. Previously, the country suffered under a crushing burden of dif- 
ferent standards, varying not only from province to province but almost from town to 
town. In May 1790, the Constituent Assembly appointed a committee to choose between 
two proposals for a basic unit of linear measure. One suggestion was to take the unit to 
be the length of a simple pendulum, beating seconds, at the equator. The other alternative 
was to take as the standard a ten-millionth part of the distance from the North Pole to the 
equator. In March 1797, the assembly settled in favor of the quadrant measure, feeling 
that the pendulum option lacked precision because gravity varies from place to place. 
The fundamental unit was named the meter, after the Greek word for measure; it was 
assigned a provisional value based on earlier calculations. 

The committee decided to extrapolate the length of the earth’s quadrant by carefully 
measuring the segment of it running from Dunkirk to Barcelona, two seaport towns on 
the same meridian. The astronomers Jean Baptiste Delambre and Pierre Méchain were 
entrusted with the survey of the chosen arc. Setting out in opposite directions, it took 
them from 1792 until 1798 to complete the necessary chain of triangulation. At the 
International Congress on the Meter held at Paris in June 1799, the “true” length of a 
meter was xed—the provisional meter having been slightly longer—and a copy in the 
form of a at bar of pure platinum was deposited in the national archives. However, 
a great many people remained wary of the new weights and measures and stubbornly 
resisted their use. In July 1837, legislation was passed making use of the metric system 
mandatory throughout France and its possessions as of January 1, 1840. 

More recent surveys show that the length of the meridian from the North Pole to 
the equator is not exactly ten million meters. It is, rather, 10,002,290 meters. 
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Gauss’s Disquisitiones Arithmeticae 


With the end of the 1700s, Germany assumed the lead as the center of mathematical 
activity. The greatest mathematician of modern times, Carl Friedrich Gauss (1777-1855), 
was so very German that never in his life did he leave the country, not even for a visit. 
Unlike Fermat and Euler, who came from nancially secure middle-class backgrounds. 
Gauss was born into a poor and unlettered family. His father managed to eke out a 
meager living in Brunswick, Germany, through hard work as stonecutter, gardener, canal 
worker, and nally foreman for a masonry rm. Had the father’s views prevailed, the 
son would have followed one of the family trades; it was only by fortunate chance that 
Gauss did not become a bricklayer or a gardener. Much against his will, the father was 
persuaded to allow the gifted boy to acquire an appropriate education. 

Gauss was one of those remarkable infant prodigies whose natural aptitude for math- 
ematics soon becomes apparent. He used to say, laughingly, that without the help or 
knowledge of others he had learned to reckon before he could talk. As a child of three, 
according to a well-authenticated story, he corrected an error in his father’s weekly pay- 
roll calculations. His arithmetical powers so overwhelmed his schoolmasters that by the 
time Gauss was nine years old, they admitted that there was nothing more they could 
teach the boy. It is said that in his rst arithmetic class, Gauss astonished his teacher by 
instantly solving what was intended as a long “busy work” problem, “Find the sum of 
all the numbers from | to 100.” The young Gauss later confessed to having recognized 
the pattern 


1+ 100 = 101, 2+ 99 = 101, 3+4+98 = 101,..., 50+ 51 = 101. 


Because there are 50 pairs of numbers, each of which adds up to 101, the sum of all the 
numbers must be 50- 101 = 5050. This technique provides another way of deriving the 
formula 

n(n + 1) 
— > 


for the sum of the rst 7 positive integers. One need only display the consecutive integers 
1 through 7 in two rows as follows: 


1L+24+34---+n 


Adding the vertical columns produces n terms, each of which equals n + 1; when these 
terms are added, we get the value n(n + 1). Because the same sum is obtained on adding 
the two rows horizontally, we arrive at the formula 


n(n +1) =2(11+24+3+4---+n). 


This brilliance aroused the attention of in uential people, particularly Duke Fer- 
dinand of Brunswick, who became and remained Gauss’s patron through many years. 
The duke’s generosity made it possible for the young genius to attend Caroline Col- 
lege, a preparatory school in Brunswick, and later the University of Géttingen, where 
he remained only three years (1795-1798). During his student days at Caroline College, 
Gauss formulated the method of least squares, by which the most likely value of a vari- 
able quantity could be estimated from many random observations. He shared the honor 
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of this discovery with Legendre, who rst published the device independently in 1806 
in his Nouvelles methodes pour la determination des orbites des cometes. When Gauss 
entered Gottingen, he was still undecided whether to become a mathematician or pursue 
a career in classical languages. March 30, 1796, marked the turning point in the choice 
of studies. On that day, when Gauss was still not 20 years old, he made a dramatic 
discovery that de nitely decided him in favor of mathematics. 

The problem of constructing regular polygons using only “Euclidean tools,” that is 
to say, straightedge and compass, had long been laid aside in the belief that the ancients 
had exhausted all the possible constructions. What Gauss demonstrated in 1796 was that 
the regular polygon of 17 sides is so constructible—the rst such advance since the time 
of Euclid. Gauss was so proud of this discovery that he expressed the wish to have 
a regular 17-sided polygon carved on his tombstone, just as Archimedes’ grave was 
decorated with the gure of a sphere inscribed in a cylinder. Although this desire was 
not carried out, there is a 17-pointed star inscribed on the base of a monument erected 
in Gauss’s memory in his native town of Brunswick. Apparently the stonemason refused 
to chisel a regular 17-sided polygon because he feared that the resulting gure would be 
indistinguishable from a circle. 

Later, in the impressive Disquisitiones Arithmeticae of 1801, Gauss proved that a 
regular polygon of p sides, where p is an odd prime, is constructible by straightedge 
and compass if p is of the form 2” + 1. As we have already pointed out, the values of 
2% +1 for k = 0, 1,2, 3, 4, namely, 


3,5, 17,257, and 65,537 


are all prime. Euler showed that 2? + 1 admits the factor 641; mathematicians now be- 
lieve that 2” + 1 is composite for all k > 5, although there has yet been no proof of this. 

At Gottingen, Gauss studied mathematics under Abraham Kastner; but since Kastner 
showed little understanding of Gauss’s research, he tended to work quite independently 
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of his teachers. There is a story that Gauss tried to interest Kastner in his construction 
for the regular 17-gon by pointing out that he had solved a seventeenth-degree algebraic 
equation to perform the construction. The professor, seeking to dismiss Gauss, replied that 
the solution was impossible. For this rebuff Gauss repaid Kastner, who prided himself on 
being something of a poet, by declaring him to be the best mathematician among poets 
and the best poet among mathematicians. 

In 1798, Gauss returned to Brunswick, where he earned a somewhat precarious 
living by private tutoring. When he was unsuccessful in getting pupils, Duke Ferdinand 
granted him a_ xed pension so that he could devote himself entirely to scientic work 
unhampered by nancial worries. During this period, Gauss frequently consulted the 
mathematical library of the University of Helmstadt and there made the acquaintance 
of Johann Fredrich Pfaff, then the best-known mathematician in Germany. Gauss’s rst 
published scientic work was his famous dissertation, nominally written under Pfaff, 
on the basis of which he was granted his doctorate from Helmstadt without the usual 
examination. The doctoral thesis, bearing the title “New Proof of the Theorem That 
Every Integral Rational Function of One Variable Can Be Decomposed into Real Factors 
of the First or Second Degree,” gave the rst substantial proof (although not rigorous by 
modern standards) of the so-called fundamental theorem of algebra. This theorem states 
that every polynomial equation 


x” tax" !|4+..-4+a, 1x +a, =0 


in which the coef cients are real or complex numbers has at least one root in the complex 
eld, from which it follows that such a polynomial of degree n has precisely n roots 
(which may not be real or distinct). Gauss returned to the topic several times later on. 

Although the name fundamental theorem of algebra appears to have been intro- 
duced by Gauss, the result itself was familiar, having resisted attempted demonstrations 
by d’Alembert (1746), Euler (1749), and Lagrange (1772). Many historians seem will- 
ing to give Albert Girard priority in the formulation of the fundamental theorem, for in 
his L Invention nouvelle en l’algébre of 1629 he asserted, “Every equation of algebra 
has as many solutions as the exponent of the highest term indicates.” Jean d’Alembert 
(who wrote many of the mathematical articles in the great French Encyclopédie) made 
the rst attempt to prove it. But he was not able to show even the existence of a root 
of the equation, only the form that the root takes. However, the “proof” was so widely 
accepted that the theorem came to be known, at least in France, as d’Alembert’s theorem. 
Euler’s contribution lay in showing that the complex roots occur in conjugate pairs; that 
is, when a root a + b./—I occurs, there is also a conjugate root a — b./—1, so that the 
polynomial equation must contain a quadratic factor x? + px +q with real coef cients. 
Gauss’s original argument (1799) involved geometric considerations, but because his 
contemporaries wished an entirely algebraic proof, he gave three more demonstrations 
of the fundamental theorem, the last when he was 70 years old. The third proof (1816) 
necessitated using complex integrals and showed Gauss’s early mastery of the new com- 
plex analysis. In the fourth proof (1849), which was really a worked-over version of the 
rst, Gauss used complex numbers more freely, because as he said, they “are nowadays 
apparent to all.” 

From the fundamental theorem of algebra, Gauss went on to a succession of triumphs, 
each new discovery following on the heels of the previous one. The number-theoretic 
ideas that had been piling up since his days at Caroline College were nally brought 
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together in Disquisitiones Arithmeticae, a work that instantly won Gauss recognition as 
a mathematical genius of the rst order. 

But the most extraordinary achievement of Gauss was more in the realm of theoretical 
astronomy than of mathematics. On the opening night of the nineteenth century, January 
1, 1801, the Italian astronomer Piazzi discovered the rst of the so-called minor planets 
(planetoids or asteroids), later called Ceres. After this newly found body, visible in its 
course only by telescope, passed the sun, neither Piazzi nor any other astronomer could 
nd it again. Piazzi’s observations extended over 41 days, during which the orbit swept 
out an angle of only nine degrees. From the scanty data available, Gauss was able to 
calculate Ceres’s orbit with amazing accuracy, and the elusive planet was rediscovered at 
the end of the year in almost exactly the position he had forecast. This success brought 
Gauss worldwide fame and led to the offer of an appointment at the St. Petersburg 
Academy, a post he declined. 

Gauss’s tour de force on Ceres has often been viewed as a disaster for mathematics. 
Just when he was experiencing an upsurge of ideas and publications in various elds of 
mathematics, he became absorbed in astronomy, a subject that was to remain his chief 
preoccupation for the next 20 years. He now developed a general theory of planetary 
and cometary orbits, taking into account not only the main gravitational attraction of the 
sun, but also the minor gravitational forces of the other planets. As the prying telescope 
picked up the rst of a series of planetoids that lay between Mars and Jupiter—Pallas 
(1802), Juno (1804), and Vesta (1807)—Gauss immediately calculated the orbit of each 
new discovery. Today more than 1500 of these planetoids have been identi ed. 

Using his superior methods, Gauss redid in an hour’s time the calculations on which 
Euler had spent three days, and which sometimes are said to have led to Euler’s loss 
of sight in one eye. Gauss remarked unkindly, “I should also have gone blind if I had 
calculated in that fashion for three days.” The method of orbital determination, including 
the use of least squares in minimizing the inevitable errors of observation, was published 
in 1809 as his second masterpiece, Theoria Motus Corporum Coelestium in Sectionibus 
Conicus Solem Ambietium (Theory of the Motion of the Heavenly Bodies About the Sun 
in Conic Sections). It was rst written in German, but at the request of the publisher, 
who possibly believed that it would be read for centuries, was translated into Latin. The 
triumph of Gauss’s methods was the calculation (in 1846 and so within Gauss’s lifetime) 
of the orbit of the most distant planet, Neptune. Neptune was discovered as the result of 
observed disturbances in the motion of Uranus, and it was the rst planet to be discovered 
on the basis of theoretical calculation. 

When the patron of his student days, the Duke of Brunswick, was killed in the 
Battle of Jena (1806) leading the Prussian armies against Napoleon, Gauss was forced to 
seek some reliable livelihood to support his family. Reluctant to see Germany lose the 
most renowned mathematician in the world to St. Petersburg, his friends secured Gauss’s 
appointment as director of the newly built observatory at the University of Géttingen, a 
position that he held until his death nearly a half-century later. It is said that except for a 
visit to Berlin in 1836 to attend a scientic meeting, Gauss never slept under any other 
roof than that of his own observatory. 

Napoleon had a remarkable triumph at Jena, where he was victorious with only 
26,000 men against Brunswick’s 60,000. After this victory, Napoleon was free to reor- 
ganize northern Germany. By the Treaty of Tilsit in 1807, portions of Hanover, Hesse, 
and Brunswick west of the Elbe were given to the new Kingdom of Westphalia, to be 
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ruled by Napoleon’s youngest brother. In the following years, French war levies were 
severe, and Gauss, as a professor at G6ttingen, was assessed an involuntary contribution 
of 2000 francs. The sum was exorbitant in that day’s currency and quite beyond Gauss’s 
ability to pay, particularly so because wage payments at Géttingen were frequently sus- 
pended. Laplace wrote to Gauss that he had himself paid the 2000-franc ne into the 
Treasury at Paris, considering it an honor to lift this underserved burden from the shoul- 
ders of a friend. As soon as Gauss could, however, he repaid Laplace with interest at 
the current rate of exchange. For the rest of his life, Gauss resented the French, refus- 
ing to publish in their language and pretending ignorance of French when speaking to 
Frenchmen whom he did not know. 

There was at least one other occasion when a French mathematician interceded with 
the authorities on Gauss’s behalf. Sophie Germain (1776-1831), fascinated by Gauss’s 
masterpiece on the theory of numbers, the Disquisitiones Arithmeticae (1801), had begun 
sending him some of the results of her own mathematical investigations. Because she 
feared that he might be prejudiced against a woman mathematician, she used the pen name 
M. Leblanc; and indeed Gauss did not suspect the true identity of M. Leblanc until 1806. 
The French forces had occupied Brunswick and were besieging Hanover. Concerned 
over the great man’s safety, Germain asked the general commanding the French troops, 
a family friend, to send an of cer to see how Gauss was faring. It developed that Gauss 
was safe and well, but he denied any knowledge of Mlle. Germain because, after all, 
his only contact had been with M. Leblanc. The confusion was nally put right in an 
exchange of letters. The irony is that despite their extensive correspondence, Germain and 
Gauss never met; and before the University of Géttingen could award her the honorary 
doctor’s degree that Gauss had recommended, Sophie Germain died. 

Concurrently with his purely theoretical research in mathematics, Gauss pursued 
work in several related scientic elds, notably physics, mechanics, and theoretical 
astronomy. In the year 1831, Wilhelm Weber (1804-1891) was called to Géttingen at 
Gauss’s suggestion to assume a chair in physics. There he became Gauss’s collaborator 
in investigating the intensity of the earth’s magnetic force. Gauss and Weber were 
the rst to communicate by electromagnetic telegraph when, in 1833, they connected 
the astronomical observatory and the physics laboratory with a mile-long double 
wire. Shortly thereafter, it was accidentally discovered (in Bavaria) that it was quite 
unnecessary to lay two wires, as the earth provided a return for the current. More 
ef cient methods came in 1837 with the work of Samuel Morse. 

By the mid-1800s, mathematics had grown into an enormous and unwieldy structure, 
divided into many elds in which only the specialist knew the way. Gauss was the last 
complete mathematician, and it is no exaggeration to say that he was in some degree 
connected with nearly every aspect of the subject. His contemporaries regarded him 
as Princeps Mathematicorum (Prince of Mathematicians), on a par with Archimedes 
and Isaac Newton. This is illustrated in a small incident: On being asked who was 
the greatest mathematician in Germany, Laplace answered, “Why, Pfaff.” When the 
questioner indicated that he would have thought Gauss was, Laplace replied, “Pfaff is 
by far the greatest in Germany, but Gauss is the greatest in all Europe.” 

Although Gauss adorned every branch of mathematics, he always held number theory 
in high esteem and affection. He is reputed to have insisted, “mathematics is the Queen of 
the sciences, and the theory of numbers is the Queen of mathematics.” The publication of 
Disquisitiones Arithmeticae (Arithmetical Investigations), which appeared in 1801 when 
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Gauss was 24 years old, laid the foundations of modern number theory. Legend has it 
that a large part of the Disquisitiones Arithmeticae had been submitted as a memoir to 
the French Academy the previous year and had been rejected in a manner that, even 
had the work been as worthless as the referees believed, would have been inexcusably 
tude. (In an attempt to lay this defamatory tale to rest, the ofcers of the academy 
made an exhaustive search of their permanent records in 1935 and concluded that the 
Disquisitiones was never submitted, much less rejected.) “It is really astonishing,” said 
Leopold Kronecker, “to think that a single man of such young years was able to bring to 
light such a wealth of results, and above all to present such a profound and well-organized 
treatment of an entirely new discipline.” 

In gratitude for all that the Duke of Brunswick had done for him, Gauss dedicated 
the Disquisitiones to his benefactor. “Were it not for your unceasing bene ts in support 
of my studies,” said Gauss in the lengthy and owery dedication, “I would not have 
been able to devote myself totally to my passionate love, the study of mathematics.” The 
words were not empty attery, for if ever a patron deserved the homage of a protégé, 
Ferdinand deserved that of Gauss. The duke had nanced the education of the young 
prodigy, granted him a yearly pension on completion of his studies, and even paid for 
the printing of his doctoral dissertation. 


The Legacy of Gauss: Congruence Theory 


In the rst chapter of Disquisitiones Arithmeticae, Gauss introduced the concept of 
congruence and the notation that made it such a powerful technique. He explained that he 
was induced to adopt the symbol = because of the close analogy with algebraic equality. 
According to Gauss, “If a number 1 measures the difference between two numbers a 
and b, then a and b are said to be congruent with respect to n; if not, incongruent.” In 
the form of a de nition, this becomes the following. 


Definition 


Let n be a fixed positive integer. Two integers a and b are said to be congruent modulo 
n, symbolized by 


a = b (mod n) 


ifn divides the difference a — b, that is, provided that a — b = kn for some integer k. 


To x the idea let us consider the case n = 7. It is routine to check that 
3 = 24 (mod 7), —31 = 11 (mod 7), —15 = —64 (mod 7), 


because 3 — 24 = (—3)7, —31 — 11 = (—6)7, and —15 — (—64) = 7-7. On the other 
hand, if n { (a—5), then, we say, a is incongruent to b modulo n, and we write 
a # b (mod n). For example, 25 ¥ 12 (mod 7), since 7 fails to divide 25 — 12 = 13. 

It is to be noted that any two integers are congruent modulo 1, whereas two integers 
are congruent modulo 2 when they are both even or both odd. Because congruence 
modulo | is not interesting, the usual practice is to assume that n > 1. 

There is a useful characterization of congruence modulo n in terms of remainders 
on division by a. 
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THEOREM For a and b arbitrary integers, a = b (mod n) if and only if a and b leave the same 
nonnegative remainder when divided by n. 


Proof. First, take a = b (mod n), so that a= b+ kn for some integer k. On 
division by n, we nd that b leaves a certain remainder r,; that is,b =qn+r, 
where 0 <r <n. Therefore, 

a=b+kn=(qn+r)+hn=(q+hkhn-+r, 


which indicates that a has the same remainder as b. 
On the other hand, suppose we can write a = qin +r and b = qon +r, with 
the same remainder r(0 <r <n). Then 
a—b=(qin+r)—(qan+r)= (qi -— Qn, 


whence n|(a — b). In the language of congruences, this says that a = b (mod n). 


a 


Example. Because the integers —56 and —11 can be expressed in the form 
—56 = (—7)9 +7, —11 =(—2)9+7, 


with the same remainder 7, by the foregoing theorem, —56 = —11 (mod 9). In the 
other direction, the congruence —31 = 11 (mod 7) implies that —31 and 11 have the 
same remainder when divided by 7. This is clear from the relations 


Sata(=5)7 +4, 11=1-74+4. 


Congruence can be viewed as a generalized form of equality, in the sense that its 
behavior with respect to addition and multiplication is reminiscent of ordinary equality. 
Some of the elementary properties of equality that carry over to congruences appear in 
the next theorem. 


THEOREM Letn > 0 be fixed and a, b,c, d be arbitrary integers. Then the following properties 
hold. 
1. a=a (mod n). 
2. Ifa =b (mod n), then b =a (mod n). 
3. Ifa =b (mod n) and b =c (mod n), then a = c (mod n). 
4 


If a = b (mod n) and c = d (mod n), then a+ c=b-+d (mod n) and 
ac = bd (mod n). 


If a = b (mod n), then a+ c =b+c (mod n) and ac = bc (mod n). 


o 


If a = b (mod n), then a* = b* (mod n) for any positive integer k. 


Proof. For any integer a, we have a —a =0-n, so that a =a (mod n). Now if 
a =b (mod n), then a — b = kn for some integer k. Hence, 
b —a = —(kn) = (—A)n, and since —k is an integer, this yields property (2). 
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Property (3) is slightly less obvious. Suppose that a = b (mod n) and also b=c 
(mod n). Then there exist integers h and k satisfying a — b = hn and b—c=kn. It 
follows that 


a-—c=(a—b)+(b-c)=hn+kn=(h+Ah)n, 


in consequence of which a = c (mod n). 

In the same vein, if a = b (mod n) and c =d (mod n), then we are assured that 
a—b=kn and c—d=kyn for some choice of k; and ky. Adding these equations, 
one gets 


(a+c)—(6+ 4) =(a—b)+(c-—d)=khn+hkyn = (ki +h, 


or as a congruence statement, a +c = b+d (mod n). As regards the second 
assertion of property (4), note that 


ac = (b+ k\n)(d + kyn) = bd + (bkn + dky + kikon)n. 


Because bk + dk; + k,kon is an integer, this says that ac — bd is divisible by n, 
whence ac = bd (mod n). 

The proof of property (5) is covered by (4) and the fact that c = c (mod n). 
Finally, we obtain property (6) by making an induction argument. The statement 
certainly holds for & = 1, and we shall assume it is true for some xed k. From 
property (4), we know that a = b (mod n) and a‘ = b* (mod n) together imply that 
aa‘ = bb* (mod n), or equivalently, a“+! = b*+! (mod n). This is the form the 
statement should take for k + 1, so the induction step is complete. 


| 


Before going further, you will want to know the great help that congruences can be 
in lightening labor when you are trying certain types of computations. 


Example. Let us show that 641 divides 2? + 1, or in terms of congruence, that 
2? + 1 =0 (mod 641). (This is the case that refutes Fermat’s claim that all numbers 
of the form 27" + 1 are prime.) We rst observe that 5 - 27 = 640 = —1 (mod 641). 
According to part (6) of the last theorem, both sides of this congruence can be raised 
to the fourth power, to give 


2 = (=1)" = 1 (mod 641), 
But 54 = 625 = —16 = —(2*) (mod 641), which can be written 
2* = —(5*) (mod 641). 
These two congruences, together with parts (3) and (5) of the theorem, imply that 
27 41=24.2% 41 = —(54)22 + 1 = -1+ 1 =0 (mod 641). 
Thus, 641|2?° + 1, as desired. 


Example. For another illustration in the same spirit, we shall conrm that the 
Mersenne number 2% — 1 is divisible by 167, a fact Euler noticed in 1738. One 
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approach to the calculation is through repeated squarings. We nd that 


28 = 256 = 89 (mod 167), 

2'6 = go? = 7921 = 72 (mod 167), 
232 = 722 = 5184 =7 (mod 167), 
2% = 49 (mod 167), 


and so 2°7 = 23. 2° = 8. 49 = 392 = 58 (mod 167). Combining these results, we 
obtain 


28 _ 1 = 2'6.2°7 |] = 72.58 —1 = 4167 — 1 = 4166 = 0 (mod 167), 


which shows that 167|2°? — 1. 


We shall now prove an earlier assertion regarding the nal digits of even perfect 
numbers, namely, that all such numbers terminate in either 6 or 8. As a prelude, let us 
observe that any positive integer is congruent, modulo 10, to its last digit. The reason is 
that a positive integer n, expressed in decimal notation, takes the form 


n = a;10" + ay, 10°' +-.+ +.4)10+ ap, 


where each a; is an integer satisfying 0 < a; < 9, and the coefcients ao, a,,..., a, are 
the digits of the number n. Because 10 = 0 (mod 10), we know by the theorem on the 
arithmetic of congruences that 10’ = 0' = 0 (mod 10). Therefore, by parts (4) and (5) of 
the same theorem, we conclude that 


n=ayp-O+ap_1-O+--- +a, -0+ a9 = ap (mod 10). 


Thus, to determine the nal digit of a number, it suf ces to nd the number’s congruence 
modulo 10. 


An even perfect number n ends in the digit 6 or the digit 8; that is, n = 6 (mod 10) or 
n = 8 (mod 10). 


Proof. Being an even perfect number, 7 looks like n = 2*~!(2* — 1), where the 
factor 2* — 1 is a prime. According to an earlier result, the exponent & must also be 
prime. If k = 2, then n = 6, and the asserted theorem holds. We can therefore 
con ne our attention to the case k > 2. The proof falls into two parts, according as k 
takes the form 4m + | or 4m +3. 

If k is of the form 4m + 1, then 


n= 24m (2am = 1) 
= 8+ = gan = 9% 162” — 16”. 


A straightforward induction argument will make it clear that 16’ = 6 (mod 10) for 
any positive integer ¢. Using this congruence, we get 


n=2-6—6=6 (mod 10). 
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LEMMA 


COROLLARY 


Now in the case in which k = 4m + 3, 
n= Qant2 (24m+3 _ 1) 


= 28m+5 = gam+2 =: 162+! —4.16". 


Recalling that 16’ = 6 (mod 10), we see that 
n=2-6—4-6=-—12=8 (mod 10). 


Consequently, every even perfect number has a last digit equal to 6 or 8. 


| 


We have seen that if a = b (mod n), then ca = cb (mod n) for any choice of the 
integer c. The converse, however, fails to hold. For an example as simple as any, note that 
2-4=2.-1 (mod 6), while 4 ¥ | (mod 6). We emphasize that one cannot unrestrictedly 
cancel a common factor in the arithmetic of congruences. 

With some suitable precautions, however, cancellation can be allowed. One setting 
is provided by the following lemma. 


If ca = cb (mod n), and gcd(c,n) = d, then a = b (mod n/d). 


Proof. By assumption, we can write 
c(a— b)=ca—cb=kn 


for some integer k. Because gced(c, n) = d, there exist relatively prime integers r 
and s satisfying c = dr,n = ds. When these values are substituted in the displayed 
equation and the common factor d is canceled, the net result is 


r(a— b)=ks. 


Hence, s|r(a — b) and ged(r, s) = 1. Euclid’s lemma tells us that under these 
circumstances s|a — b, which can be recast as a = b (mod s). In other words, we 
have a = b (mod n/d). 


| 


When the requirement that gcd(c,) = 1 is added, the cancellation can be accom- 
plished without a change in modulus. 


If ca = cb (mod n) with gcd(c,n) = 1, then a = b (mod n). 


For an illustration of the lemma, consider the congruence 33 = 15 (mod 9), which 
is the same as 3- 11 = 3-5 (mod 9). The lemma indicates that since ged (3, 9) = 3, then 
11 =5 (mod 3). 

Gauss, making an analogy with algebraic equations of the rst degree, devoted the 
second section of the Disquisitiones Arithmeticae to the theory of linear congruences: An 
equation of the form ax = b (mod 7) 1s called a linear congruence, and by a solution 
of such an equation we mean an integer x9 for which axp = b (mod n). By de nition, 
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axo = b (mod n) if and only if n|(axo — 6), or what amounts to the same thing, if and 
only if axo —b = nyo for some integer yo. Thus, the problem of nding all integers 
satisfying the linear congruence ax = b (mod 7) is identical with obtaining all solutions 
of the linear diophantine equation ax — ny = b. 

It is convenient to treat two solutions of ax = b (mod n) that are congruent modulo 
n as “equal” even though they are not equal in the usual sense. For instance, x = 3 
and x = —9 both satisfy the congruence 3x = 9 (mod 12). Because 3 = —9 (mod 12), 
they are not counted as different solutions. In short, when we refer to the number of 
solutions of ax = b (mod n), we mean the number of incongruent integers satisfying 
this congruence. 

With these remarks in mind, we can easily formulate the principal result on linear 
congruences. 


The linear congruence ax = b (mod n) has a solution if and only if d\b, where 
d = gcd(a,n). If d\b, then it has d mutually incongruent solutions modulo n. 


Proof. We have already observed that the given congruence is equivalent to the 
linear diophantine equation ax — ny = b. From Section 5.3, it is known that the 
latter equation can be solved if and only if d|b; moreover, if it is solvable and x9, yo 
is one specic solution, then any other solution has the form 


es A de oo 
x=x Saks _ mats: 
0 | y= 70 7 
for some choice of ft. 

Among the various integers satisfying the rst of these formulas, consider those 
that occur when ¢ takes on the successive values tf = 0,1,2,...,d—1: 


2n (d —1)n 


n 
Xo, X09 + =, X0 + -Xo+ 7 


d a oe 
We claim that these integers are incongruent modulo n, while any other such integer 


x is congruent to some one of them. If it happened that 


47; = 4 on ( d 
9 + Gh = x0 + Gh (mo n), 


where | < t; < tf: < d — 1, then one would have 


“n= d 

gil = Giz (mod n). 

Now gcd (n/d,n) = n/d, and so by the last lemma, the factor n/d could be 
canceled to arrive at the congruence 


t; = th (mod d), 


which is to say that d|f) — t,. But this is clearly impossible in view of the inequality 
0<th—-t <d. 

It remains to argue that any other solution x9 + (n/d)t is congruent modulo 7 to 
one of the d integers listed. The division theorem permits us to write the number ¢ 
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COROLLARY 


as t= qd+r, where 0 <r <d-—1. Hence 
n n 
xo + par ge) 
es 
=xXo+ 7 (mod n), 


with x9 + (n/d)r as one of our d selected solutions. This ends the proof. 


| 


The argument in this theorem brings out a point worth stating explicitly: If xo is any 
solution of ax = b (mod n), then the d = gcd(a,n) incongruent solutions are given by 


ae a gen 
X09, Xo ae ree d . 


For your convenience, here is the form the theorem takes in the special case in which 
a and n are assumed to be relatively prime. 


If gcd(a,n) = 1, then the linear congruence ax = b (mod n) has a unique solution 
modulo n. 


We now pause to look at two concrete examples. 


Example. Consider rst the linear congruence 18x = 30 (mod 42). Because 
gcd (18, 42) = 6 and 6 surely divides 30, the foregoing theorem guarantees the exis- 
tence of exactly six solutions, which are incongruent modulo 42. By inspection, one 
solution is found to be x = 4 (mod 42). Our theory tells us that the six solutions are 
as follows: 


42 
x=4+—t=447t (mod 42),  £=0,1,...,5, 


or plainly enumerated, 


x = 4,11, 18, 25, 32, 39 (mod 42). 


Example. Let us solve the linear congruence 9x = 21 (mod 30). At the outset we 
know, since ged (9, 30) = 3 and 3|21, that there must be three incongruent solutions. 
Because the congruence 9x = 21 (mod 30) is equivalent to the diophantine equation 


9x — 30y = 21, 


we can begin by expressing 3 = gcd(9, 30) as a linear combination of 9 and 30. 
Among the various possibilities, we see that 3 = 9(—3) — 30- 1, and so if both sides 
are multiplied by 7, 


21 =7-3 = 9(—21) — 30(—7). 


Modulo 30, this becomes 9(—21) = 21 (mod 21). Thus, x = —21 (mod 30) satis- 
es the congruence 9x = 21 (mod 30), or if one prefers to have positive numbers, 
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x = 9 (mod 30) is a solution. With this known, the three incongruent solutions can 
be obtained from the formula 


x =9+ 2 = 9+ 10¢ (mod 30), f= 0, 1,2. 


We end with x = 9, 19, 29 (mod 30) as the three required solutions. 


Dirichlet and Jacobi 


Gustav-Peter Lejeune Dirichlet (1805-1859) was one of the most gifted German 
mathematicians in the generation after Gauss. Although his interests covered many areas, 
he devoted himself primarily to the study of numbers. Dirichlet was born near Cologne, 
in the town of Ditiren where his father served as the local postmaster. He graduated 
from a Jesuit college in Cologne when he was only 16; there, his teacher in mathe- 
matics was Georg Ohm (later known for Ohm’s Law describing resistance in electrical 
circuits). Aside from the presence of Gauss at Géttingen, advanced mathematical educa- 
tion in Germany was then at a low ebb. But Paris had a number of prominent gures, 
including Legendre, Fourier, Cauchy, and Poisson. Dirichlet spent the years 1822-1826 
in the French capital, in what might be considered graduate study, attending lectures 
and supporting himself as tutor to the children of a general. Somehow he managed to 
acquire a copy of Gauss’s Disquisitiones Arithmeticae, which he continually carried and 
reread—some say he even slept with it under his pillow. 

In 1825, Dirichlet presented his rst paper before the Académie des Sciences, where 
Legendre was permanent secretary. The topic dealt with diophantine equations of the 
form x° + y° = Az°. Building on Dirichlet’s techniques, Legendre succeeded within a 
few weeks in showing that Fermat’s equation x” + y” =z" had no integer solution 
(xyz 4 0) when n = 5S. Dirichlet returned to this famous problem seven years later when 
he published a proof that there was no solution in the case n = 14. 

In the early 1800s, Germany was not a unied country but rather a collection of 
states of various sizes, the greatest of which was Prussia. In France all the mathematical 
activity was concentrated in Paris, but in Germany it was widely diffused among the state 
universities. The leading universities had recently instituted reforms that would eventually 
make them a major intellectual force in Europe. Perhaps re ecting the renown of the 
Ecole Polytechnique, great importance was now placed on original research. New faculty 
members were required to have a doctoral degree, for this would indicate that the holder 
had some experience in advancing knowledge. Dirichlet returned to his native Germany 
in the fall of 1826. The University of Bonn awarded him an honorary doctorate, and 
a position of privatdozent was created for him at Breslau. (Literally “private teachers”, 
privatdozents received no salaries and were dependent on student gratuities; they were 
in a sense the recruits from among whom the professors were ultimately chosen.) On 
his way to Breslau, Dirichlet made a stop at Géttingen to meet Gauss for the rst time. 
Gauss praised the talented young man—still only 22 years of age—and hoped that he 
would some day join the staff at Géttingen, a desire that was realized years later. 

German professors and students were relatively free to migrate from one university 
to another. Thus, after a year in Breslau, Dirichlet obtained a more suitable appointment 
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at the Military Academy in Berlin. This was quickly combined with an entry post at 
Prussia’s central academic institution, the University of Berlin. He remained there, rising 
through the ranks, for the next 26 years. Dirichlet, along with Carl Jacobi and the ge- 
ometer Jacob Steiner, helped transform the University of Berlin into a growing leader in 
mathematical studies for the German people. 

Because Dirichlet had not written or defended a doctoral dissertation in Latin, his 
starting salary had been rather modest, and, though adjusted on several occasions, it 
remained below the of cial scale. Moreover, he had been unable to free himself from 
the burdensome teaching at the Military Academy, which left little time for his own 
research. When Gauss died in 1855, Dirichlet was not hesitant in accepting the offer 
to take up the vacant mathematics chair at Gottingen. His stay there was cut short by 
a severe heart attack three years later. After his wife—the sister of the composer Felix 
Mendelssohn—died unexpectedly, Dirichlet died also within a few months. 

Dirichlet is said to have been the rst person to master Gauss’s dif cult-to-read 
Disquisitiones Arithmeticae. By reworking and simplifying the contents in his classroom 
lectures, he hoped to make the material accessible to other mathematicians. Dirichlet’s 
great work, the Vorlesungen tiber Zahlentheorie, was brought out in 1863 by his friend 
and former student Richard Dedekind. Based on Dirichlet’s lectures, the book was for 
the most part written by Dedekind. As it contained many new results, the treatise was the 
most important work on number theory after the Disquisitiones Arithmeticae. Dedekind 
appended supplements to the subsequent three editions, the last in 1894, which incorpo- 
rated his own investigations. 

In 1837, Dirichlet established the remarkable theorem that goes by his name today. 
This result on the existence of prime numbers in a given arithmetic progression had been 
conjectured (1808) by Legendre, who tried to give a proof but failed. It asserts that any 
arithmetic progression of integers 


a,at+b,a+2b,a+3b,...,a+kb,... 


in which ged(a, b) = | must contain an innite sequence of primes. The condition 
gcd (a, b) = | is essential, for otherwise there would be no primes at all in the progression. 
Indeed, if ged (a, b) = d > 1, then any term could be written as a + kb = d(a/d + kb/d), 
where a/d and b/d are both integers. 

Dirichlet’s Theorem indicates, for example, that there are in nitely many prime 
numbers ending in 999, such as 1999, 100999 and 1000999. These primes appear in the 
arithmetic progression 999 + 1000k, with gcd (999, 1000) = 1. 

Carl Gustav Jacobi (1804-1851) was a further contributor to the revival of German 
mathematics during this period. He was born and raised in Potsdam, near Berlin, the son 
of a wealthy Jewish banker. Something of a prodigy, Jacobi was prepared to enter the 
University of Berlin when 12 years old, but the authorities would not allow this until he 
reached 16 years of age. When there, he concluded that the existing mathematics courses 
were too elementary and preferred instead to study the works of Euler, Lagrange, and 
Gauss by himself. Jacobi gained his doctorate in 1824 and then began his university career 
as a privatdozent at Berlin. After a stay of only a year, he transferred to a salaried position 
at the University of Konigsberg. He remained there for the next 18 years, becoming a 
full professor at the early age of 23. 
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Unlike his contemporary Gauss, who abhorred the classroom, Jacobi was considered 
the most inspiring teacher of his generation. He instilled in his listeners an appreciation 
for research by interweaving the lecture material with the substance of his own investi- 
gations. Jacobi also had a reputation for being unusually blunt and to the point. On the 
occasion of a visit to England, for example, he was asked who was the greatest living 
mathematician in that country, Jacobi replied, “There is none.” With the astronomer Fred- 
erich Bessel, he inaugurated the Ké6nigsberg Mathematical-Physical Seminar, in which 
colleagues and the more able students presented theoretical articles for discussion. The 
innovation later became a central feature of mathematical instruction at the University 
of Berlin. The seminar was quite selective, accepting only a dozen or fewer students 
each year. 

Early in 1843, Jacobi fell seriously ill from a diabetic condition, exacerbated by 
K6nigsberg’s severe climate. He traveled to Italy to recover, and after nine months 
there his health improved enough that he could return to Berlin on a royal pension. As 
a member of the Academy of Science, Jacobi was entitled to lecture on his research 
interests at the University of Berlin, although he was not of cially a faculty member. 

Not even Germany could escape the wave of revolutionary fervor that spread 
over western Europe in 1848. In Germany, the agitation was over more liberal re- 
forms and the creation of a unied country, federated under Prussian leadership. Ja- 
cobi unwisely allowed himself to become involved in the political turmoil to the ex- 
tent that the government retracted his pension. Because his inherited fortune had been 
lost in a bankruptcy, Jacobi, with his wife and their eight children, was reduced to 
living in near poverty. Fortunately, the Prussian government was not prepared to suf- 
fer the ignominy of having Germany’s second greatest mathematician accept a profes- 
sorship in Vienna, and the pension was restored. Jacobi spent the last seven years of 
his life in Berlin. In 1851, at the age of 47 years, he contracted smallpox and died 
within a week. 

Jacobi’s celebrated Fundamenta Nova Theoriae Functionum Ellipticarum, which ap- 
peared in 1829, was the rst textbook on elliptic functions. As such, it was much used in 
European universities. A few years earlier, Jacobi and Niels Abel independently had the 
idea of inverting elliptic integrals to obtain elliptic functions. That is, instead of treating 
the value of an elliptic integral as a function of its upper limit, they considered the upper 
limit as a function of the integral itself. The two talented young mathematicians each 
published papers on this “revolutionary reversal” in 1827, Jacobi’s without any proof 
though he later presented the necessary demonstrations in the Fundamenta Nova. The 
French Académie des Sciences awarded its 1830 grand prize to Abel and Jacobi jointly, 
for their outstanding discoveries. The recognition came too late for Abel, who had died 
from tuberculosis during the preceding year. His share of the prize went to his next 
of kin. 

Although his best work was in the eld of analysis, Jacobi contributed to many 
areas of mathematics. In number theory, for instance, he used elliptic functions to give 
a new proof of Lagrange’s theorem that every positive integer can be expressed as 
the sum of the squares of four integers (2 can be written as 2 = 17+ 17 +07 + 07), 
and determined the number of such representations of a given integer. Jacobi’s lengthy 
memoir De Formatione ex Proprietasibus Determinatum (1841) is a systematic exposition 
of the subject of determinants. In this work we meet the functional determinant of partial 
derivatives, which we now call the “Jacobian.” 


Bibliography 


10.3. Problems 


10. 


11. 


Verify each of the following assertions: 


(a) Ifa=b (mod n) and m|n, then a = b (mod m). 

(b) Ifa=b (mod n) and the integer c > 0, then 
ca = cb (mod cn). 

(c) Ifa=b (mod n) and the integers a, b, and n 
are all divisible by d > 0, then 
a/d = b/d (mod n/d). 

(d) Ifa=b (mod n) and a = b (mod m), where 
gcd(m,n) = 1, then a = b (mod mn). 


Give an example to show that a? = b* (mod n) need 
not imply that a = b (mod n). 


Prove that every integer is congruent modulo n to 
exactly one of the integers 0,1,2,...,m —1. 


Prove that if a is any odd integer, then a? = 1 
(mod 8). [Hint: Note that a = 1, 3,5, or 7 (mod 8).] 


For any integer a, prove the following statements: 


(a) a? =0 or | (mod 3). 
(b) a>? =0,1 or —1 (mod 7). 
(c) a*=0 or 1 (mod 5). 


Determine whether 5*° — 1 is divisible by 13. 


(a) Determine the remainder when 10*? + 5? is 
divided by 7. 
(b) Find the remainder on dividing the sum 


1h 2) 4 Sl Al ass 90) $100) 


by 12. [Hint: Note that 4! = 0 (mod 12).] 
Use congruence theory to show that 47 divides the 


Mersenne number 273 — | and that 223 divides 
OFF: = ih, 


Verify each of the following divisibility statements: 


(a) 3]4"+! 4.57"-! for all n > 1. [Hint: 
antl 4 sent = ntl + (—1)""! (mod 3).] 
(b) 5|7- 16" +3 for all n > 0. 
(c) 7/5 +3-2-? for alln > 1. 
(d) 17|34"*2 +2. 49"*! for all n > 0. 
(e) 27|2"+! + 5"+? for all n > 0. 


Prove that if m > 6 is an even perfect number, then 
n = 4 (mod 6). [Hint: 4” = 4 (mod 6) for any 
m > 1.) 


Prove that if p is a prime and a” = | (mod p), then 
a=1 (mod p) ora= p— 1 (mod p). 
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12. Suppose that the integer n is given in decimal form, 
n= a,10* + a,_,10°! +--+ +.a,10 + ap, 
where 0 < a; < 9 for all 7. Prove that 


(a) n=aj+a,;+---+ a, (mod 9); hence 9 
divides n if and only if 9 divides the sum of its 
digits. 

(b) n =a) —a, tay —--- +(—1)'a, (mod 11); 
hence 11 divides n if and only if 11 divides the 
alternating sum of its digits. 


13. Without actually performing the divisions, check the 
following numbers for divisibility by 9 or 11: 


(a) 113,058. (b) 2,964,357 (c) 176,521,221. 


14. Using Problem 12, nd the missing digit in the 
calculation: 


52,817 - 3,212,146 = 169,655,x 15,282 


15. A palindrome is a number that reads the same 
backward as forward, such as 2662 or 9,351,539. 
Prove that every six-digit palindrome is divisible by 
it. 


16. Solve the following linear congruences: 


(a) 6x = 15 (mod 21). 
(b) 36x = 8 (mod 102). 
(c) 13x = 27 (mod 52). 
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CHAPTER l l 


Nineteenth-Century Contributions: 


Lobachevsky to Hilbert 


Geometry in every proposition speaks a language which experience never dares to utter; and indeed 
of which she but halfway comprehends the meaning. 
WILLIAM WHEWELL 


In writing the Elements, Euclid rested his 


Attempts to Prove the Parallel Postulate imposing structure on certain statements, 
grounded in physical experience, which 

The Efforts of Proclus, Playfair, he regarded as self-evident truths. Distinct 

and Wallis among these postulates is one whose tone 


differs greatly from the terseness and sim- 
ple comprehensibility of the others. This is the famous parallel postulate, which Euclid 
phrased as, “Ifa straight line falling on two straight lines makes the interior angles on the 
same side less than two right angles, the two straight lines, if produced inde nitely, meet 
on that side on which are the angles less than two right angles.” Not only did this state- 
ment lack the quality of self-evidence; it was actually open to doubt. The ancient Greeks 
were aware that certain curves may approach nearer and nearer and yet not meet, as the 
hyperbola approaches but does not meet its asymptotes. They saw no reason why the 
two straight lines in the parallel postulate might not exhibit the same behavior. Evidently 
Euclid himself did not quite trust the postulate, for he postponed using it in his proofs 
until he had reached Proposition 29 of Book I. The wish to remove the unwieldy—and 
suspect—thing from the set of postulates was quite natural. Starting in Euclid’s own 
lifetime, attempts were made to change the de nition of parallel lines, or to replace the 
troublesome postulate by a more acceptable but equivalent assertion, or to deduce it as 
a theorem from the other nine unquestioned axioms. 

Many substitutes for Euclid’s parallel postulate have been suggested through the 
ages. These alternatives are more intuitively appealing statements, but when they are 
carefully examined, they turn out to be logically equivalent to Euclid’s axiom. The most 
frequent substitute is known as “Playfair’s axiom,” although it was stated as early as 
the fth century by Proclus. John Playfair (1748-1819) held a chair in mathematics at 
the University of Edinburgh from 1785 until 1805. His Elements of Geometry was a 
detailed presentation of the rst six books of Euclid, along with a supplement embracing 
the approximate calculation of z and some solid geometry. Finding Euclid’s axiom 
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on parallels unsatisfactory, Playfair proposed replacing it with the following statement: 
Through a given point, not on a given line, only one parallel can be drawn to the 
given line. The popularity of Playfair’s Elements of Geometry, of which 10 editions were 
published between 1795 and 1846, doubtless led to the appearance of his version of the 
postulate in most modern geometry textbooks. 

The parallel postulate, an assertion about intersecting lines, and Playfair’s axiom, an 
assertion about parallel lines, are equivalent statements. This means that they mutually 
imply each other (the other, unquestioned axioms holding). That the parallel postulate 
implies its modern counterpart is evident from our earlier study of Euclidean geometry. 
Thus it remains for us to derive the parallel postulate, accepting as data the remaining 
nine axioms of Euclid’s geometry and Playfair’s axiom. 


Playfair’s axiom implies the parallel postulate. 


Proof. Let the lines / and /' be cut by a transversal ¢ in points P and Q, forming a 
pair of interior angles 41 and 22 on one side of the transversal. Assume that the sum 
of Z1 and Z2 is less than two right angles. 


If 23 is the supplement of 22 on the side of ¢ opposite 41 and 22, then 
23+ 22 = 180° > 21+ 22, 


which implies that 43 > 41. Through P, construct a line /” making with ¢ an angle 
equal to and alternatively interior to 23. (Proposition 23 of Euclid I permits this.) 
Then by Proposition 27, /” is parallel to /. Because 23 > 41, we have /' and /” as 
distinct lines. From Playfair’s axiom, which tells us that only one line can be drawn 
through P parallel to /, we conclude that / and /’ must meet at some point R. If R 
were on the side of t opposite 41 and 22, then 21 would be an exterior angle of 
triangle POR. The exterior angle theorem (Proposition 16) would then guarantee 
that 41 is greater than £3, which is impossible. Thus / and /’ intersect on the side of 
t containing Z1 and 22, proving the parallel postulate. 


a 


Mathematicians from the Greeks onward have compiled a long list of statements 
equivalent to the parallel postulate. Among them are: 


e A line that intersects one of two parallel lines intersects the other also. 
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e There exist lines that are everywhere equidistant from one another. 
e The sum of the angles of a triangle is equal to two right angles. 

e For any triangle, there exists a similar noncongruent triangle. 

e Any two parallel lines have a common perpendicular. 

e There exists a circle passing through any three noncollinear points. 
e Two lines parallel to the same line are parallel to each other. 


Those geometers who were not content to accept Euclid’s postulate on parallels did not 
wish merely to know whether some simpler equivalent assumption could be substituted 
for it (although this question is not without interest). Their real aim was to learn whether 
Euclid’s form of the postulate was deducible from the other nine axioms and the rst 28 
propositions of the Elements, which did not depend on the parallel postulate. 

One of the earliest efforts to prove the parallel postulate was made by Proclus (410- 
485) in his Commentary on the First Book of Euclid’s Elements. After remarking that 
“others before us have classed it among the theorems and demanded a proof of this 
which was taken as a postulate by the author of the Elements,” he went on to point out a 
fallacy contained in a demonstration offered by the noted astronomer Ptolemy and then 
to submit a “proof” of his own. Proclus derived what we call Playfair’s axiom, hence by 
implication, the parallel postulate. In substance, the argument ran as follows. 

Let / be a given line, with P any point not on it. Assume that Q is the foot of the 
perpendicular from P to / and let /' be the line perpendicular to PO at P. The lines / and 
l', forming equal alternate interior angles with PQ, are parallel. Thus it suf ces to show 
that any other line /” through P meets /. Pick any point R on /” in the region between 
I and /', and take S to be the foot of the perpendicular from R to /’. As the point R 
moves along /”, away from P, the length of the segment RS increases. Eventually, this 
length becomes greater than the distance between the parallels, namely, the length of the 
segment PQ. Then R will cross over to the other side of /, so that /” cuts /. This shows 
that /’ is the only line through P parallel to /. 


Q 


It is easy enough to pick out the aw in Proclus’s reasoning. He made free with 
the assumption that two parallel lines are everywhere the same distance apart. (His 
argument would sufce if one merely granted that the perpendicular distance between 
them is bounded.) Yet there is no justication in Postulates 14 for this. Indeed, the 
supposition that parallel lines are a constant distance from one another can be shown to 
imply the parallel postulate, so that Proclus was tacitly assuming a property of parallel 
lines that is equivalent to Postulate 5. Despite his ambitious efforts, he succeeded only 
in begging the question. 

All the numerous and varied attempts to prove the postulate on parallels failed, al- 
though for a time some mathematicians thought they had succeeded. Like Proclus, many 
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failed because their arguments were marred by the use, open or hidden, conscious or 
unconscious, of some assumption equivalent to the fth postulate. Some failed because 
their reasoning was otherwise fallacious. Some of these alleged “proofs,” such as that 
presented by John Wallis (1616-1703), had a certain seductive appeal. While lecturing 
at Oxford University in 1663, Wallis proposed replacing the parallel postulate by a new 
axiom he felt was more plausible and should therefore be given precedence. He sug- 
gested: To each triangle, there exists a similar triangle of arbitrary magnitude. Using this 
assumption and the other axioms of Euclid, Wallis was able to demonstrate that Playfair’s 
axiom holds, hence so does its equivalent, the parallel postulate. He proceeded thus: 

Given a point P not on the line /, construct a line /’ through P parallel to / in the 
familiar way, that is, by dropping a perpendicular PQ to / and erecting a perpendicular 
l' to PO at P. The crux of the argument is to show that if /” is any other line through 
P, then 7” necessarily meets /. Pursuing this aim, pick R to be any point on /” in the 
region between / and /’ and drop a perpendicular RS to PQ. 


Because we are taking Wallis’s axiom as part of our hypothesis, there exists a triangle 
POT similar to triangle PSR and such that T is on the same side of PQ as R is. Then, 
in compliance with the de nition of similar triangles, 2TPQ = ZRPS. The implication 
is that since these angles have the segments PS and PQ as a common side, PR and PT 
coincide. This puts the point 7 on the line /”. Again, by similarity, POT = ZPSR = 90°, 
which makes 7Q a perpendicular to PQ at Q. But / is the unique perpendicular to PO 
at Q, so that T lies on /. Thus / and /” meet at the point 7, allowing us to conclude that 
the only line through P parallel to / is /’. 

Actually Wallis fared no better than Proclus. Although the above “proof” seems per- 
fectly reasonable, it rests on the assumption of the existence of two similar but noncon- 
gruent triangles. This turns out to be another of the equivalents of the parallel postulate. 
Like the many others who tried to demonstrate “the enemy of geometers,” Wallis was 
guilty of the circular reasoning of assuming what he set out to prove. 


Saccheri Quadrilaterals 


This is not the place to review the entire history of the vain efforts to establish the 
truth of Euclid’s fth postulate as a matter of proof; it is enough to note that all inevitably 
failed. Although the results of these investigations were on the whole negative, some 
writers, such as Saccheri, Lambert, and Legendre, made important contributions to what 
we now call non-Euclidean geometry, though each was unaware of the true meaning at 
the time. The work of Girolamo Saccheri (1667-1733), a professor at the University of 
Pavia, deserves attention, because he seems to have been the rst to study the logical 
consequences of an actual denial of the famous postulate. Saccheri was a Jesuit priest 
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who taught in a succession of colleges of his order in Italy. He was considered a brilliant 
teacher and a man of such remarkable memory, we are told, that he could play three 
games of chess at a time without seeing any of the boards. Before going to Pavia in 
1697 to occupy the chair in mathematics, Saccheri taught philosophy for three years 
in Turin. The result of this experience was the publication of a work on logic, Logica 
Demonstrativa (1697), concerning the compatibility of de nitions and postulates. As a 
logician, Saccheri became impressed with the deductive power of reductio ad absurdum, 
or the indirect method of reasoning, used early in the Elements. According to this method, 
one shows that if the desired conclusion were not true, then a contradiction (absurdity) 
would follow. The question of Euclid’s fth postulate interested Saccheri throughout his 
lifetime and it is natural that in searching for material to which his logical principles 
could be applied, he would turn to the problem of parallels. 

In the year of his death, Saccheri published a little treatise of some 101 pages, bear- 
ing the intriguing title Euclides ab omni naevo vindicatus, commonly translated Euclid 
Vindicated of Every Blemish. This was an attempt to clear Euclid of the criticisms arising 
from the doubtful status of the parallel postulate; and the title was perhaps intended in 
response to Sir Henry Savile, who in his Praelectiones on Euclid called the theory of 
parallels a blemish (naevus) on geometry. Saccheri’s novel procedure for vindicating 
Euclid was to assume the parallel axiom to be false and to develop the resulting con- 
sequences, hoping to reach a contradiction; this would entitle him to af rm the parallel 
postulate by reductio ad absurdum. Denying the parallel postulate gave Saccheri a new 
premise for his reasonings and led him to expect more success than his predecessors had 
found. 

The fundamental gure of Saccheri’s investigations was a quadrilateral ABCD in 
which the sides 4D and BC were equal and perpendicular to the base AB. These 
quadrilaterals have subsequently become known as Saccheri quadrilaterals. Without using 
the parallel postulate or any of its consequences, Saccheri was able to prove that the angles 
at C and D, the summit angles of the quadrilateral, were equal. For triangles BAD and 
ABC are congruent by the side-angle-side proposition (Proposition 4 of Euclid I), hence 
BD = AC. But this in its turn makes triangles ADC and BCD congruent by the side- 
side-side proposition (Proposition 8 of Euclid I). As corresponding parts of congruent 
triangles, it follows that ZC = 4D. Now there are three possible alternatives, giving rise 
to three hypotheses: 


1. LC =D > 90° (hypothesis of the obtuse angle). 
2. LC = LD < 90° (hypothesis of the acute angle). 
3. LC =LD=90° (hypothesis of the right angle). 


D Cc 


A B 


Euclid’s parallel postulate is equivalent to the third of these possibilities, so that to assume 
that the summit angles are either obtuse or acute is an implicit denial of the parallel 
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axiom. This is exactly what Saccheri’s proof by reductio ad absurdum demanded, to 
show that the hypothesis of the obtuse angle and the hypothesis of the acute angle both 
led to contradictions. The hypothesis of the right angle, and in consequence the parallel 
postulate, should then hold. 

Saccheri proved that if any one of the three hypotheses were true for one of his 
quadrilaterals, then it would be true for every such quadrilateral. He went on to show 
that according to the hypothesis of the obtuse angle, the hypothesis of the acute angle, 
or the hypothesis of the right angle held, the sum of the angles of a triangle would be, 
respectively, greater than, less than, or equal to two right angles. The reasoning was 
as follows: Given a triangle ABC, let D and E be the midpoints of sides AC and BC, 
respectively. Drop perpendiculars AF, BG, and CH from the vertices A, B, and C to the 
line through D and E. The triangles AFD and CHD are congruent; and also the triangles 
BGE and CHE are congruent by the side-angle-angle proposition (Proposition 26 of 
Euclid I). This means that 4F = CH= BG, making ABGF a Saccheri quadrilateral 
with right angles at F and G. Then 


LFAB = LFAD+ LZDAB = LHCD + LDAB 
and 


LGBA = LGBE + LEBA = LHCE + ZEBA. 


¢ 


The result of adding these equalities is that 


LFAB + LGBA = LDAB + (LHCD + LHCE) + LEBA 
= BAC + LACB + LABC, 


or in words, the sum of the summit angles of the Saccheri quadrilateral ABGF equals the 
sum of the angles of the triangle ABC. Knowing this, one can easily see that if Saccheri’s 
hypothesis of the obtuse angle holds (so that in particular, each of ZFAB and ZGBA is 
obtuse), then the sum of the angles of any triangle is greater than two right angles. If 
the hypothesis of the acute holds, the same sum is less than two right angles. If the 
hypothesis of the right angle holds, this sum equals two right angles. 

Saccheri had no trouble in eliminating the hypothesis of the obtuse angle. After a 
carefully reasoned chain of 13 propositions, he found that the hypothesis of the obtuse 
angle implied the parallel postulate, which in its turn implied that the sum of the angles of 
a triangle would have to equal two right angles. But this second implication contradicts the 
just-demonstrated result that on the supposition of the hypothesis of the obtuse angle, the 
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sum of the angles of a triangle is greater than two right angles. In Saccheri’s own colorful 
words from Euclides Vindicatus: “The Hypothesis of the Obtuse Angle is absolutely 
false because it destroys itself.” We might remark that the unconscious assumption that a 
straight line is in nite played a part in reaching this conclusion, since Euclid’s Proposition 
16 (the exterior angle theorem) was used. 

The task of disposing of the “hostile Acute Angle Hypothesis,” as Saccheri called 
it, was much more elusive. Though he proved one theorem after another, he never found 
the sought-after contradiction. Without realizing it, Saccheri was on the threshold of 
discovering the rst non-Euclidean geometry. Although the theorems he was obtaining 
seemed at variance with experience, they formed a geometry that in internal coherence 
was not inferior to Euclid’s. But Saccheri was so convinced by what he was trying 
to do that he permitted his faith in Euclidean geometry to interfere with his logical 
perseverance. When no contradiction presented itself, the devoted geometer forced his 
mathematical development into an unsatisfactory ending by treating a point at in nity (a 
limit point of the plane) as if it were a point of the plane. Lamely, he concluded that two 
distinct lines that meet at an in nitely distant point can both be perpendicular at that point 
to the same straight line. Saccheri viewed this as a contradiction to Proposition 12 of 
Euclid I, according to which there is a unique perpendicular to a line at each point of the 
line. Triumphantly he announced, “The Hypothesis of the Acute Angle is absolutely false, 
being repugnant to the nature of a straight line.” Saccheri was satis ed that he had nally 
removed the uncertainty surrounding Euclid’s axiom system, declaring at the close of his 
work, “The foregoing considerations seem to me suf cient to clear Euclid of the faults 
with which he has been reproached.” But a violation of intuitive ideas does not make a 
logical contradiction, and the problem of the necessity of the parallel postulate remained. 

Had Saccheri un inchingly admitted that his reasoning led to no contradiction but 
merely results that seemed paradoxical in that they clashed sharply with the familiar 
geometry, he would have anticipated the discovery of non-Euclidean geometry by at least 
100 years. As it was, his treatise was not read widely enough to have much in uence and 
was virtually forgotten until the Italian mathematician Eugenio Beltrami rescued it from 
oblivion in 1889. Beltrami hailed Saccheri as the “Italian precursor of the Hungarian 
Bolyai and the Russian Lobachevsky,” the two individuals who are usually credited with 
the concept of a logically consistent geometry based on a denial of the parallel postulate. 
But the nal discovery of non-Euclidean geometry did not depend on the pioneering 
work of Saccheri, because neither Bolyai nor Lobachevsky seemed to have heard of the 
book or its author. 

Indeed, had Saccheri actually accomplished his purpose and proved the parallel pos- 
tulate from the remaining axioms of Euclidean geometry, he would not have vindicated 
Euclid. Quite to the contrary, he would have dealt a terrible blow to Euclid. Euclid was 
vindicated by the discovery of non-Euclidean geometry, for its existence demonstrates 
that the parallel postulate is independent of Euclid’s other axioms, so that it truly widens 
the axiomatic base on which Euclid’s geometry stands. We must admire the Great Ge- 
ometer all the more; the introduction of the fth postulate, so decidedly unaxiomatic in 
appearance, yet an independent postulate, was a stroke of pure genius. 

There is a notable resemblance between Saccheri’s Euclides Vindicatus and the 
Theorie der Parallellinien (written in 1766) of the German mathematician Johann Hein- 
rich Lambert (1728-1777), a colleague of Euler and Lagrange at the Berlin Academy of 
Sciences. For the fundamental gure of his investigations, Lambert adopted a quadrilateral 
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having three right angles and then examined the hypotheses in which the remaining angle 
was in turn obtuse, acute, and right. Lambert disposed of the obtuse-angle hypothesis 
in much the manner of Saccheri by establishing the contradictory result that the parallel 
postulate could be proved as a theorem from it. He too derived many non-Euclidean 
facts from the acute angle hypothesis, but unlike Saccheri, he was well aware that he 
had reached no contradiction. Lambert showed what no one had noticed before—that in 
this new geometry the angle sum of a triangle increases when the area decreases. (By 
the angle sum of a triangle, we mean the sum of its three interior angles.) He did this by 
proving that the area of a triangle is proportional to its angular defect, that is, the amount 
by which its angle sum falls short of two right angles. In modern symbols, for a triangle 
with angles a, b, and c, he demonstrated that the area A = k*[180° — (a + b +c)], where 
k is a constant of proportionality. 

Although the range of Lambert’s research interests was enormous, he is remembered 
today mainly for having given the rst rigorous proof that the number z is irrational. 
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In a paper presented to the Berlin Academy in 1768, he showed that if x is a nonzero 
rational number, then neither e* nor tanx can be rational (earlier, in 1737, Euler had 
established that e and e? are irrational). Because tanz/4 = 1, a rational number, it can 
be inferred that 7/4, hence z itself, is irrational. Lambert also suggested that e and z 
are transcendental numbers; but proof of this would have to wait a hundred years. 


The Accomplishments of Legendre 


Perhaps the most tireless pursuer of a proof of the parallel postulate was the third 
member of the “great trio” Lagrange, Laplace, and Legendre. Adrien-Marie Legendre 
(1752-1833) came from a well-to-do family in the south of France but spent the greater 
part of his life in Paris. He was educated there at the College Mazarin and from 1775 
until 1780 was a professor at the Ecole Militaire, where Laplace was also teaching. Two 
years after resigning his position to reserve more time for research, Legendre won a prize 
offered by the Berlin Academy with an essay on the path of a projectile in a resisting 
medium (Recherches sur la trajectoire des projectiles dans les milieux résistants). This 
brought him to the attention of the scientic community and secured his appointment to 
the Académie des Sciences, rst as an adjunct member in 1783 and then as an associate 
member in 1785. Following the closing of the Académie in 1793, Legendre was not in- 
vited to cooperate in the reorganization of public education. This may have been because 
he was not in favor with the revolutionary government or because of some other reason. 
His name does not gure among those of the professors at either the Ecole Normale or 
the Ecole Polytechnique. Nor was he on the list of 48 scholars whom the government 
selected (1795) to form the nucleus of the new Institut National des Sciences et des Arts 
as Lagrange and Laplace were. His colleagues redressed this oversight by electing him 
a resident member of the mathematics section. During this period of turmoil in France, 
Legendre was, however, made a member of several public commissions. He served, in 
particular, as one of the three commissioners who were to oversee the triangulations 
necessary for determining the standard meter. (This new unit of measure was originally 
intended to represent the 10-millionth part of the distance from the North Pole to the 
equator, calculated from the measured length of the meridian arc between Dunkirk and 
Barcelona.) He later was appointed to succeed Laplace as examiner in mathematics of 
the graduates of the Ecole Polytechnique destined for the artillery. It is said that Leg- 
endre was not so well regarded by his countrymen as Lagrange and Laplace were, for, 
according to Lao Simons, “He sought recognition which apparently came unsought to 
both Lagrange and Laplace.” Whatever the faults of his personality, Legendre was a man 
of integrity who always spoke his mind, even when his own interests were adversely af- 
fected. Refusing to vote in favor of the government’s candidate for the Institut National, 
he was deprived (1824) of his pension and died in poverty. 

Legendre’s achievements in mathematics, although they were considerable, did not 
approach the achievements of his two compeers, Lagrange and Laplace. That his work 
was nonetheless held in high regard is indicated by a comment of the secretary of the 
French Academy, Jean-Baptiste Elie de Beaumont, who wrote, “Laplace...has earned 
a right to be styled the Newton of France; Legendre, more profound than popular, was 
our Euler.” Like Euler, Legendre had interests covering the breadth of mathematics. He 
seems to have been most pleased with his research in number theory, celestial mechanics, 
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and the theory of elliptic functions, because although he took up other problems in the 
course of his lifetime, he always returned to these. 

In 1785, Legendre read a memoir to the Académie with the title Recherches d’analyse 
indéterminé. This contained the celebrated quadratic reciprocity law, which is concerned 
with the solvability of the pair of quadratic congruences x? = q (mod p) and x? = 
p (mod q), where p and q are distinct odd primes. The following holds: 


Either both congruences x? = g (mod p) and x? = p (mod q) are solvable or both unsolv- 
able; except in the case in which p and gq are each of the form 4 + 3, in which event one 
of the congruences is solvable whereas the other is not. 


Legendre tried a long and imperfect demonstration of this, but slipped in assuming an 
obvious theorem that was as dif cult to prove as the law itself. Undaunted, he tried 
another proof in his Essai sur la théorie des nombres (1798). This too contained a 
gap, because Legendre took for granted that there were an in nite number of primes 
in certain arithmetic progressions. At 18, Gauss (in 1795) had independently discovered 
this reciprocity law, and after a year’s unremitting labor, he obtained the rst complete 
proof. “It tortured me,” said Gauss, “for a whole year and eluded my most strenuous 
efforts before, nally, I got the proof explained in the fourth section of the Disquisitiones 
Arithmeticae.” In the Disquisitiones Arithmeticae (published in 1801, though nished in 
1798), Gauss attributed the quadratic reciprocity law to himself. Two pages after its 
proof, he vaguely alluded to Legendre’s contribution, saying, “Legendre in his excellent 
tract in Mem. Acad. des Sci. 1785 arrived at a theorem which is basically the same as the 
fundamental theorem.” If Legendre was nettled by this passing reference, it would not 
be the last time that he would feel that Gauss had given him insuf cient recognition. His 
Essai sur la théorie des nombres, which represented the rst “modern” treatise devoted 
exclusively to number theory, passed through two editions, one appearing in 1798 and 
the other in 1808. In the second edition, Legendre adopted the proof of the quadratic 
reciprocity law given by the young Gauss. Legendre’s Essai was later expanded into 
his Théorie des nombres. The two-volume third edition (1830), together with Gauss’s 
Disquisitiones Arithmeticae, became the dominant work on the subject for the remainder 
of the century. 

Although the primes occur among the positive integers in a most irregular manner, 
there are ways in which their overall distribution seems quite regular. By inspecting tables 
of prime numbers, both Legendre and Gauss—the latter still in his early teens—hoped 
to nd a simple function whose values approximated those of (x), the function which 
gives the number of primes less than or equal to x. Legendre, in his Essai sur la théorie 
des nombres, stated that the function 

x 


log x — 1.08366 
agrees very well with z(x) as long as x is not greater than one million. Although Gauss 
discovered that the integral 
“ dt 
2 logt 
produces a better approximation than Legendre’s guess, he never published the results 


of his investigation. The rst major progress concerning z(x) is attributed to the Rus- 
sian mathematician P. L. Chebyshev (1821-1894). Around 1850, he showed that the 
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inequalities 
x x 
(0.92) —— < n(x) < (1.11) —— 
log x log x 


are valid for all x suf ciently large; and if the limit 
LACS) 
lim. ——— 
x>0 x/logx 
exists, then its value must be 1. However, the existence of the limit turned out to be very 
dif cult to prove. It was nally established in 1896, a hundred years after Legendre’s and 
Gauss’s conjectures, by means of new and powerful methods primarily from analysis. 
This celebrated achievement has since become known as the prime number theorem. 
The usual procedure in astronomy is for different observers, using different instru- 
ments, to make numerous observations at a variety of locations. Because the results of 
these observations are subject to errors arising from the reactions of the observers or 
the precision of their instruments, determining the most probable value of an observed 
quantity becomes a question. The method of least squares is so called because the sum 
of the squares of the differences between the observed values and the true value of the 
observed quantity must be minimized. This method was discovered independently, and 
almost simultaneously, by Legendre and Gauss. Gauss had been using the basic idea 
since 1794 or 1795, when he was a student at Caroline College in Brunswick preparing 
for his university studies. But the rst explicit account of the method was published by 
Legendre in 1805 in his paper Nouvelles méthodes pour la determination des orbites des 
cométes. He gave it the name méthode des moindres quarrés, showed that the rule of the 
arithmetic mean is a particular case of the general principle, and presented examples of 
its application to the determination of the orbit of a comet. He offered no probabilistic 
justi cation, however, that the results thus obtained were the “best” or the most plausible 
results the observations were capable of affording. It remained for Gauss in his famous 
astronomical text Theoria Motus Corporum Coelestium (Theory of the Motion of the 
Heavenly Bodies) of 1809 to derive the law of probability of error that is the basis for 
the method of least squares. This was the work in which Gauss explained his solution of 
one of the most interesting and dif cult problems in astronomy, nding the orbit of the 
lost planet Ceres. At one point he stated, “Our principle, which we have use of since the 
year 1795, has lately been published by Legendre.” Legendre was disturbed by Gauss’s 
use of the phrase “our principle” and wrote to him in censure: 


There is no discovery that one cannot claim for himself by saying that one had found the 
same thing some years previously; but if one does not supply the evidence by citing the 
place where one has published it, this assertion becomes pointless and serves only to do a 
disservice to the true author of the discovery. 


It was not that Gauss treated casually what others did, only that he considered a result 
to be his if he gave the rst rigorous demonstration of it. Legendre, on the other hand, 
felt that publication established priority, even if an argument that was merely plausible 
took the place of a complete proof. All discussion of priority rights between the two was 
futile. Because each clung to the correctness of his position, neither took heed of the 
other. Gauss could only express regret about the competition between them. Writing to 
Heinrich Olbers, his associate and a noted astronomer, Gauss was led to remark: 
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It seems to be my fate to compete with Legendre in almost all my theoretical works. So it 
is in the higher arithmetic, in the researches on transcendental functions connected with the 
recti cation of the ellipse, in the fundamentals of geometry, and now here again. 


A large portion of Legendre’s research was devoted to elliptic functions. (The name 
elliptic is apt because these functions exist in the expression for nding the length of an 
arc of an ellipse.) A systematic account of the theory of elliptic functions is contained 
in Legendre’s three-volume Exercises du calcul intégral (1811, 1817, 1819), a great and 
rich textbook on the integral calculus that rivaled Euler’s /nstitutiones Calculi Integralis 
in its comprehensiveness. The third volume includes long tables of elliptic integrals 
calculated by Legendre himself at immense labor. This material was further developed in 
another three-volume treatise, Traité des fonctions elliptiques (1825, 1826, 1830), the most 
signi cant of Legendre’s works in higher mathematics. The third volume, which although 
composed in 1830 did not appear until a few weeks before his death, presents an account 
of Abel’s contemporaneous research and also Jacobi’s in the same area. Notwithstanding 
the disparity in their ages, Legendre had developed an active correspondence with Carl 
Gustav Jacobi (1804-1851) about elliptic functions. On hearing that many of Jacobi’s 
results had been obtained independently by Gauss but never published, Legendre wrote 
(1827) to the young man to share his indignation: 


How can M. Gauss have dared to tell you that the greater part of your theorems on elliptic 
functions were known to him and that he discovered them as early as 1808?... This extreme 
impertinence is incredible on the part of a man who has suf cient personal merit to have no 
need of appropriating the discoveries of others....But this is the same man who, in 1801, 
wished to attribute to himself the discovery of the law of quadratic reciprocity published in 
1785 and who wanted to appropriate in 1809 the method of least squares published in 1805. 


Apparently, the passage of time did not soften Legendre’s sense of bitterness over having 
to share his honors with Gauss. 


Legendre’s Eléments de géométrie 


The work of Legendre that mainly concerns us is his E/éments de géométrie (1795). 
As the teaching of geometry began to be stressed in the eighteenth-century universities, 
many geometry textbooks were published, some introducing novel ideas. Legendre’s 
Eléments de géométrie presents an interesting twofold contrast when compared with 
contemporary English and French geometries. In the universities of Great Britain, Euclid 
met little competition, being practically the only geometry textbook used. To English 
authors of versions of the Elements, the order of the propositions in Euclid was absolutely 
essential to rigorous demonstration. Rather than depart from the original, they spent their 
efforts purging the text of faults that had crept in over time. The French, on the other 
hand, maintained a critical attitude toward Euclid as a textbook for beginners. Many 
elementary geometries appeared in France in the 1700s in which rigor and formalism 
were sacriced for the sake of a new sequence of propositions better tted to the needs 
of mathematical novices. These endeavors to make geometry palatable were frequently 
criticized for lightening work for the examiner as well as for the student. In a move away 
from the loose, intuitive presentations, Legendre’s Eléments de géométrie undertook to 
revive a taste for rigorous demonstration in France. Legendre simpli ed and rearranged 
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many of the proofs of Euclid’s Elements, but on the whole approached the severity of 
the ancient treatment more closely than his predecessors did. The clarity of Legendre’s 
exposition and the attractiveness of his style made the E/éments de géométrie one of the 
most successful textbooks ever written. Twenty editions of this book, comprising some 
100,000 copies, appeared in France alone before the author’s death in 1833. Perhaps 
the only complaint was that Legendre, prepossessed by the methods of Euclid, relied 
excessively on reductio ad absurdum, a method that “convinces but does not satisfy 
the mind.” 

It was natural that shortly after the War of Independence began there should be a 
distinct lessening of British in uence and a corresponding increase in French in uence 
on the scientic life of those colonies that were to become the United States. When 
the American Academy of Arts and Sciences was formed in Boston in 1780 at the 
suggestion of John Adams, its founders publicly stated that it was their intention “to give 
it the air of France rather than that of England and to follow the Royal Academy rather 
than the Royal Society.” Little wonder that in mathematics, as in the physical sciences, 
translations of French textbooks came to dominate teaching in the colleges of the United 
States. Legendre’s name became known to a great army of students, young people who 
had never heard of Lagrange or Laplace, when the Eléments de géométrie completely 
supplanted Euclid in American schools. 

We shall not speak here at any great length about geometry in the educational scheme 
of the United States, but a few words might prove helpful. Harvard, which was the only 
North American college in the 1600s, was founded in 1636. In its early days, geometry 
played a minor role in Harvard’s curriculum, taught only one day a week in the last year 
of college. As soon as the subject began to receive serious attention, Euclid’s Elements 
became the accepted text. The earliest known use of Euclid on the college level was 
at Yale (opened in 1701) in 1733, and then at Harvard in 1737. In the latter part of 
the 1700s, when the average age of freshmen in the American colleges was gradually 
increasing, geometry constantly crept downward until it was nally taught as a_rst-year 


576 


THEOREM 


Chapter 11 Nineteenth-Century Contributions: Lobachevsky to Hilbert 


subject. According to a graduate of the Harvard class of 1798, “The sophomore year 
gave us Euclid to measure our strength.” By 1818, geometry had been placed in the rst 
year at Harvard, and in 1844, it was made a mathematical requirement for entrance to 
the college, though only the preliminary notions were required. 

The rst translation into English of Legendre’s Eléments de géométrie was made 
in 1819 by John Farrar, a professor of mathematics at Harvard from 1807 to 1836; it 
appeared in a total of 10 editions, the last in 1841. The next translation of Legendre 
was brought out in 1822 by the famous Scottish essayist and historian Thomas Carlyle, 
who in his early life was a teacher of mathematics. Carlyle’s translation—with Charles 
Davies, a professor at West Point, as editor—was the most popular textbook in the United 
States during the nineteenth century. It ran through about 30 American editions, the last 
not appearing until 1890. Indeed, “Davies-Legendre” was adopted at Yale as late as 
1885, when Euclid was nally discarded as a text. American teachers of the rst half 
of the nineteenth century seemed willing to turn to the elementary textbooks of France, 
not only for geometrical instruction, but for a variety of subjects. The French in uence 
was so powerfully felt that a writer to an early periodical devoted to mathematics, The 
Mathematical Diary, was led to complain: 


Our elementary works on the pure and physico-mathematical sciences have savored too 
generally of a foreign mint. It would certainly be more creditable to us as a nation, and 
becoming us as an independent people, to rely less on bald translations and compilations of 
and from transatlantic publications, and more upon our own exertions. 


But not until William Chauvenet prepared his Treatise on Elementary Geometry in 1870 
was there any successful American rival to Legendre. 

Legendre’s investigations of the provability of the parallel postulate extended over 
40 years and appeared mostly in appendixes of successive editions of the E/éments 
de géométrie. All his attempts to derive the postulate from the other Euclidean axioms 
were de cient in that each one rested on some hypothesis that was logically equivalent to 
the desired statement. One unsuccessful attack was impaired by the assumption that there 
existed similar triangles of different sizes, another by the assumption of the existence of a 
circle passing through any three noncollinear points. Legendre never gave up trying, and 
ina nal monograph, Réflexions sur différentes maniéres de démontrer de la théorie de 
paralléles, which appeared in the year of his death, he brought together some half-dozen 
alleged proofs of the immortal postulate. 

Legendre, much like Saccheri and Lambert before him, approached the thorny ques- 
tion of the parallel axiom from the side of the angle sum of a triangle. He correctly 
perceived that Euclid’s fth postulate was equivalent to the theorem that the angle sum 
of a triangle was equal to two right angles. Here he argued as follows: 


If the sum of the angles in a triangle is equal to two right angles, then Euclid’s parallel 
postulate holds. 


Proof. We need only show how to obtain Playfair’s axiom from the hypothesis. 
Starting with a point P not on a given line /, construct a parallel to / through P by 
rst dropping a perpendicular PQ to / and then taking /’ perpendicular to PQ at P. 
As a step to establishing that /' is the only parallel to / through P, choose points 
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Oo = O, O1, Qo,..., On, ... on /, all on the same side of PQ, such that 
QoQ = OP, 0:02. = OP, 
0303 = O3P,..., OnOnvi = OnP <>. 


This produces a sequence of isosceles triangles, 


AQo PQ), AQ; PQ2, AQ2PQ3,..., AOnPOn41,.--- 


Q= Q% Q, Q, Q; 


To simplify notation, set a, = ZO,-;PO, = ZQn-1OnP forn = 1,2,3,.... 
Because ZO,_;Q,P and ZPO, Q,+; are supplementary angles for all n, we have 


On +ZPO,Onv1 = 180°, n=1,2,3,.... 


The assumption that the angle sum of all triangles is 180° also tells us, when it is 
applied to triangle O,PQ,,,1, that 


LPO, One 2S 80, 1, 279) 5s 
When we tie together these last two equalities, we see this situation: 
Q, = 180° — LPO, Ony) = 2041, i — ie ee oe Pee 
whence 
a1 a ay An Oy 


a=, 1 a aa ey = 5g 1? 


Now let /” be any line through the point P, and distinct from /’. If a is the 
angle between /” and /’, then for n suf ciently large, a; /2” can be made less than a; 
that is, @,41 <a. This means that /" will lie in the interior of ZOPQ,,, hence will 
intersect the side QO, of triangle POQ,. Because /” must cut line /, /’ is the only 
line through P parallel to 7, which gives us the conclusion we wanted. 


Keeping this last theorem in view, Legendre tried to show that if one assumed the 
angle sum of a triangle to be either greater that 180° or less than 180°, then a contradiction 
would be reached. Therefore, the sum of the angles of a triangle would necessarily equal 
180°, from which the truth of the parallel postulate would follow. Legendre’s “proof” 
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was widely circulated because of its appearance in the twelfth and successive editions 
of the Eléments de géométrie. It began with Legendre’s replacing a triangle ABC with 
another triangle in which one angle was at most half as large as A. (The result is more 
profound than it at rst seems, for in the absence of the parallel postulate there is no 
reason to think that all triangles have the same angle sum.) Because the forthcoming 
lemma has something of the format of Euclid’s Proposition 16, it is not surprising that 
Legendre, like all his predecessors, took for granted that straight lines were in nite. This 
quiet assumption did not bother people at the time, but ultimately became as vital as the 
parallel postulate itself. 


Let ZA be any angle of a triangle ABC. Then there exists a triangle A,B,C, with the 
same angle sum as triangle ABC, and such that LA, < 5LA. 


Proof. Let D be the midpoint of side BC; extend AD to a point E in such a manner 
that AD = DE. Because BD = CD, AD = DE, and BDA = CDE (vertical angles 
are equal by Euclid’s Proposition 15 of Book JI), the triangles BDA and CDE are 

congruent from the side-angle-side proposition. Following the notation of the angles 
in the diagram, the implication is that 23 = 26 and 24 = Z5. Hence, by substitution, 


A+(224+234+4=14+2+ 4644. 


A C 


Thus the triangle ACE had the same angle sum as triangle ABC. To complete the 
argument, note that 22 + 46 = 22 + 43 = ZA. This means that either 22 or 23 must 
be less than or equal to SLA, for if both were greater than LA, then their sum 
would be greater than /A. If £2 < LA, relabel A as A, and the other two vertices 
of triangle ACE as B, and C,; for the case in which Z3 < 3LA, take E as A,. Then 
triangle A,B,C, satis es the requirements of the lemma. 


es 


With the aid of this lemma, Legendre had little trouble in disposing of the assumption 
that the sum of the angles of a triangle could be greater than two right angles. The 
resulting theorem is usually, but mistakenly, called Legendre’s rst theorem—amistakenly, 
because Saccheri had already established the theorem almost a century earlier when he 
showed that his hypothesis of the obtuse angle could not hold. 


The angle sum of a triangle is always less than or equal to 180°. 


Proof. Assume to the contrary that there exists a triangle ABC whose angle sum is 
180° + a, where @ is a positive number of degrees. By a direct appeal to the 
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preceding lemma, it is possible to nd a triangle 4;B,C; with the same angle sum 
as triangle ABC, namely, 180° + a, in which ZA, < $LA, Applying the lemma to 
triangle A; B,C,, we obtain a third triangle 4,B,C,, whose angle sum is also 
180° + @ and one of whose angles, A>, is such that 

LA < Ly, 
2 
It is obvious that this procedure leads by mathematical induction to a sequence of 
triangles, 


1 
Ai < 5 4A). 


AA, Bi C,, AA BoCz, AA3B3C3,..., 


each with angle sum 180° + @ and each containing an angle, 2A,, for which 
1 
LA, < —(LA). 
< (4) 


By taking suf ciently large, (1/2”)(ZA) can be made less than a, whence /A, is 
less than a. When this is done, one sees that 


180° +a = ZA, +2ZB,+ LC, <a+ZB,+LCy, 
which entails that 
180° < ZB, + ZC). 


But this violates the fact that the sum of any two angles of a triangle is less than 
two right angles, or 180° (Proposition 17 of Euclid I). Having obtained the required 
contradiction, one can conclude that the angle sum of a triangle is not greater than 
180°, and so all is proved. 


| 


Having correctly established that the sum of the angles in a triangle cannot be more 
than two right angles, Legendre next tried to rule out the possibility that this sum is less 
than two right angles. In the proof we are quoting here, he reasoned as follows: Assume 
that the angle sum of triangle ABC is less than 180°, say, 180° — a, where a@ is a positive 
number of degrees, and that ZA is the smallest angle of the triangle. On side BC, construct 
a triangle BCD congruent with triangle ABC, by drawing ZCBD equal to ACB and taking 
BD = AC. Through D draw a line that meets the extensions of sides AB and AC at points 
E and F, respectively, thereby forming new triangles BED and CDF. Triangles ABC 
and BCD, being congruent, must have the same angle sum, to wit, 180° — a: 


£1+ 242+ £3 = 180° — a, 24+ 25+ 26 = 180° —a. 


E 
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From Legendre’s rst theorem, it is also known that the angle sum of triangles BED and 
CDF cannot be greater than 180°: 


7+ 28+ 29 < 180°, 410+ Z11+ 212 < 180°. 


Thus, the sum of all the angles of the four triangles cannot exceed 4(180°) — 2a. Of 
these angles, each of the groups with vertices at B, C, and D adds up to 180°: 


£3+ 244+ 28 = 464+ 274 £10 = 224+ 25+ 212 = 180°, 
from which it is evident that 
41+ 29+ 211 + 3(180°) < 4(180°) — 2a, 
or 
Z1+ 29+ Z11 < 180° — 2a. 


Whereas the original triangle ABC had an angle sum of not more than 180° — a, that of 
the triangle AEF is not more than 180° — 2a. An application of the same construction to 
triangle AEF yields another triangle having an angle sum not greater than 180° — 27a. 
If the argument is repeated n times, a triangle is produced whose angle sum does not 
exceed 180° — 2”a. For n suf ciently large, 2”a@ would be greater than 180°. Thus the 
process leads to a triangle whose angles have a negative sum—an absurd situation. The 
proof having come to a contradiction (said Legendre), the angle sum of a triangle cannot 
be less than 180°. 

Legendre concluded that because the sum of the angles of a triangle could neither 
be greater nor less than 180°, it would have to equal 180°; if this equality were granted, 
the parallel postulate would follow. Unfortunately, there is a defect in the proof he 
proposed—the angle sum of a triangle cannot be less than 180°. To construct his sequence 
of triangles, he assumed that through any point in the interior of an angle it is always 
possible to draw a line that meets both sides of the angle. Although it is not immediately 
apparent, this assumption turns out to be another equivalent of the parallel postulate. 
Legendre’s demonstration failed, although he himself never saw through the matter and 
thought that he had nally settled the question. 

One can hardly blame Legendre for being convinced that he had cleared up the 
uncertainty surrounding the parallel postulate. Good and bad mathematicians alike had 
fallen on this slippery ground. In fact, so lengthy and so persistent were their efforts that 
in 1767 d’Alembert called the state of the theory of parallels “the scandal of elementary 
geometry.” The great analyst Lagrange, according to a story told by Augustus De Morgan 
in the Budget of Paradoxes, presented a paper on parallel lines to the French Academy, 
but broke off his reading part way through with the exclamation, “I must meditate further 
on this.” With this he put the paper in his pocket and never afterward spoke of it publicly. 

Through all of this it must be remembered that for centuries Euclidean geometry had 
been the most rmly established branch of mathematics, reputedly the most complete, and 
one whose authority was derived mainly from the clarity of its axioms. Euclid’s choice 
of these “self-evident truths” had been dictated by his intuitive observation of the world 
about him. But because the fth postulate involved an in nite concept and therefore could 
not be veried experimentally, it was seen as a shocking aw in geometry. A recurring 
challenge to geometers was to discover whether this postulate could be deduced from the 
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others. If it could, this would transform it into a theorem, not an axiom, of the system and 
put it beyond question. Yet by the beginning of the 1800s, the problem of the necessity 
of the parallel postulate was not in much better shape than in antiquity. Twenty centuries 
of fruitless effort, and particularly the latest unsuccessful investigations of Legendre, 
did strike a spark of doubt in the minds of some mathematicians. The conviction had 
begun to grow that a proof of the parallel postulate was not possible within Euclid’s 


own system. 


11.1 Problems 


1. Find the aw in the following attempted proof of the 


parallel postulate by Wolfgang Bolyai (1775-1856). 
Given any point P not on a line /, construct one 
parallel /’ to 7 through P in the usual way, by 
dropping a perpendicular PO to / and erecting /’ 
perpendicular to PQ. Let /” be any line through P 
distinct from /’. To see that /” meets /, pick a point A 
on PQ between P and Q. Extend PQ beyond Q toa 
point B so that AO = OB. Now let R be the foot of 
the perpendicular from A to 7” and extend AR 
beyond R to a point C so that AR = RC. Then 4, B, 
and C are not collinear; hence there exists a unique 
circle passing through them. Because / is the 
perpendicular bisector of the chord AB of this circle 
and /” is the perpendicular bisector of the chord AC, 
/ and /” must intersect in the center of the circle. 


. Find the aw in the following attempted proof of the 
parallel postulate by A. M. Legendre (1752-1833). 
Start with a point P not on a line /, with PO 
perpendicular to / at QO and line /’ perpendicular to 
PQ at P; then /’ is parallel to /. Let /” be any line 
through P different from /’. If we assume that /” is 
not perpendicular to /, then it makes an acute angle 
with /. Pick any point R on /” such that ZOPR is 
acute. Construct ZOPR’, with R’ on the opposite side 


L...091fL 1.11 £049 £e6:... £01 


of PO from R, so that ZOPR’ = LOPR. Then Q is 
inside /R’PR, and because the line / passes through 
Q, it will intersect one of the sides of ZR’PR. If / 
meets the side of ZR’PR containing PR, then / meets 
1". If / meets the side of 2R’PR containing PR’, say 
at A, then choose a point B on the opposite side with 
PB = PA. Then triangles PBQ and PAQ are 
congruent by the side-angle-side proposition; hence 
ZPQB = LPQA = 90°. This puts the point B on /, so 
that /” and / meet at B. 


A Q 


. Prove the following sequence of results attributed to 


Legendre: 


(a) Ifthe angle sum of a triangle is equal to two 
right angles, then the same is true of any 
triangle obtained from it by drawing a line 
through a vertex to a point on the opposite 
side. 

(b) If there exists one triangle with angle sum 
equal to two right angles, then there exists an 
isosceles right triangle whose angle sum is two 
right angles and whose legs are longer than any 
given segment. [Hint: Within the existing 
triangle construct an isosceles right triangle 
with angle sum equal to two right angles. 
(Begin by dropping a perpendicular from a 
vertex to a side.) Now join eight such 
congruent triangles to form a square whose 
sides are twice the length of one of the 
congruent triangles’ (perpendicular) sides. Draw 
the diagonal of the square and repeat the 
process with one of the triangles you get.] 
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(c) Legendre’s second theorem: If there exists one 
triangle whose angle sum is equal to two right 
angles, then every triangle has an angle sum 
equal to two right angles. [Hint: Because any 
triangle can be divided into two right triangles, 
it suf ces to consider an arbitrary right triangle 
ABC. By part (b), there exists an isosceles 
right triangle DE F whose angle sum is two 
right angles and with sides greater than those of 
ABC. Split DEF into triangles so that one of 
them, A’B’C’, is congruent with triangle 
ABC.) 


D A E=C’ 


(d) If there exists one triangle whose angle sum is 
less than two right angles, then every triangle 
has an angle sum less than two right angles. 


In Problems 4-10 you may assume Euclid’s rst four 
postulates, and hence any result that is derived from 
them. 


. A Saccheri quadrilateral is a quadrilateral ABCD in 


which ZA and ZD are right angles and the sides AB 
and DC are equal. Side AD is known as the base, 
the opposite side BC as the summit, and ZB and ZC 
as the summit angles. Verify each of the following: 


(a) A line that passes through the midpoints of the 
base and the summit of a Saccheri quadrilateral 
is perpendicular to both of them. [Hint: First 
show that ABAE is congruent to ACDE, and 
then that ABFE is congruent to ACFE.] 

(b) Ifa line is the perpendicular bisector of the 
base of a Saccheri quadrilateral, then it is also 
the perpendicular bisector of the summit. 

(c) The lines containing the base and summit of a 
Saccheri quadrilateral are parallel. 


5. Consider a quadrilateral ABCD whose base angles 


ZA and ZB are right angles. Prove the following 
assertions: 


C 

D E 
L_ | [I 

A B 


(a) If AD < BC, then ZC < ZD. [Hint: There 
exists a point E on BC with BE = AD; apply 
the exterior angle theorem to ADEC.] 

(b) If ZC < ZD, then AD < BC. [Hint: Suppose to 
the contrary that AD = BC or AD > BC] 

(c) If ZC = ZD, then ABCD is a Saccheri 
quadrilateral. 


. Let ABCD and XYZW be Saccheri quadrilaterals 
with bases AD and XW. Show that if AD = XW and 
AB = XY, then BC = YZ, and ZB = LY, and 

1G = LZ. 


7. Fill in any missing details in the following proof that 


the base of a Saccheri quadrilateral is not longer than 
the summit. Given a Saccheri quadrilateral 

A,B, B,A2, with base A, Ap, let / be the line 
containing A; and A>. Pick distinct points 

A3, As,..., An, Angi on /, appearing in this order, so 
that Ax Aga = A, Ap fork = 2. oi oe A. Also let By 
be the point on the same side of / as B, such that 

By, Ax is perpendicular to / at A, and B,A, = Bi Ai, 


where k = 3,4,...,n + 1. In this way a sequence of 
n Saccheri quadrilaterals is set up, end to end. (We 
do not know whether the points B,, Bo,..., Br4i are 
collinear.) 

B, B, B, eee eee, B, B, +1 

A, Ay A, A, A, +1 


By Problem 6, we have B, By = B,B; =--- = 
B,-\ By = By, By+1. By the polygon inequality, it also 
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follows that 


Aj Ans < Ay B) +B, By + By B3 +--+ + By Brat 
+ Brat Anqis 


or nA, Ay < 2A4,B, +nB,B). But then 
n(A, A, — B, By) < 2A) Bi, 


for any positive integer n. If 4; A> > B, Bo, the 
displayed inequality leads to a contradiction. 


Prove these statements: 


(a) The angle sum of a right triangle is always less 
than or equal to 180°. [Hint: Given a right 
triangle ABD, with right angle at 4, construct a 
Saccheri quadrilateral ABCD having base AD. 
According to Problem 7, we nd BC > AD; 
whence in triangles BAD and DCB, ZBDC is 
greater than or equal to ZABD.] 


(b) The angle sum of an arbitrary triangle is always 
less than or equal to 180°. 


Prove that if Saccheri’s hypothesis of the acute angle 
holds, then the summit of a Saccheri quadrilateral is 
longer than its base; thus it follows by Problem 8 
that the angle sum of a triangle is less than 180°. 
[Hint: Consider a Saccheri quadrilateral ABCD, with 
base AD. Let E and F be the midpoints of the base 
and summit, respectively. If AE > BF, pick a point 
G on BF extended, so that GF = AE; then AGFE is 
a Saccheri quadrilateral, with base EF. By 
hypothesis, the summit angle EAG is acute, which 
leads to a contradiction. Thus AE < BF and 
similarly ED > FC.] 


Gc___3 F Cc 
\ 


A E D 


Prove that if Saccheri’s hypothesis of the acute angle 
holds, then the segment joining the midpoints of the 
base and summit of a Saccheri quadrilateral is shorter 
than each side; hence, there exist parallel lines that 
are not equidistant from one another. [Hint: See 
Problems 4 and 5.] 
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11. Explain the fallacy in the following “proof” that all 


triangles are isosceles. Given a triangle ABC, the 
angle bisector from vertex A will meet the 
perpendicular bisector of the side BC at a point P, 
which is either inside, on, or outside triangle ABC. 
From P drop perpendiculars to AB and AC, meeting 
these sides, or their extensions, at D and E, 
respectively. Several diagrams are shown herewith 
and the discussion applies equally to each. 


A 
ADK 
B P=M Cc. 

A 
B Cc 


iP 


The right triangles APD and APE have the common 
hypotenuse AP, and the sides PD and PE are equal. 
(Points on the bisector of an angle are equidistant 
from the sides of the angle.) These triangles are 
therefore congruent, whence, as corresponding sides, 
AD and AE are equal. Also, the right triangles PMB 
and PMC are congruent by the side-angle-side 
theorem. Because PD = PE and PB = PC, it follows 
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that the right triangles PDB and PEC are congruent, in the fourth diagram, lengths are subtracted to give 
and so DB = EC. In the rst three diagrams, lengths AB = AD — DB = AE — EC = AC. In either event, 
are added to give AB = AD+ DB = AE+ EC = AC; AB = AC, and the triangle ABC is isosceles. 


By the beginning of the 1800s, the question whether 


Vz The Founders of Non-Euclidean Euclid’s parallel postulate could be demonstrated as 


Geometry 


a logical consequence of the others remained unre- 
solved. At some point, mathematicians were bound 
Gauss’s Attempt at to realize that the unending record of failure in the 
search for a proof of the troublesome postulate was 
not due to any lack of ingenuity on their part, but 
rather to the actual independence of the fth postu- 
late from the other axioms. This meant that it would be possible, by substituting a contrary 
axiom about parallels, to develop an equally valid companion geometry to Euclid’s. When 
this idea nally dawned, it came not to one but to three mathematicians, more or less 
simultaneously and in widely separated parts of Europe. Cases of simultaneity or near 
simultaneity of discovery had happened before in the history of mathematics—as in the 
seventeenth century with the invention of calculus by Newton in England and Leibniz in 
Germany. And it would happen again. When the spirit of the time is ripe for a new result 
to come forth, it will not long be delayed. Thus, it was in the rst third of the nineteenth 
century that Gauss in Germany, Bolyai in Hungary, and Lobachevsky in Russia were 
discriminating enough to reject a postulate that for 2000 years had been the main corner- 
stone of geometry. In Einstein’s striking phrase, they “challenged an axiom.” Although 
each was unaware of the thoughts of the other two, each conceived the notion of replac- 
ing the parallel postulate with the counterintuitive axiom that “through a point not on a 
line there exists more than one parallel to the line,” while retaining all the other axioms. 
This was a daring innovation in its day, for it went against the time-honored tradition 
that axioms must be self-evident truths. Through the customary rules of deduction, the 
three men developed a sequence of theorems resembling those Saccheri obtained from 
his hypothesis of the acute angle. Notwithstanding the breach of one’s intuitive picture 
of space (the angle sum of a triangle, for instance, does not add up to two right angles), 
their geometric system is free from internal contradiction. Whereas Saccheri had been 
too awed by a conviction of the absolute necessity of Euclidean geometry to recognize 
what he had done, Gauss, Bolyai, and Lobachevsky appreciated the revolutionary impli- 
cation of their discovery—the geometry of Euclid, being not the only logically consistent 
geometry, might not even be the correct system for describing the physical world. 
From the detailed publication of much of Gauss’s correspondence, we now know that 
Gauss was the rst to reach any advanced conclusions concerning a consistent geometry 
different from that of Euclid. Because he failed to make public his thoughts on the 
subject, the honor for discovering non-Euclidean geometry is usually divided between 
John Bolyai and Nicolai Lobachevsky. We can at least credit Gauss with having realized 
that along the path Bolyai and Lobachevsky traveled, complete success was bound to 
be achieved. Scholars have long remarked that Gauss’s publications are not an adequate 
re ection of his full greatness, for he published relatively little of his work—only about 
half of the innovative ideas with which he is credited. In his writings, Gauss was as exact 
and elegant as Lagrange, but more dif cult to follow than even Laplace. Not only did 
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he remove every trace of the reasoning by which he reached his conclusions, but also 
he made a concerted effort to give proofs that although strictly logical were as concise 
as possible. The result was always a_nished work of art, for until Gauss had polished 
a paper over and over, he refused to allow it to appear in print. These demands for 
excellence found expression in his unofcial coat of arms, a tree bearing only seven 
fruits. The motto beneath reads Pauca Sed Matura (Few, but ripe’’). 

Ideas came so quickly from Gauss’s fertile imagination that each one inhibited the 
development of the preceding. Because of the great length of time needed to polish a paper 
to the point of perfection on which he insisted, he developed the habit of leaving his rough 
notes in a desk drawer, often not even telling anyone about them. This reluctance to make 
known many of his original ideas provided considerable frustration for Gauss’s contem- 
poraries, who often found, laboring for years on some important new development, that 
Gauss had anticipated them. One instance is the theory of elliptic functions, about which 
Gauss discovered many of the results Abel and Jacobi subsequently published. There is a 
story that Jacobi, who was considered the “‘second-best” mathematician in Europe, came 
to Gauss to relate a new creation only to have the latter pull from his desk drawer some 
papers that contained the same discovery. In exasperation, Jacobi remarked, “It is a pity 
that you did not publish this result, since you have published so many poorer papers.” 

Gauss began to meditate about parallel lines as early as 1792. He was only 15 at the 
time, but had already told his friend Schumacher that if the Euclidean system were not the 
only “true” geometry (that is, the correct idealization of the properties of actual physical 
space), then he could work out another logically consistent geometry. Unfortunately, he 
refrained from publishing any statement of his theory, large or small, throughout his 
lifetime. What we know of Gauss’s successive ventures into the subject can be gleaned 
only from letters to interested colleagues and some notes left behind after his death. 
Through this correspondence, one can trace the development of Gauss’s thought, from 
skepticism concerning the purported proofs of the parallel postulate to con dence in the 
validity of his new geometry. 

At rst, Gauss blundered in good company by trying to deduce the truth of the 
fth postulate from the other axioms and the 28 propositions proved without it. He soon 
recognized the profound dif culties involved, and in a letter written to Wolfgang Bolyai 
in 1804, he spoke of a “group of rocks” on which his attempts had always been wrecked, 
adding: “I have still ever the hope that these rocks sometime and, indeed, before my 
death will permit a passage.” As late as 1813, there was still no light, for we read: “In 
the theory of parallels we are even now not further than Euclid. This is the shameful 
part of mathematics, which sooner or later must receive a wholly new form.” 

Over the years, Gauss cautiously and reluctantly concluded that the parallel postulate 
could not be proved on the basis of the remaining nine axioms. His nearest-to-public 
utterance on the matter was a book review (1816) of a certain tract on parallels, in 
which he spoke of “vain attempts to conceal the gulf, which one cannot conceal, with 
an unsound web of sham proof.” Although this mere hint that the parallel axiom cannot 
be proved was “besmirched with mud” by Gauss’s critics, his doubts were not dispelled. 
A letter written to Heinrich Olbers in 1817 marked a turning point in Gauss’s thinking: 


I am becoming more and more convinced that the necessary truth of our [Euclidean] geom- 
etry cannot be demonstrated, at least not by the human intellect to the human understanding. 
Perhaps in another life, we shall obtain insights into the nature of space which are now 
beyond our reach. 
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From that time onward, he seemed rmly convinced that the parallel postulate was 
independent of the other Euclidean axioms, so that it would be possible to adopt a con- 
tradictory axiom and build an entirely new geometry logically as legitimate as Euclid’s. 
In rejecting the fth postulate, he chose to assume (as Bolyai and Lobachevsky would 
after him) that through a given point there could be drawn more than one parallel to 
a given line. As Gauss developed the consequences of this assumption, he mentioned 
something of his work to a few trusted friends but concealed his investigations from the 
world at large. 

Like Newton, Gauss had an intense dislike of controversy and seemed unwilling to 
venture publicly into any area that might subject him to vulgar attack. He was certain 
that his discoveries in this alternative system of geometry would shock layman and 
mathematician alike. Indeed, his results ran so counter to the accepted view, expressed by 
Kant, that our inner consciousness allows us to imagine geometric gure with Euclidean 
properties only, that he felt by divulging them he might subject himself to ridicule. Thus, 
he expressly begged his friends to keep silent about the information he imparted to them. 
The fullest indication of Gauss’s feelings is contained in a letter written in 1824 to Franz 
Taurinus. After pointing out an error in an attempted proof of the parallel postulate by 
Taurinus, Gauss went on to say: 


The assumption that the sum of the three angles of a triangle is less than 180° leads to a 
curious geometry, quite different from our own [the Euclidean], but thoroughly consistent, 
which I have developed to my satisfaction....The theorems of this geometry appear to be 
paradoxical and, to the uninitiated, absurd; but calm, steady re ection reveals that they con- 
tain nothing impossible. ...In any case, consider this a private communication, of which no 
public use or use leading to publicity is to be made. Perhaps I shall myself, if I have at some 
future time more leisure than in my present circumstances, make public my investigations. 


Gauss continued with his development of the fundamental results of a new geometry and 
was again considering writing them up, possibly to be published after his death. He took 
up the matter in a reply (1829) to the astronomer Friedrich Bessel: 


It may take a very long time before I make public my investigations on the issue. In fact, 
it may not happen during my lifetime, since I fear the scream of the Boeotians [a gurative 
reference to the dullards, for the Boeotians were reputed to have been one of the more 
simple-minded Greek tribes] were I to completely express my views. 


It may seem strange to the modern observer that Gauss, who was recognized in his 
own lifetime as one of the greatest mathematicians, would choose to withhold publication 
of his non-Euclidean geometry lest its unpopularity damage his reputation. But in the 
early 1800s, the immense authority of the German philosopher Immanuel Kant (1724— 
1804) still dominated the intellectual world. Kant’s reputation as a thinker of the most 
abstract kind has tended to overshadow his reputation as a scientist, but until 1770 he was 
primarily attracted to the study of the natural sciences rather than philosophy. Newtonian 
physics and its astronomical rami cations especially aroused his interest. Kant had neither 
the mathematical background nor the necessary means of observation and experiment to 
be a strict scientist. He did, however, contribute to scientic knowledge to the extent of 
anticipating the nebula hypothesis of the origin of the solar system some 40 years before 
the publication of Laplace’s Systéme du monde. 

Kant’s speculations rst appeared in 1755 in a small, anonymous volume entitled 
Universal Natural History and Theory of the Heavens. (It may have been anonymous for 
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fear of the reaction of the Protestant clergy.) The main object of the book was to establish 
the existence of God by showing that the Newtonian universe was not the product of 
mere chance but instead designed to evolve in an orderly fashion. Unfortunately, the 
publisher went bankrupt just at the time the work was to appear, so that Kant failed to 
get an adequate hearing for his theory. In 1763, he tried again to put his views before the 
public, this time in a tract called The Only Possible Argument for a Demonstration of the 
Existence of God. Such studies made Kant’s name widely known in Germany, and several 
times he was offered chairs at universities. Wishing to remain in his native K6nigsberg, 
he continued in the laborious role of privatdozent (a private teacher, recompensed only 
by such fees as the students are willing to pay). As such, he gave an enormous number 
of lectures on a bewildering variety of subjects: physics, mathematics, logic, ethics, and 
anthropology. In 1764, he was offered, but wisely refused, the professorship in poetry at 
KG6nigsberg, and in 1765, he was made assistant librarian at the university. It was only 
in 1770, at the age of 46, that Kant was appointed professor of logic and metaphysics at 
KG6nigsberg, the post he held until his death in 1804. The 12 years from 1769 to 1781 
were spent in the most profound speculation of the philosophical issues he ultimately 
presented in the Critique of Pure Reason (1781). 

Kant’s problem was to reconcile his belief in the preestablished harmony of the 
universe with the confusion of data collected by experimental observation. In this, he 
succeeds quite deftly in the Critique of Pure Reason, his treatise on the theory of knowl- 
edge, a subject that he created. Kant sought to bridge the gulf between the British em- 
piricists, who held that all knowledge arises from experience, and the equally dogmatic 
Continental rationalists, who professed that all knowledge arises from unaided thought. 
He agreed that knowledge begins with external experience, so far as some sensation must 
precede and arouse the operations of thought; but, he maintained, the mind can act on 
sense impressions only because it is already endowed with “intuitions” of space and time 
that are independent of experience and that mold it. As Kant declared in the Critique of 
Pure Reason, “the concept of [Euclidean] space is by no means of empirical origin, but 
is the inevitable necessity of thought,” and as a consequence the sole spatial relations 
acceptable to the mind were those of Euclid. No other system of geometry could possibly 
exist, for no other geometry was thinkable. To dispute the idea that the parallel postulate 
was inherent in the structure of the mind itself as a divinely implanted intuition was to 
challenge Kant’s theory of knowledge, which Kant called “the Copernican revolution in 
philosophy.” 

For himself, Gauss doubted that the mind compels us to view the world in only 
one way. He refrained from public controversy with its attendant criticism, yet thought 
that determining the geometry of space was an empirical question, to be veried like 
any other physical law by actual measurement. The theorem concerning the sum of the 
angles of a triangle seemed to provide an experimental means for deciding once and for 
all which geometry, his or Euclid’s, best described the space of our physical experience. 
In the traditional geometry, the Euclidean, the angle sum of a triangle is always 180°, but 
Gauss’s new geometry predicted a result less than 180°. With this in mind, Gauss laid 
out a triangle in the neighborhood of Géttingen whose sides were 40 or so miles long 
and whose vertices were carefully surveyed points on three mountain summits. Had he 
detected a signi cant deviation from 180°, the experiment would have been conclusive. 
When the data were collected, the result was an angle sum within 2” of 180°, a difference 
that could be charged to the unavoidable errors in observation. Gauss concluded that with 
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the instruments at his disposal, the measurement of physically de ned triangles would 
not discriminate between the two geometries. 

It is remarkable how closely the trains of thought of the mathematicians involved 
in developing non-Euclidean geometry were related. Lobachevsky conducted a similar 
experiment with angle sums, the results of which were published in 1829 in his trea- 
tise On the Foundations of Geometry. The only difference was that he analyzed some 
existing astronomical measurements to test which geometry best accounted for the data. 
Lobachevsky calculated that the angle sum of the triangle determined by the earth, the 
sun, and the xed _ star Sirius differed from 180° by an amount less than 0.000004”. Al- 
though this led him to infer that “the exactitude of traditional geometry is far-reaching,” 
the matter was far from settled in favor of Euclidean geometry, because Lobachevsky 
had no way of knowing whether he was measuring a signi cantly large area of space. 
Later mathematicians were subsequently to argue that no amount of evidence could be 
obtained to threaten the claim that space was Euclidean, hence the question should not 
be seriously proposed, even for testing; in that case, the Euclidean theory was preferable 
on the grounds of elegance and simplicity. 

Sometime toward 1830, Gauss took alarm over his long delay in committing his 
non-Euclidean geometry to paper and thought that if he did not get his ideas out at once 
he might never do so. In a letter dated May 17, 1831, Gauss wrote to the astronomer H. 
C. Schumacher: 


In the past few weeks I have begun to write down some of my meditations [on the theory 
of parallels], a part of which I have never previously put in writing, so that already I have 
had to think it all through anew three or four times. But I wished that this should not perish 
with me. 


Found among the papers after his death, there was a brief synopsis of the new theory of 
parallels. Because Gauss was not resolute enough to risk coming out in the open with his 
ndings, he did not produce the revolution in mathematics that Bolyai and Lobachevsky 
were to bring about; his lack of moral courage in facing “the scream of the Boeotians” 
(that is, the followers of Kant) made it inevitable that posterity withhold a portion of the 
honor that could have been entirely his. 


The Struggle of John Bolyai 


Perhaps the reason that Gauss did not go further in recording his earlier investigations 
was that on February 14, 1832, he received a copy of the famous Appendix of John 
Bolyai. John’s father, Wolfgang Bolyai (1775—1856) had studied at Jena, then afterward 
at Gottingen, from 1796 to 1799. Gauss was among Wolfgang Bolyai’s close friends at 
G6ttingen. Bolyai, after returning to Hungary, maintained a correspondence with Gauss 
that lasted, sometimes with years-long interruptions, for the remainder of their lives. 
During their student days, the two men had frequently discussed problems related to 
the theory of parallels. In 1804 Bolyai, convinced that he had succeeded in proving 
the parallel postulate, sent a little tract on this subject, Theoria Parallelarum, to Gauss. 
The reasoning was incorrect, and Gauss in replying tactfully pointed out the error of 
assuming that through any three noncollinear points there passes a circle. Bolyai mailed 
a supplementary paper to Gauss four years later, but when there was no answer, he 
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became discouraged with his “theory of the Gdttingen parallels” and turned to other 
matters. His ideas on elementary mathematics were collected in a large two-volume 
Tentamen Juventutem Studiosam in Elementa Matheseos Purae (An Attempt to Introduce 
Studious Youth to the Elements of Pure Mathematics), published in 1832-1833 although 
it bears the date 1829. It contained an Appendix composed by his son, John. 

The chief claim to fame of Wolfgang Bolyai must doubtless be that he was the 
father of John Bolyai (1802-1860). The father gave the young boy early instruction 
in mathematics and his progress was so quick that when Wolfgang was sick on one 
occasion, he did not hesitate to send his 13-year-old son to teach the college classes 
in his stead. The elder Bolyai hoped that John would go to Géttingen and study with 
his old friend Gauss; in 1816, Bolyai wrote to ask whether Gauss would take John into 
his household as an apprentice mathematician and to inquire what the youngster should 
study in the meantime. Gauss never answered this letter and never wrote again for 16 
years. Instead, John entered the Imperial Engineering Academy in Vienna in 1817, to 
receive a military education. He nished his courses at the academy in 1823, and as 
a sublieutenant, entered on an army career. During the next 10 years, he built up a 
reputation as a profound mathematician, an impassioned violin player, and an expert 
fencer who dueled regularly. (He once accepted the challenge of 13 cavalry of cers 
on the condition that he be allowed to play a violin piece after every two duels; he 
vanquished all 13.) Plagued with intermittent fever, Bolyai was pensioned off by the 
service in 1833 as a semi-invalid. Under Gauss’s care and direction, the younger Bolyai 
might have enriched the great learned journals with discovery after discovery. As it was, 
the sum of his publications was two dozen pages. 

John Bolyai had from his father some of the inspiration to original research that 
the latter had received from Gauss. While still at the academy, John was giving his 
attention to the parallel postulate, though rst on the side of proving it. Recalling his 
own unsuccessful efforts, Wolfgang urged the boy not to become preoccupied with the 
theory of parallels: 


Do not waste one hour’s time on that problem. It does not lead to any result; instead it 
will come to poison all your life. ...1 believe that I myself have investigated all conceivable 
ideas in this connection. 


His father’s advice was founded on bitter experience, but far from being dissuaded, John 
was moved to greater efforts. 

After several vain attempts to prove the parallel postulate, the young man began to 
think of it as an independent assertion and succeeded in constructing an entirely new 
geometry based on the denial of this axiom. He did not know that the same idea had 
already occurred to Gauss and to Lobachevsky. His sense of joy and triumph at solving 
what had bafed the world for 2000 years was expressed in a letter to his father in 1823: 


I am resolved to publish a work on parallels as soon as I can complete and arrange the 
material, and the opportunity arises. At the moment I still do not clearly see my way through, 
but the path which I have followed is almost certain to lead me to my goal, provided it is 
at all possible. I have not quite reached it, but I have discovered things so wonderful that I 
was astounded and it would be an everlasting pity if these things were lost. When you, my 
dear father, see them, you will understand. All I can say at present is that out of nothing 
I have created a strange new world. All that I have sent you previously is like a house of 
cards in comparison with a tower. 
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The expansion and arrangement of the ideas proceeded more slowly than John had 
expected, and not until 1825 did he commit to writing a treatise on the subject. His 
father suggested that his be translated into Latin and issued as an appendix to the planned 
Tentamen; moreover, he urged that this be done without delay: 


It seems to me advisable, if you have actually succeeded in obtaining a solution of the 
problem, that, for a two-fold reason its publication be hastened; rst , because ideas easily 
pass from one to another who, in that case, can publish them; secondly, because it seems to 
be true that many things have, as it were, an epoch in which they are discovered in several 
places simultaneously, just as the violets appear on all sides in the springtime. 


Wolfgang Bolyai’s vivid presentiment was ful Iled by the coincident discovery of non- 
Euclidean geometry by his son, and by Gauss and Lobachevsky. John Bolyai’s treatise 
was printed separately under the title Appendix Scientiam Spatii Absolute Veram Exhibens 
(Appendix Explaining the Absolutely True Science of Space) and appeared before the 
volume of the Zentamen with which it was afterward bound up. Excluding the title page 
and one that gave an index of notation, the work itself contained only 24 pages—which 
have been described by George B. Halsted (1853-1922) as “the most extraordinary two 
dozen pages in the whole history of thought.” 

Knowing Gauss’s interest in the subject of non-Euclidean geometry, Wolfgang sent 
an advance copy of his son’s work to Gauss in June 1831. The elder Bolyai no doubt felt 
that his belief in his son had been vindicated, and John must have expected that public 
praise from Gauss would make him famous. The copy did not reach its destination, 
and a second copy of the Appendix was mailed in January 1832. Gauss’s reaction was 
typical—sincere approval, but lack of support in print. In February, Gauss wrote in great 
enthusiasm to his former student C. L. Gerling, who was professor of astronomy at 
Marberg, saying: 


Let me add further that I have this day received from Hungary a little work on non-Euclidean 
geometry in which I nd all my own ideas and results developed with great elegance, 
although in a form so concise as to offer great dif culty to anyone not familiar with the 
subject....I regard this young geometer Bolyai as a genius of rst order. 


Then in March, he replied to his “old, unforgettable friend” Wolfgang Bolyai. It is easy 
to see how this well-intentioned letter had a devastating effect on the young Bolyai: 


If I begin by saying that I dare not praise this work, you will of course be surprised for 
a moment; but I cannot do otherwise. To praise it would amount to praising myself. For 
the entire content of the work, the approach which your son has taken, and the results to 
which he is led, coincide almost exactly with my own meditations which have occupied my 
mind for the past thirty or thirty- ve years....It was my plan to put it all down on paper 
eventually, so that at least it would not perish with me. So I am greatly surprised to be 
spared this effort, and am overjoyed that it happens to be the son of my old friend who 
outstrips me in such a remarkable way. 


Wolfgang may have been pleased because he understood that his son was being praised, 
but John was sadly distressed by the great mathematician’s reply. He even imagined that 
before the Appendix was published, his father had secretly con ded some of his results 
to Gauss, who was now trying to claim them as his own. John eventually set aside these 
suspicions, but like many others who found that Gauss had anticipated them, he still 
felt cheated of his earned honors. Certainly Gauss, who was quite sensitive to all forms 
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of suffering, had no desire to hurt him; but Bolyai’s mental depression deepened when 
the Appendix met with complete indifference from other mathematicians, and for long 
periods he did almost no creative work. The unhappy fact remains that he never published 
anything further, although he did leave behind some thousand pages of manuscript. The 
nal disappointment came in 1848, when Lobachevsky’s Geometrische Untersuchungen 
zur Theorie der Parallellinien (1840) reached him through his father, and he learned 
that the young Russian had obtained priority by publishing the radically new geometry 
in 1829. 

It was long after Bolyai’s death that recognition as one of the founders of non- 
Euclidean geometry nally came to him. The view that Euclidean geometry was the only 
possible geometry was so rmly rooted that his Appendix was practically forgotten for 
35 years. In fact, the Hungarian version of volume | of the Zentamen, which was pub- 
lished in 1834, did not contain the Appendix at all. It was rescued from oblivion when 
Richard Baltzer drew public attention to the work of Bolyai and Lobachevsky in his Ele- 
mente der Mathematik (1867). Precipitated by Baltzer, translations of the Appendix into 
French (1868), Italian (1868), German (1872), and English (1891) made the masterpiece 
accessible to the international scientic community. 

Although we shall not make any lengthy analysis of the cultural milieu in which 
Lobachevsky worked, we should say a few words about the institutional setting of 
mathematics in Russia in the early 1800s. Russian scientic and mathematical contact 
with the outside world was always scanty and intermittent. Toward the end of the 1700s, 
an intense suspicion of Western ideas had led to a perilous state of decline in science. The 
display piece of Russia’s contribution to modern scientic thought, the St. Petersburg 
Academy, had lapsed into inactivity, its staff reduced to 14 full members. Censorship 
of the journals of the scientic bodies of the West deprived the academy even of news 
of developments abroad. Aside from the Academy, the empire’s only other center of 
scientic learning was Moscow University. 

When Alexander I became tsar in 1801, he responded to the mounting demands 
for extensive educational reform by reopening Dorpat University and founding four new 
universities: Vilna (1802), Kazan (1804), Karkov (1804), and St. Petersburg (1819). Rec- 
ognizing that scientic knowledge can be translated into economic and military strength, 
he insisted that the new schools give great weight to the various natural and mathemati- 
cal sciences in their curricula. Professors were directed to incorporate the latest scienti c 
ideas in their lectures, with the implication that they were expected to keep abreast of 
developments in their elds. They were required to hold monthly meetings devoted to 
discussions of scientic papers presented by the teaching staff. These directions were 
unduly optimistic. Because Russia never really had a system of lower schools that would 
provide the training for those willing to pursue advanced studies, the new universities 
were hard-pressed to II their classrooms. Kazan had only 40 students in 1809, and the 
more established Moscow University listed an enrollment of 135. What students were 
available were mostly drawn from the theological seminaries, and although they were 
well versed in Greek and Latin, they knew little science or mathematics and no other 
modern European language. 

The problems of the edgling universities were compounded by an acute shortage 
of candidates for teaching positions. The authorities tried to overcome this de ciency 
by hiring some established foreign scholars, including several distinguished members of 
Western learned societies. (The scientic fortunes of the St. Petersburg Academy would 
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have been immensely improved if only Gauss had chosen to accept the appointment 
offered him.) Even though these foreign instructors usually could not lecture in the 
Russian language, they managed to reestablish the avenues of mathematical contact with 
the West. By bringing to Russia the newest ideas of Lagrange, Laplace, Gauss, and other 
stalwarts of the day, they nurtured a generation of native mathematicians who were to 
make important strides during the second quarter of the nineteenth century. 

Unfortunately for the development of the universities, the rst decade of Alexander’s 
reign marked the last time in the nineteenth century that the Russian government regarded 
higher education as an unqualied good. Greatly shocked by Napoleon’s invasion of 
Russia in the latter half of 1812, the Russian people reacted in two ways. First, the ancient 
Russian distrust of outsiders was revived by the pillage of the country by foreigners; 
second, there was a great surge of religious mysticism in giving thanks to God, who 
once more had ung out the invaders and saved Mother Russia. With the awakening of 
national pride and the religious reaction against rationalism, it was inevitable that the 
Russian state began turning against the existing university system, based as it was on 
Western, particularly German, models. In 1815, the government decreed that henceforth 
all university lectures must be delivered in the Russian language. To save Russia from 
the misfortunes of Germany, where “the professors of godless universities imbue hapless 
youth with the subtle poison of skepticism and hatred of authority,” Russian scholars were 
forbidden to study in Germany and then the universities were prohibited from employing 
any Russian citizen who had attended a German university. There were dismissals at 
Karkov and Dorpat, but most of the foreign professors departed voluntarily, with the 
result that Russia’s universities once again withered into intellectual wastelands. 


Creation of Non-Euclidean Geometry: Lobachevsky 


The rst mathematician to express his results on non-Euclidean geometry openly, 
both orally in 1826 and in print in 1829, was a product of Russia’s new university system. 
Nicolai Ivanovitch Lobachevsky (1793-1856) was the son of a poor government clerk, 
who died when the boy was only seven. The widow, nding herself in extreme poverty, 
moved the family to remote Kazan on the threshold of Siberia, where she succeeded 
in getting her three sons admitted to the secondary schools on public scholarships. In 
1807, the young Lobachevsky entered Kazan University as a free student, planning to 
prepare for a medical career. He was destined to spend the next 40 years of his life 
there, as student, teacher, and administrator, an isolated scholar in Europe’s easternmost 
university. 

In the hope of transforming Kazan from an intellectual outpost on the periphery of 
the West into the equal of any European university, the authorities had recently acquired 
the services of four distinguished German professors. Among these was the mathemati- 
cian Johann Bartels (1769-1836), one of Gauss’s early teachers at Caroline College in 
Brunswick. Bartels came with the reputation of being more skilled in the art of teaching 
than in independent scientic investigation, and under his tutelage, Lobachevsky soon 
found himself deeply involved in the study of mathematics. In 1811, he graduated from 
Kazan with a master’s degree in physics and mathematics. The and the following year, 
he began his career by holding geometry classes for minor government of cials, who by 
law needed a college degree to advance to higher civil service positions. Two years later, 
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Lobachevsky received a regular appointment at the university as an assistant professor, 
lecturing on various aspects of mathematics, physics, and astronomy. He was promoted 
to full professor in 1816, at the early age of 23. 

It is not clear just when Lobachevsky shook himself free from Euclid’s two thou- 
sand years of authority. No doubt Bartels was familiar with Gauss’s special interest in 
non-Euclidean geometry, and Lobachevsky probably rst heard about it from him. But 
Bartels never saw Gauss after 1807 and received (1808) only one letter from his former 
pupil during his entire stay at Kazan, so that it is unlikely that he had any inkling how 
the whole problem should be treated. We know that until the early 1820s (long after 
Bartels had returned to Germany), Lobachevsky was still working along conventional 
lines and not searching for a new geometry. A manuscript for a geometry textbook that 
he drew up in 1823, presumably for classroom use, contained three attempted “proofs” 
of Euclid’s parallel postulate. Despite the proofs that he put forth, Lobachevsky made the 
signi cant statement that all such proofs “no matter of what kind, can only be regarded 
as clari cations, but do not deserve to be called mathematical proofs in the full sense.” 
Although beginning to appreciate the dif culties encountered in the attempts to prove the 
postulate, he apparently had not then excluded the possibility that a demonstration might 
yet be found. It might well have been that Lobachevsky, in trying to bring his textbook to 
satisfactory completion, turned from the attempt to perfect the structure of Euclidean ge- 
ometry and entered an entirely new path that would lead him to success. The manuscript 
itself was never published (that is, until 1910), rejected by the government press on the 
grounds that it used meters as units of measure—innovations of the French Revolution. 

The next three years saw the evolution of an alternative geometry that did not involve 
the dubious postulate. At a meeting of the mathematics and physics faculty of Kazan 
University held on February 23, 1826, Lobachevsky delivered a paper called Exposition 
succinte des principes de la géométrie avec une démonstration rigoureuse du théoréme 
des paralléles. In spite of the ominous-sounding “rigorous proof of theorem of parallels” 
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in the latter half of the title, it is likely that the lecture contained the basic outline of his 
historic attempt to establish a new geometric system independent of the fth postulate. 
No part of this French manuscript has ever been found, but its essence was incorporated 
into his memoir On the Foundations of Geometry, printed in installments in the Kazan 
Messenger, a monthly journal published by the school, in 1829-1830. This work, the 
rst account of non-Euclidean geometry to appear in print, marked the of cial birth of 
the subject. It was in Russian, so it remained unknown to most foreigners, and even at 
home it attracted little attention. The indifference was partly due to a lapse of literary 
judgment, which let Lobachevsky slight his creation with the deprecating name imaginary 
geometry. 

In the hopes of acquainting the learned world with his grand departure from Eu- 
clidean geometry, Lobachevsky submitted the manuscript of On the Foundations of Ge- 
ometry to the St. Petersburg Academy of Sciences to be considered for publication in 
one of its scholarly journals. The verdict was that the work was not precise enough and 
was unworthy of being printed under the auspices of the academy. The reviewer, Mikhail 
Ostrogradski, failing to notice the startling theme of the paper, concentrated his remarks 
instead on two de nite integrals that entered into certain of Lobachevsky’s calculations. 
Thinking that these integrals were themselves some “new method,” the reviewer com- 
plained that one of them was false and the second “easily deducible from the elementary 
principles of the calculus of integration.” A St. Petersburg journal followed the lead 
by printing an uncomplimentary review of On the Foundations of Geometry, with the 
apparent intent of making Lobachevsky and his work objects of public ridicule. 

A lesser man might have given up the struggle at this point, but Lobachevsky re- 
sponded by writing a series of papers in an effort to convince the mathematicians of 
the world of the merits of his research. The rejected manuscript was expanded into New 
Elements of Geometry, with a Complete Theory of Parallels, which appeared in the re- 
cently founded scientic journal of Kazan University during the years 1835-1838. This 
time Lobachevsky made no effort to communicate with the Academy. Then he published 
(1835) a long article that bore the title “Imaginary Geometry” in Moscow University’s 
Messenger of Europe, the rst time one of his studies was printed outside of Kazan. 
He also translated it into French to be carried by Crelle’s Journal (1837). Perhaps the 
best summary of his new geometry was a little book of 61 pages, Geometrische Unter- 
suchungen zur Theorie der Parallellinien (Geometrical Investigations on the Theory of 
Parallels), published in Berlin in 1840. Gauss _ rst learned of Lobachevsky’s contributions 
to non-Euclidean geometry when he received a copy of this last work from the author, 
and he replied to him in congratulatory fashion. That the great German mathematician 
was deeply impressed is indicated by a letter to Schumacher in 1846: 


I have recently had occasion to look through again that little volume by Lobachevsky. ... You 
know that for fty-four years now (even since 1792) I have held the same conviction. I have 
found in Lobachevsky’s work nothing that is new to me, but the development is made in a 
way different from that which I have followed, and certainly by Lobachevsky in a skillful 
way and a truly geometrical spirit. 


Gauss studied Russian in his old age, so as to be able to consult Lobachevsky’s other 
works in the original language. He also arranged (1842) for Lobachevsky to become 
a corresponding member of the Géttingen Academy of Sciences. But he stubbornly 
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withheld the public support that would have made the new ideas mathematically 
acceptable. 

At age 34, Lobachevsky was elected rector of Kazan University, a post he held 
for 19 years. The honor, coming as it did in 1827, was more likely a reward for his 
dedicated service as chairman of the university’s library and construction committees 
than a recognition of his attempt to erect an entire geometry on new foundations. As he 
neared old age, a bureaucratic revamping of the administrative structure of the Kazan 
school district brought Lobachevsky’s abrupt dismissal from the university. Sorely hurt, 
he lingered on in the neighborhood of Kazan, trying to be of some use by occasionally 
showing up for doctoral examinations. Finally, the premature death of a son coupled with 
rapid failure of his eyesight took its toll on his health and spirit. 

The two heretic geometers—Lobachevsky, who misnamed his creation imaginary 
geometry, and Bolyai who used the nobler designation absolute science of space—were 
denied the welcome that so many centuries of anticipation seemed to promise. No leading 
mathematician of the time gave a word of public approval to what Hilbert later called 
“the most suggestive and notable achievement of the last century.” Both men were aware 
of the revolutionary magnitude of their geometrical theories and expected to be right- 
fully heralded; both were bitterly disappointed with the response but reacted in entirely 
different ways. Bolyai withdrew from all mathematical activity in apparent disgust with 
the world and himself. To the last days of his life, Lobachevsky waged a resolute and 
uncompromising struggle to gain wider circulation for his ideas. In 1855, on the occasion 
of the ftieth anniversary of the founding of Kazan University, he made a nal effort 
by publishing a comprehensive exposition of his system of geometry. Under the title 
Pangeometry or a Summary of the Geometric Foundations of a General and Rigorous 
Theory of Parallels, it appeared simultaneously in Russian and French in the scienti c 
journal of Kazan University. The treatise was not written in his own hand, but dictated, 
for Lobachevsky had become blind. Although he died without seeing his ideas become 
a recognized part of mathematical thought, they achieved gradual acceptance during 
the remainder of the century. In a lecture delivered in 1873, the English mathemati- 
cian William Kingdon Clifford went so far as to compare the effect of Lobachevsky’s 
new doctrine with the scientic revolution wrought by Copernicus’s heliocentric sys- 
tem saying, “What Vesalius was to Galen, what Copernicus was to Ptolemy, that was 
Lobachevsky to Euclid.” As if to atone for the initial neglect, much of the mathematical 
world now refers to the non-Euclidean geometry that he developed as Lobachevskian 
geometry. 

As we have remarked several times, the chief difference between the axiomatic 
structure of classical Euclidean geometry and the new geometry of Lobachevsky resides 
in their respective parallel axioms. Lobachevsky replaced Euclid’s fth postulate with a 
contrary one and kept all the remaining postulates unaltered. From the time of Proclus, 
most geometers accepted Playfair’s axiom as the appropriate wording for a postulate on 
parallels. If this version is chosen, then a contrary statement says that through any point 
P not on a line / there exists either no parallel or more than one parallel to /. Because 
the assumption of no parallel would contradict the in nite extent of a straight line (a 
tacit assumption Euclid and others used), Lobachevsky took the following supposition 
as a Starting point: 
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There exist two lines parallel to a given line through a given point not on the line. 


By parallels are here meant lines that do not intersect /. From this postulate, together with 
the remaining axioms of Euclid, plus some additional axioms required for the rigorous 
foundation of Euclidean geometry, Lobachevsky developed his new geometry to the point 
of working out its trigonometry. Since the technical content is neither simple nor short, 
we shall content ourselves with a few remarks concerning parallel lines. 

All the Euclidean propositions that do not depend on Euclid’s parallel postulate, in 
particular the rst 28, were available to Lobachevsky. Thus, he knew that given a point 
P not on a line / one could construct a line /’ through P parallel to 7. The procedure 
would be to drop a perpendicular PQ to / and take /’ perpendicular to PO at P; having 
a common perpendicular segment, / and /’ would then be parallel, by Proposition 27 of 
Euclid I. What additional information could Lobachevsky glean from his new parallel 
postulate? 

Consider one of the half-lines PS' of /’. Of the various lines through P that lie in the 
interior of SPQ, some (such as the line through P and R) will intersect /; others will not. 
This situation allows us to divide these lines into two sets S, and S: in S| are just those 
lines that intersect /, whereas in Sp are just those lines that do not meet /. Each line in 
S, “precedes” each line in Sz in the sense of making a smaller angle with the segment 
PQ. Now there exists a line /* through P that brings about the division of lines into the 
two sets. Because /* either meets / or does not meet it, /* must be either the “last” line 
in Sj (it is preceded by every member of S;) or the “rst” line in Sp (it precedes every 
member of S2). To which set does this boundary line between the members of S; and 
S> belong? 


To see that there is no last line in Sj, let us assume to the contrary that /* is in S;. 
Then /* intersects / at a point E. Choose any point F on/ such that E is between QO 
and F’. Then the line through P and F is in S;. Because /* precedes this line, /* is not 
the last line in S;, and a contradiction is reached. This means that the dividing line of 
the two sets is the rst of those that do not intersect 7. We shall call /* the right limiting 
parallel to / through P. We choose this terminology because as a point R on / recedes 
endlessly from Q on /, the line through P and R approaches the limiting position /*. An 
analogous discussion, dealing with lines through P that lie in the interior of TPQ, gives 
rise to a left limiting parallel /,,. That is, there are two lines /* and /, through P that do 
not intersect /, and are such that any line through P lying within the angle formed by 
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the two lines and containing PQ intersects /. 


[* 


These limiting parallels are situated symmetrically about PQ in the sense that the 
angles w and f that /* and /,, make with PQ are equal. Suppose one of the angles, say, 
a, is greater than the other. Then there exists a line / between PQ and /* that intersects 
/ at a point R such that ZRPO = B. Let R’ be the point on the opposite side of PO from 
R and satisfying R’O = RQ. Then triangles R’PO and RPO are congruent, so that the 
corresponding angles OPR’ and ZOPR are equal. Hence, /OPR’ = B. The implication 
is that the line through P and R’ coincides with /,. But this is impossible, since /, does 
not intersect /. This contradiction allows us to discard the situation in which a > £; 
similarly, a < 6 cannot hold and so a = f. 


The angles a and # that /* and /,, make with PO are called the angles of parallelism 
at P with respect to /, and are important to Lobachevsky’s development (for instance, 
he showed that the size of a would depend on the length of the perpendicular PQ). It 
is already known that a = B < 90°. If wa = B = 90°, then /* and /, would be the same 
straight line; namely, both would coincide with the perpendicular /’ through P to PQ. 
Any other line through P would enter the interior of one of the angles a or 6 and would 
therefore have to intersect /. It follows that /’ is the only line through P parallel to /, a 
denial of Lobachevsky’s parallel axiom. Thus, the equal angles a and 6 must be acute. 

We have proved one of the rst theorems of Lobachevsky’s geometry. 


If P is any point not on a line 1, there exist two distinct lines I* and 1,, that do not 
intersect I and that form equal acute angles with the perpendicular PQ to 1. 


Actually, there are in nitely many lines through P that do not intersect /, whence it is true 
that an in nite number of parallels to / through P exist. Take any line / through P lying in 
one of the pair of vertical angles bounded by /* and /,, and not containing PQ. If/ meets /, 
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say, at the point R, then because /* passes through the vertex P of triangle POR, it must 
intersect the opposite side OR. From this contradiction, it can be concluded that any such 
line / is parallel to /. This concludes our brief exposition of Lobachevsky’s geometry. 


Models of the New Geometry: Riemann, Beltrami, and Klein 


For 35 years after the appearance of the pioneering work of Bolyai and Lobachevsky, 
the subject of non-Euclidean geometry was essentially ignored. The relative obscurity 
of the two young founders contributed to this delay, and so did the slow passage of 
scientic knowledge from one part of Europe to another. Few mathematicians could read 
Russian, the language in which Lobachevsky usually published. Further, Kant’s views 
on the Euclidean nature of space were still powerfully dominant. Kant had held that 
experience could not contradict these axioms, because intuitions of a Euclidean space are 
part of the human manner of perceiving reality. Not until the 1870s was the revolutionary 
signi cance of the new geometric ideas really understood, following the belated (1868) 
appearance in print of the famous probationary lecture Bernhard Riemann gave in 1854. 

Bernhard Riemann (1826-1866), who was born about the time non-Euclidean ge- 
ometry was discovered, was the son of a Lutheran pastor. At age 19, he enrolled at the 
University of Géttingen as a student of theology and philosophy, a choice dictated by his 
wish to please his father. He also attended lectures on mathematics and nally received 
the elder Riemann’s permission to devote himself entirely to the subject. After a year at 
G6ttingen, where he found the instruction antiquated (even Gauss taught only elementary 
courses), Riemann was drawn to Berlin to study under such great mathematicians as Ja- 
cobi, Dirichlet, and Eisenstein. (During the Revolution of 1848, as a member of the loyal 
student corps, he spent two days on guard duty protecting the king at the royal palace.) In 
1849 he returned to Géttingen to complete his training for a doctorate. His thesis of 1851, 
written under Gauss, dealt with surfaces over a complex domain; these are now called 
Riemann surfaces. This work at once established his reputation as a mathematician of 
rst rank. Perhaps because the thesis involved many original ideas that had never gotten 
beyond Gauss’s desk drawer, it excited Gauss to an unusual degree. His of cial report to 
the faculty stated: “The dissertation submitted by Riemann offers convincing evidence of 
the author’s thorough and penetrating investigations in those parts of the subject treated in 
the dissertation, of a creative, active, truly mathematical mind, and of a gloriously fertile 
originality.” On Gauss’s recommendation, Riemann became a privatdozent at Gottingen 
in 1854, subsisting on fees paid to him directly by those students who chose to attend his 
lectures. In 1857 he was promoted to the paying position of assistant professor. When 
Gauss died in 1855, his chair as professor of mathematics had gone to Dirichlet; when 
Dirichlet passed away four years later, Riemann himself succeeded to the position. But 
he had already contracted tuberculosis and was a dying man. In an attempt to cure the 
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illness in a warmer climate, Riemann spent his last years in Italy, where he died in 1866 
at the age of 39. 

On admission as a privatdozent at Géttingen, Riemann was called on to prove his 
mettle as a lecturer by delivering a Habilitationsvortrag (probationary lecture) before 
the faculty. For this trial, he submitted a list of three possible topics to the faculty; he 
felt well prepared to discuss either of the rst two topics. Riemann rashly listed as his 
third offering a subject on which Gauss had re ected some 60 years, the foundations 
of geometry. Contrary to Riemann’s expectations, Gauss selected this last topic for the 
test lecture. The strain of carrying out this dif cult assignment, while also working as 
an assistant to Wilhelm Weber in a mathematical physics course brought on a temporary 
nervous breakdown. 

In comfortable retrospect, the reading of Uber die Hypothesen welche der Geometrie 
zu Grunde liegen (On the Hypotheses that Underlie the Foundation of Geometry) on 
June 10, 1854, is seen as one of the highlights of modern mathematical history. Because 
Riemann adapted his lecture to the intended audience, the entire philosophical faculty 
at Gottingen, it contained no specic examples and practically no formulas. Yet despite 
its intuitive character, it was extraordinarily powerful in generalities and suggestive in 
nature. It is said that nobody present understood Riemann’s approach to geometry except 
the aged, legendary Gauss; and Weber said that even Gauss was perplexed. 

Although Riemann’s inaugural lecture did not immediately affect the intellectual 
world, its publication two years after his death caused a stir among those mathematicians 
who sought to Il in the details. With the discovery of competing geometries, no one 
geometry could be regarded as a collection of truths about physical space. Riemann, in 
assessing just what facts we can hold as certain, had the marvelous perception that the 
space of our experience may yet be nite. As he af rmed in the lecture: 


In the extension of space constructions to the immeasurably great, we must distinguish 
between unboundedness and in nite extent; the rst pertains to the extent relations, the 
latter to the measure relations. 


600 


Chapter 11 Nineteenth-Century Contributions: Lobachevsky to Hilbert 


The surface of a sphere, with great circles understood as “lines,” furnishes a good illus- 
tration of what Riemann meant; a “line” is not in nite in extent, because after a nite 

prolongation it returns on itself, yet it is unbounded in the sense that it can be traversed 
endlessly about the sphere. Riemann also argued in favor of geometry as an empirical 
science. He held that because observation on physical space had not con rmed _ the exis- 
tence of parallel lines, one could, with as much reason as not, assume that every pair of 
lines would meet at some nitely distant point. 

Mainly as a result of Riemann’s thinking, attention was directed to a different type 
of non-Euclidean geometry, one that takes as its alternative to the parallel postulate the 
statement that there are no parallels to a line through a point not on the line, in short, that 
any pair of lines must meet somewhere. The geometry stemming from this assumption 
requires an even more drastic break with tradition than that made by Gauss, Bolyai, 
and Lobachevsky, for the Riemannian parallel axiom affects the import of the remaining 
axioms. In fact, it con icts with a combination of Postulates 1 and 2 of Euclid. 

When considering Euclidean geometry, we observed that Proposition 16 of Book 
I, the so-called exterior angle theorem, implies the existence of parallel lines. Thus, the 
supposition that there are no parallel lines would be inconsistent with the axioms of 
Euclid that led to this proposition. (A set of axioms is said to be consistent if they and the 
consequent theorems, taken together, produce no logical contradiction.) A brief examina- 
tion of Euclid’s proof of Proposition 16 reveals that in it he unconsciously assumed the 
innite extent of straight lines; if lines are considered nite in length yet boundless, then 
his arguments fails. To avoid a contradiction in the geometry suggested by Riemann, 
it is necessary to modify Euclid’s axiom scheme further, by replacing Postulate 2 by a 
Postulate 2’: A straight line is boundless. Another disturbing factor is that in assuming 
that two straight lines always meet, we allow the possibility that they will intersect at 
more than one point. This con icts with Postulate 1 of Euclid, which requires that two 
points determine one and only one line. As if two serious departures from Euclid are 
not enough, Riemannian theory necessitates discarding the traditional form of Postulate 
1 and adopting instead Postulate 1’: Two distinct points determine at least one line. 

The rst of the non-Euclidean geometries involved nothing more than replacing the 
parallel postulate by the contrary statement that through a point not on a given line there 
passes more than one parallel to the line, but the new geometry of Riemann was achieved 
by making three changes in the axiom system framed by Euclid. At rst sight this may 
seem an exorbitant price to pay, but the geometry developed from these hypotheses 
turned out to be as consistent as Euclid’s. In fact, the resulting system corresponded to 
what Saccheri obtained under his hypothesis of the obtuse angle. The theorems are, to 
be sure, radically different from those to which we are accustomed and cannot fail to be 
a little disconcerting at rst. For instance, one of Riemann’s results is that all straight 
lines have the same nite length, and another asserts that all perpendiculars to a straight 
line must meet in a point. You can probably almost anticipate the theorem on angle sums 
of triangle: The sum of the angles of any triangle is greater than 180°. This may appear 
to contradict Legendre’s rst theorem. In the demonstration of that theorem, however, 
Legendre used the property that a straight line is in nite, a fact unavailable to Riemann. 

This second period in the development of non-Euclidean geometry focused on the 
problem of its logical consistency, a problem whose solution brought to a close the many 
efforts to deduce the parallel postulate as a theorem from Euclid’s other assumptions. As 
far as the creators of non-Euclidean geometry went in their investigation of the subject, 
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no evident contradictions were encountered. They left open, however, the possibility that 
further effort might yet produce a contradictory statement as a theorem of the system. 
If this were to happen, not only would the approach of Saccheri be vindicated, but 
all the work of Gauss, Bolyai, Lobachevsky, and Riemann would prove to be, quite 
exactly, much ado about nothing. Consequently, doubt hung over their enterprise like a 
threatening cloud. Although they felt con dent that their new geometries were as valid 
as Euclid’s, this feeling was still an act of faith. 

In the development of any axiom system—one de ning a geometry or some other 
structure—there is the task of proving that the system is consistent by showing that no 
contradictions can occur. The consistency of any set of axioms is ordinarily established 
by nding a model whose elements and relations are speci c interpretations of the unde- 
ned technical terms of the axiom system, and in which the axioms are realized as true 
statements about the model. Because the model itself will have been constructed within 
the framework of some other set of postulates, the most we can say about the axiom 
system under investigation is that it is relatively consistent with the system of the model. 
That is, the system of axioms that interests us will be seen to be consistent, provided that 
the axioms in the system that furnished the model are themselves consistent. This sort of 
consistency proof, of course, merely shifts the burden of proof of consistency from one 
system to another; and yet it is the best we can provide. Because no absolute method for 
testing the consistency of a set of axioms had ever been derived, one has to be content 
with relative consistency. 

The basic idea of a proof of the relative consistency of non-Euclidean geometry 
was given in 1868 by the Italian geometer Eugenio Beltrami (1835-1900) in a paper 
entitled Saggio di interpretazione della geometria non-Euclidea (Essay on the Interpre- 
tation of Non-Euclidean Geometry). His method consisted of nding a model within 
Euclidean geometry that with suitable interpretations had the same postulation structure 
as the non-Euclidean geometry of Bolyai and Lobachevsky. The Euclidean surface on 
which Beltrami gave a partial representation of non-Euclidean geometry is called a pseu- 
dosphere, because it has certain properties in common with a sphere. The “pseudosphere” 
is the surface of revolution obtained by revolving a curve known as a tractrix about its 
asymptote. The resulting gure, which resembles an in nitely long horn, has constant 
negative curvature. 


The Pseudosphere 


Provided that the role of “straight line” between two points on the pseudosphere is taken 
by the geodesic connecting the points (the path of shortest length, like a great circle on a 
sphere), the geometry on this surface satis e s the postulates of Bolyai and Lobachevsky. 
The axioms of classical non-Euclidean geometry t the geodesics on the pseudosphere, 
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so every theorem, as a logical consequence of the axioms, can be interpreted as a fact on 
the pseudosphere. (Actually, Beltrami succeeded in representing only a limited portion 
of the Lobachevsky plane on the pseudosphere, and the task of interpreting all this non- 
Euclidean geometry fell to Felix Klein in 1871.) Beltrami’s model made the “imaginary 
geometry” of Lobachevsky “real,” in the mathematical sense, by giving it a readily 
visualizable interpretation on a familiar Euclidean surface. More important, the geometry 
on the pseudosphere provided an interpretation in which Euclid’s parallel postulate could 
not possibly hold, yet in which all his other axioms were true. 

Beltrami’s success in realizing the geometry of Bolyai and Lobachevsky on the sur- 
face of the pseudosphere showed that non-Euclidean geometry is as logically consistent 
as the geometry of Euclid. Every theorem of their geometry, when interpreted in the 
model at hand, gets translated into an ordinary Euclidean result; any possible inconsis- 
tency within classical non-Euclidean geometry would reveal itself in a corresponding 
inconsistency in the Euclidean geometry of geodesics on the pseudosphere. Thus, if one 
is willing to grant that Euclidean geometry is consistent (and no satisfactory proof of this 
has ever been given), the non-Euclidean geometry of Bolyai and Lobachevsky is equally 
correct. 

The discovery of a model for non-Euclidean geometry also revealed the inherent 
impossibility of proving the parallel postulate on the basis of the axioms of Euclid. If 
the parallel postulate could be deduced from the remaining axioms, then it could be 
converted into a theorem in non-Euclidean geometry. (Aside from the fth postulate, 
the axioms of Euclid are the same as the axioms of Bolyai and Lobachevsky.) But this 
theorem would contradict the form of the parallel hypothesis Bolyai and Lobachevsky 
used, making their non-Euclidean geometry inconsistent; and this is not the case. It is 
now clear why two thousand years of efforts to prove Euclid’s parallel postulate from 
the rest of the postulates were in vain. If Euclidean geometry is assumed consistent, no 
proof can ever be found. 

The non-Euclidean geometries had not yet become familiar topics among mathemati- 
cians when in 1871 and 1873 Felix Klein (1849-1925) published two monographs enti- 
tled Uber die Sogenannte Nicht-Euklidische Geometrie (On the So-Called Non-Euclidean 
Geometry). Klein pointed out a clear distinction between two geometrical theories that 
assume Riemann’s parallel postulate. In one of the systems, two straight lines meet at 
a single point, and in the other, two straight lines intersect at two points. His essential 
work in these publications was to furnish models for Lobachevskian geometry and the 
two kinds of Riemannian geometry. Perhaps the simplest model for the rst type of Rie- 
mannian geometry is gotten by taking only half of a sphere with its boundary circle (one 
must identify as equal any two points on the boundary that are diametrically opposite). 
The great circle arcs on the hemisphere are the “lines” of this geometry. The sphere 
itself provides a model for the other type of geometry. We cannot be certain that Rie- 
mann recognized the possibility of both geometries; his celebrated memoir did not deal 
in specics. For the two new non-Euclidean geometries, the case for consistency can be 
reduced to that for Euclidean geometry; that is, if one is willing to believe that Euclidean 
geometry is consistent, then so also are the two types of Riemannian geometry. The 
unanswered question whether Euclidean geometry can ever be shown to be internally 
consistent will be deferred for the time being. 

Klein’s most important achievement in geometry was the creation of the Erlanger 
Programm, a bold proposal to use the group concept to classify and unify the principal 
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geometries then existing. When Klein was 23 years old, he had been made a full pro- 
fessor at the University of Erlangen. By the custom in German universities, he gave 
(1872) an inaugural lecture for presentation to the philosophical faculty and the senate. 
The views expressed in this address, “A Comparative Review of Recent Research in 
Geometry,” are known as the Erlanger Programm for geometry. In essence, Klein de- 
scribed geometry as the study of those properties of gures that remain invariant under a 
particular group of transformations. Not only did this provide a neat way of codifying all 
the seemingly unrelated geometries that were known in the late 1800s, but it indicated 
that new geometries could be de ned by starting with different transformation groups. 

In 1886, Klein accepted a call to the University of Géttingen to Il the role of 
leading research mathematician. His extensive activity rescued the university from the 
mathematical doldrums into which it had fallen after the premature death of Riemann 
and helped raise Géttingen to greater renown than even in the days of Gauss. Klein’s 
stimulating in uence can be seen from the number (48) of dissertations that he personally 
supervised. His “favorite pupil” in these Géttingen days was the Englishwoman Grace 
Chisholm Young. 


Grace Chisholm Young 


Grace Chisholm was a pioneer in mathematics, one of the few women of her gener- 
ation to achieve an international reputation in the subject. After graduating from Girton 
College, Cambridge, she had been advised to continue her studies under Klein. Permis- 
sion was not entirely his to give, resting instead with the Ministry of Culture in Berlin; 
at that time, women could not be of cially admitted to German universities. Through 
Klein’s committed support and in uence, approval was secured, as long as the only 
exceptional cases were foreign graduate students. In the fall of 1893, Grace Chisholm 
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was one of three women allowed to attend regular lectures and seminars at Gottingen. 
(Also arriving on the same errand was Mary Winston, who had just completed a year 
of graduate study at the newly opened University of Chicago; with the awarding of her 
degree under Klein’s direction in 1897, she became the rst American woman to obtain 
a Ph.D. in mathematics from a foreign university.) Grace Chisholm’s doctorate in 1895 
was the rst awarded in Germany, through the normal examination process, to a woman 
in any eld whatever. 

Returning to England on completion of her doctoral research studies, she married 
William Henry Young (1863-1942), one of her former tutors. Young, between the ages of 
25 and 35, had published practically no mathematics, but deliberately set himself to earn- 
ing quite large sums through the private teaching of undergraduates. At the end of their 
rst year together, the couple agreed to “throw up lucre,” go abroad, and devote them- 
selves energetically to research in the theory of functions of a real variable. Living mostly 
in Gottingen until 1908 and then in Switzerland, they carried out investigations that helped 
make England part of the modern mathematical world. Grace Chisholm Young, either 
alone or in collaboration with her husband, wrote 31 papers and several books. 


The most conspicuous and distinctive feature of 


11.3 The Age of Rigor nineteenth-century mathematics was the introduction 


of a strictly logical approach; indeed, historians have 
D’Alembert and Cauchy frequently labeled this period the Age of Rigor in 
on Limits mathematics. A critical spirit soon permeated the 

whole of analysis, beginning with the calculus, and 
toward the end of the century, this led to an overhaul of the foundations of geometry. 
In the extremely rapid development of the calculus in the 150 years after Newton and 
Leibniz discovered it, mathematicians plunged ahead without logical support. They were 
aware of the requirement of proof, but this tended to take the form of a search for an 
explanation that would be intuitively plausible instead of mathematically exact. Rules of 
procedure—differentiation and integration, but also rearrangement of series—were nor- 
mally used in a formalistic way without regard for their validity. The success in applying 
the conclusions to physical problems gave con dence that the mathematics itself must 
be correct. “Go forward and faith will come to you,” d’Alembert is supposed to have 
said. The highly unsatisfactory state of the logical underpinnings of the calculus could 
not long continue unchallenged, and in the second half of the 1700s, an effort was made 
to supply the missing rigor in its fundamental conceptions. 

The illegitimate son of an aristocratic woman—the sister of a cardinal—and a cavalry 
of cer, Jean-le-Rond d’Alembert (1717-1783) was left at birth on the steps of the little 
church of St. Jean le Rond, close to Notre Dame in Paris. The father quickly sought out 
the child and placed him with a poor foster family. The name d’Alembert was appended, 
for reasons unknown, when the child had grown to young manhood. D’Alembert received 
an annuity from his natural father, which provided for an excellent formal education. He 
embarked upon a career in the law and then turned to medicine, before settling his 
attention on mathematics. 

D’Alembert would later write, “Mathematics was the only occupation that seriously 
interested me.” His rst mathematical paper was submitted to the Académie des Sciences 
in 1739, and he rose rapidly to prominence with the publication of two scientic works, 
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the Traité de dynamique (1743) and its companion volume Traité des fluides (1744). 
The 7raité de dynamique contained the observation that Newton’s third law of motion 
applied to solid bodies free to move as well as to stationary bodies; what is now called 
d’Alembert’s Principle reduced, in effect, problems in dynamics to problems in statics. 
In 1746 he was awarded the Berlin Academy’s prize for a paper on the general cause 
of winds. 

By midcentury, d’Alembert was widely acknowledged the most in uential math- 
ematician in France. His investigation of the motion of the vibrating string led to the 
derivation and solution of the partial differential equation known as the “wave equation.” 
Nearly all d’Alembert’s work in mathematics was produced between 1743 and 1754. In 
later years his mathematical productivity suffered as he devoted more and more time 
to the famous Encyclopédie, which he edited with the philosopher Diderot, and to his 
position as permanent secretary of the Académie Francaise, a society founded in 1635 
for raising the standards of French literature and language. Frederick the Great fully 
expected d’Alembert to succeed the aging Euler as head of the Berlin Academy; but he 
rejected Frederick’s repeated pleas and the offer of a princely stipend, arranging instead 
for Lagrange to go in his place. 

D’Alembert felt that a rm logical framework for the calculus lay in the use of limits. 
He told Euler in a letter (1759), “I believe I have developed the true metaphysics of their 
principles [limits], so far as it is possible.” But d’Alembert failed to appreciate that such 
a subtle concept required an extremely careful formulation; in the article “Limits” for 
the Encyclopédie he gave a vague and purely verbal de nition: 


One magnitude is said to be the limit of another magnitude when the second may approach 
the rst within any given magnitude however small, though the rst magnitude may never 
exceed the magnitude it approaches. 


Unable to escape the traditions of eighteenth-century mathematics, d’Alembert’s expla- 
nation of limits relied on geometric visualizations and vague ideas about “closeness.” 
His contemporaries could hardly be satis ed with the “continuous motion” description 
of how a secant to a curve becomes its tangent. 


It is easy to see that the secant 4B, which is drawn through the two points A and B, slowly 
approaches the tangent to the curve as B approaches A, and will na lly become identical 
with the tangent when the two points coincide. The tangent is the limit of the secants; they 
approach closer and closer to it without ever reaching it. 


Unfortunately, d’Alembert’s conception of limit did not have as much in uence as it 
deserved; his scheme remained one among several competing approaches to founda- 
tional questions. It was only when the limit notion was translated into the language of 
inequalities and applied to functions, instead of variables, was the calculus rigorously 
grounded. 

The leading authorities of the time, Euler and Lagrange, made equally unsuccessful 
but nevertheless fruitful attempts to provide an acceptable foundation for the calculus. 
Euler’s Institutiones Calculi Differentialis (1755) and the later J/nstitutiones Calculi 
Integralis (1768-1770) brought together all that was then known on the subject. The 
wide acceptance of the works helped standardize the notation and content of this vast 
branch of mathematics. Euler divorced calculus from its geometrical origins by treating 
it as an extension of algebra. Finite formulas were naively accepted as carrying over 
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unconditionally to the in nite case; questions of convergence were completely ignored. 
It is not surprising that Euler’s trust in freewheeling calculation sometimes yielded 
dubious, if not absurd, conclusions. Lagrange’s approach to the subject led down 
a mathematical path that won him few supporters. Feeling that limits were alien to 
analysis, he sought to establish the results of the calculus by dispensing entirely with the 
limit concept. The central role in Lagrange’s Théorie des fonctions analytiques (1797) 
was played by power series under the unwarranted assumption that every function 
allowed such an expansion. His ndings were generally correct, although intuitively 
reasoned rather than grounded in rigorous demonstration. 

For the calculus to be put on a sound footing, the fetters of geometrical constructions 
and mechanical analogies had to be removed; and the subject had to be built anew solely 
on considerations derived from arithmetic and algebra. Many mathematicians of the time, 
among them d’Alembert, insisted that the logical basis of the calculus was to be found 
in the method of limits. Although the leaders of this movement understood the idea of 
a limit a little better than most of their contemporaries, they failed to give the concept a 
clear and precise de nition that would make it logically tenable. Augustin Louis Cauchy 
(1789-1857) nally developed an acceptable theory of limits, and in doing so removed 
much of the doubt about the soundness of the notion. 

Cauchy acquired his early education from his father, a barrister and master of clas- 
sical studies. In 1805 he entered the Ecole Polytechnique in Paris, proceeding two years 
later to the civil engineering school Ecole des Ponts et Chaussées (School of Bridges 
and Roads). Thereafter he worked brie y (1810-1813) as a military engineer at the har- 
bor of Cherbourg, where Napoleon had started to build a naval base for his intended 
invasion of England. Cauchy rst attracted the attention of the leading mathematicians 
of France, among them the aging Lagrange, by a series of brilliant memoirs that date 
from 1811 when he was but 22 years old. For reasons of health, Cauchy was persuaded 
by Lagrange to abandon the profession of engineering and to devote himself to pure 
science and mathematics. Cauchy returned to Paris in 1813 to accept an instructorship 
at the Ecole Polytechnique, rising to full professor in 1816. As we have observed, it 
was the practice of scientic academies to offer prizes for solving major outstanding 
problems. Cauchy won a grand prize from the French Académie des Sciences in 1816 
for a 300-page paper on the propagation of waves at the surface of a liquid. 

Although he was worthy of admission to the Académie, Cauchy lost the respect of 
many scientists at this time by being appointed to the Académie by royal decree instead 
of being elected by its members. Moreover, the seat that became Cauchy’s had become 
vacant because of the political expulsion of the noted geometer, Gaspard Monge. An 
active partisan of the revolution, and subsequently an enthusiastic supporter of Napoleon, 
Monge was not forgiven by the returned rulers of France during the Bourbon Restoration 
of 1814. 

The changing political situation in France had unexpected effects on Cauchy’s career. 
He was an ardent royalist, supporting the Bourbon monarchy almost to the point of 
perversity. When Louis-Phillipe came to power following the Revolution of 1830, Cauchy 
refused, as a loyal supporter of the deposed King Charles X, to swear allegiance to the 
new government. Abandoning his academic post at the Ecole Polytechnique, Cauchy 
went into a self-imposed exile. He did not regain public employment in France for 18 
years. Cauchy spent part of this time teaching at the University of Turin (1831-1833) and 
part in Prague (1833-1838), where the ex-monarch had summoned Cauchy to tutor his 
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grandson. When Cauchy returned to France in 1838, he was permitted to resume his post 
at the Académie but not his professorship at the Ecole Polytechnique, because he was 
still adamant in refusing to take an oath of allegiance. He had to be content to support 
himself by teaching at a Jesuit college. When Louis-Phillipe was himself ousted in the 
Revolution of 1848 and oaths were no longer required, Cauchy was appointed professor 
of celestial mechanics at the Sorbonne, where he remained for the rest of his career. 

Like Euler’s, Cauchy’s mathematical output was prodigious and embraced almost all 
branches of the subject. In total he wrote eight full-length books as well as 789 papers, 
which in a modern edition Il 26 large volumes. Cauchy was so active that he had to 
establish a private journal, the Exercises de mathématiques (1826-1830), continued in 
a second series as Exercices d’analyse mathématique et de physique, whose issues were 
composed entirely of Cauchy’s own work. In 1835, the Académie des Sciences began 
to publish a weekly bulletin, the Comptes rendus. Cauchy inundated this journal with 
a succession of his papers to such an extent that the Académie became alarmed at the 
rapidly mounting printing bill; to cope with Cauchy’s staggering productivity, it passed 
a rule (which is still in force today) limiting each contribution to a length of four pages. 
In all, the Comptes rendus printed 589 articles by Cauchy. 

Cauchy was not the rst mathematician to attempt a logical underpinning for the 
calculus. In his in uential textbook Théorie des fonctions analytiques (1797), Lagrange 
tried to base the whole subject on a series expansion suggested by Brook Taylor. Lagrange 
implicitly assumed that any function f has an in nite Taylor series of the form 


he hs 
Seth Gr Bid ea Per es 
where p,q,r,... are new functions of x different from but somehow derived from /. 
In the expression 
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V goes to zero as h does, so Lagrange’s idea was to dene the derivative of f with 
respect to x as the coef cient of A in the linear term of the Taylor series expansion for 
f(x +A). He introduced an accent mark as the notation to indicate the derivative, writing 
p = f'(x), and called this the rst derived function of f (thus providing the source 
of our term “derivative”). Higher order derivatives are de ned to be the coef cients of 
h? /2!, h3/3!,... and denoted by f”(x), f’"(x),.... Through this new approach Lagrange 
concluded that the calculus could be “redeemed from all considerations of the in nitely 
small, from vanishing quantities, from limits and from uxions, and reduced to algebraic 
analysis of nite quantities.” 

The Théorie des fonctions analytiques was a work of grand design, systematically 
developing the major properties of the calculus based on this de nition of the derivative. 
By freeing the differential calculus from “every illicit supposition” and relying on the 
method of power series, Lagrange declared that he could bring to the subject “the rigor of 
the ancient demonstrations.” Although the Théorie des fonctions analytiques was greatly 
applauded at rst, mathematicians soon perceived certain aws in Lagrange’s scheme. 
Expanding a function in a Taylor series is not an orthodox algebraic process, as Lagrange 
erroneously thought. It is still necessary to consider the matter of convergence, which 
must be discussed in terms of limits; his claim to have circumvented the limit concept 


p+, 
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was therefore illusory. Nor does every function allow a Taylor series expansion. For a 
function to have this series representation, all its derivatives must rst exist: the series 
is constructed from the derivatives, and not the other way around. A further dif culty, 
which Cauchy was the rst to point out, is that a given Taylor series does not determine 
a function uniquely. For example, the function f(x) = e7!/ *” with f(0O) = 0, and the 
constant function g(x) = 0 both have the same Taylor series at x = 0. 

Even though the Taylor series process proved inadequate for his primary purpose, 
Lagrange nonetheless made a start in the direction of setting the known results of the 
calculus within a rigorous framework. Inspired by Lagrange’s investigations, Cauchy 
took up the task of providing a logically consistent foundation for the subject, one that 
did not rest on any appeal to geometric notions such as slope or tangent. His program 
of rigorization rested on “purely analytic and direct proof,” with an inequality-based 
translation of the limit concept serving as the cornerstone. Many of the details of Cauchy’s 
new approach to analysis were developed in the classrooms of the Paris colleges in which 
he taught. In an age that made little distinction between research work and teaching 
material, he was not above putting his notes into book form “for the greater use of 
the students.” Cauchy’s elementary textbooks, combining clear exposition with strict 
reasoning, were far ahead of the loose and slack works of the previous generation, and 
set the style for future mathematical writing. 

The formulation of elementary calculus current today in most textbooks is essen- 
tially what Cauchy expounded in three great treatises: Cours d’analyse de I’Ecole Royale 
Polytechnique (1821), Résumé des lecons sur le calcul infinitesimal (1823), and Lecons 
sur le calcul différentiel (1829). The rst and most important of these works is based, 
as the title indicates, on the lectures in analysis that he had been presenting at the Ecole 
Polytechnique. In the introduction to the Cours d’analyse, Cauchy was explicit in his 
aspiration to remove the immense obscurity that reigned in analysis: “As to methods, I 
tried to Il them with all the rigor one requires in geometry, so as never to resort to 
the reasoning taken from the generality of algebra.” In the “generality of algebra,” he 
included the habits of assuming for complex numbers all the properties of real numbers, 
and for in nite series the properties of nite sums. 

The Cours d’analyse contains the de nition of limit that was used until the 1870s, 
when the modern € — 6 version was rst put forth. Banishing the rough, geometrically 
intuitional concepts of the early analysts, Cauchy appealed to the notions of number, 
variable, and function, saying: 


When successive values attributed to a variable approach inde nitely to a xed value so as 
to end by differing from it by as little as one wishes, this last is called the limit of all the 
others. 


From this purely arithmetical de nition of limit, Cauchy then went on to give the central 
concepts of the calculus—continuity, differentiability, and the de nite integral—a formal 
precision that had been lacking in the work of his predecessors. It is remarkable how 
closely his de nition of a continuous function, for instance, parallels what we use today: 


The function f(x) is continuous within given limits if between these limits an in nitely small 
increment i in the variable x always produces an in nitely small increment, f(x +7) — f(x), 
in the function itself. 


Because he swept away the old foundations based largely on faith and formal 
techniques, Cauchy has come to be regarded as the creator of calculus in the modern 
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sense. Calculus after Cauchy was no longer a set of problem-solving operations but a set 
of theorems based on rigorous de nitions. 

Indiscriminate manipulation of in nite series, with the question of convergence 
blithely ignored, had gone on for over a century before the time of Cauchy. Although 
better mathematicians were generally wary of divergent series, lesser talents were of- 
ten led to completely absurd results when working with them. A well-known example 
involves the series expansion 


1 
1+x 


Guido Grandi (1671-1762), a professor at Pisa, in his Quadratura Circuli et Hyperbolae 
of 1703, set x = 1 in this expression and grouped the terms on the right-hand side in 
pairs to obtain sums of both 0 and 1/2: 


= lax te? ax faas., 


1 
ini I)+d—-1)+---=0+0+---=0. 
This, he argued, proved mathematically that the world could be created out of nothing. 
Cauchy showed that the convergence, and thus the meaning, of the series in question is 
assured only when |x| < 1. 

To bring order to these logically questionable practices, Cauchy devotes much of the 
Cours d’analyse to a careful treatment of series. He rejects the attribution of a sum to 
a nonconvergent series, because no numerical veri cation is possible; and stresses that 
the convergence of an innite series wp + u; +u2+--- is determined by the tendency 
of the nite, partial sums S, = up + u; +---+u, to a limiting value: 


If, for ever increasing values of n, the sum S, inde nite ly approaches a certain limit S, 
the series will be called convergent, and the limit in question will be called the sum of the 
series. If, on the contrary, while n increases inde nite ly, the sum S, does not approach a 
xed limit, the series will be divergent and will no longer have a sum. 


A page later Cauchy indicates the profound result that today is called the “Cauchy 
convergence criterion.” Speci cally: A sequence S, converges to a limit if and only if 
the difference S,, — S, can be made less in absolute value than any assignable quantity, 
provided m and n are suf ciently large. The great merit in this criterion is that it provides 
an internal condition that can be checked by looking at the sequence itself, not at a 
proposed limit. Cauchy has little dif culty in proving that his convergence principle 
is necessary, but he avoids the difculty of showing its suf ciency and simply claims 
that when the condition is ful Iled, the convergence of the sequence is assured. A more 
satisfactory proof requires a ner analysis of the structure of the real line (more precisely, 
the completeness of the real numbers), something that was unavailable at that time. 
The topics that Cauchy treated in the Cours d’analyse were subtle, and by modern 
standards even he made mistakes at the lowest level of his analysis. For instance, when 
handling in nite series of functions he did not distinguish between what we now call 
pointwise convergence and uniform convergence over an interval. The most famous 
“false theorem” contained in the Cours d’analyse asserts that the sum-function S(x) = 
uo(x) + uj(x) + u2(x) +--+ of a convergent series of continuous functions u,,(x) is itself 
a continuous function. In an 1826 paper on the binomial series, Abel pointed out that 
this assertion is false. He remarked, “It seems to me that this theorem of Cauchy admits 


610 


Chapter 11 Nineteenth-Century Contributions: Lobachevsky to Hilbert 


exceptions. For example the series 
: Ih or 
sin(x) — 5 sin(2x) + 3 sin(3x) —--- 


is not continuous at x = (2n 4+ 1), n being an integer.” 


Fourier’s Series 


Discontent with the concept of convergence of series was further heightened by the 
publication in 1822 of a treatise written by a part-time mathematician, Jean-Baptiste- 
Joseph Fourier (1768-1830). Fourier was born in Auxerre, France, the son of a master 
tailor. Orphaned at the age of 9 years, he was placed by the Bishop in the nearby military 
school run by the Benedictines. The boy’s interest in mathematics was rst aroused in 
this school. As he grew older he formed the hope of a commission in the artillery, but 
his application was met with the crushing reply that as he was not of noble birth he could 
not be admitted “even if he were a second Newton.” So instead, he entered an abbey 
where he taught mathematics to the other novices while preparing to take vows and join 
the Benedictine teaching order. 

The outbreak of the French Revolution in 1789 turned the young man’s life over 
again. When all the monastic schools were suppressed and a revolutionary decree for- 
bade the taking of religious vows, he returned to Auxerre to teach mathematics at his 
old school. The school continued to function under the auspices of the municipality. 
The abbey was made available for the pupils’ use, and the church became the school’s 
chapel. 

Fourier became involved in local politics in the spring of 1793 and for a time 
continued both to teach at his school and act as a member of the local revolutionary 
committee. In the shifting political situations of the time, Fourier was arrested more than 
once just before Robespierre fell from power. Fourier was fortunate to escape execution. 

Even after his release from prison, Fourier continued to play a part in local politics 
but he was also named, in 1794, to the hastily created Ecole Normale where he had 
the opportunity to attend lectures of such luminaries as Monge, Lagrange, and Laplace. 
This in turn led him to a position as assistant lecturer at the Ecole Polytechnique and his 
subsequent appointment to Lagrange’s chair in mathematics there. 

In 1798, Monge selected Fourier as one of the scientists to accompany Napoleon 
on his ill-fated Egyptian campaign. The savants of the expedition remained three years 
and established the Institut d’Egypte, with Monge serving as president, Napoleon as 
vice-president, and Fourier as secretary. There was a time when Fourier, having been 
appointed to a number of administrative positions, virtually governed half of Egypt. On 
his return to Paris, Fourier resumed his teaching duties at the Ecole Polytechnique, but 
relinquished them after three months when Napoleon chose him to be prefect of the 
Department of Isére (centered at Grenoble), an of ce he retained for the next 14 years. 

In the midst of all his governmental responsibilities, Fourier somehow found time to 
write the “Historical Introduction” to the massive Description of Egypt, and to compose 
his landmark work on the theory of heat distribution, the Théorie Analytique de la Chaleur 
(Analytic Theory of Heat). Napoleon conferred the title of Baron of the Empire on Fourier 
in 1808, and thereafter made him a Count. 
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The restoration of the French monarchy deprived Fourier of his Napoleonic title, 
position, and pension; however, through the intercession of a former pupil at the Ecole 
Polytechnique, he was given the directorship of the Bureau of Statistics for the Seine, a 
post with few duties but a suf cient salary. Fourier’s election (1816) to the reestablished 
Académie des Sciences was initially refused by the new king, Louis XVIII (brother of 
the guillotined Louis XVI), who could not forgive Fourier’s zealous support of Napoleon. 
Later when feelings had subsided, the nomination was unopposed, and Fourier went on 
to serve the Académie in the in uential position of Permanent Secretary from 1822 until 
his death in 1830. 

When it nally appeared in 1822, Fourier’s Théorie Analytique de la Chaleur had 
been almost 20 years in the writing. The rst decisive step in the work was the submis- 
sion of a lengthy memoir, Théorie de la propagation de la chaleur dans les solides, to the 
Institut de France, the Revolution’s renaming of the Académie, in 1807. The commit- 
tee set up to report on the memoir—Lagrange, Laplace, Monge, and Lacroix—strongly 
opposed it on two grounds: the “analytic dif culty” in deriving the differential equa- 
tions governing heat ow, and the extensive use in their solution of the trigonometric 
expansions known today as Fourier series. As a means of settling the question, the In- 
stitut made the propagation of heat the subject of the grand prize in mathematics for 
the year 1812. The paper Fourier presented at the end of 1811 was identical with his 
1807 memoir as regards essential contents. Although the judges awarded him the prize, 
they once again had strong reservations, stating that his entry “still leaves something to 
be desired in the realms of both generality and rigor’; the Institut was in no hurry to 
print the Prize Essay. (Fourier always considered the reproach unjust: when he became 
Permanent Secretary some 10 years later, he insisted that the Académie publish the Prize 
Essay just as originally submitted.) 

Fourier began a third version of his heat researches, now in the form of a book. 
Reworking and expanding its mathematical portions, the Prize Essay was incorporated 
practically without change into the Théorie Analytique de la Chaleur. The physicist Lord 
Kelvin (William Thompson) later referred to the work as “a great mathematical poem.” 

Few other works have had such a profound in uence on subsequent developments 
in mathematics as the Théorie Analytique. The older generation of mathematicians was 
deeply disturbed by Fourier’s statement that an “entirely arbitrary” (periodic) function 
can be expressed by an in nite trigonometric series; that is, if the function f(x) is de ned 
on the interval —w < x < m and has period 27, (f(x + 27) = f(x) for any x), then it 
can be written as 


1 oe) 
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where the constants a, and b, are determined from the formulas: 
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The coef cients a, and b, chosen in this way are known as the Fourier coef cients of the 
function f(x)—tacitly assumed to be integrable—and the corresponding series is said to 
be the Fourier series of f(x). 

Fourier’s assertion, in this degree of utter generality, is now known not to be strictly 
true: there do exist periodic, continuous functions which do not admit Fourier series 
representations. The weak point in Fourier’s investigation lies in the failure to justify 
mathematically whether this type of trigonometric series converges for a particular value 
of x and, even if it converges, whether the sum must be f(x). As supporting evidence 
for the reasonableness of his claim, he calculated a, and b, for small values of n in a 
number of specic cases. One instance was the function f(x) given by f(x) =~ for 
—m <x <a and repeated periodically over the rest of the real line 


Ax) 


For this function, Fourier showed that 
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a result which he insisted was “not a question of faith but of mathematics.” 

The Théorie Analytique was rich in implications that went far beyond its seemingly 
narrow scope. Fourier’s rule for forming his trigonometric series involved computing 
the coef cients as integrals. It was current among mathematicians of the eighteenth 
century to regard integration as simply the reverse process of differentiation; that is, the 
integral de ning the constant a, was taken to mean some function with derivative equal 
to f(x)cosnx. Since f(x)cosnx could not always be expressed as the derivative of a 
known function, Fourier instead interpreted its integral to be the area of the region under 
the graph of f(x) cosnx for x between —z and z. Such an area made sense even if f(x) 
was an “unruly” function that produced a graph with a number of corners and breaks. 

There was a clear need to develop a theory of integration, de ned independently 
of differentiation, that would embrace discontinuous functions on an equal footing with 
continuous ones. The familiar approximating-sum conception of a de nite integral was 
presented by Riemann in one of the two Habilitationsscrift papers he submitted upon 
joining the faculty at Géttingen in 1854; it was not published until 13 years later, and 
then only after his untimely death. Riemann’s version of integration covered a much 
broader class of functions than the continuous functions. The extent of the generalization 
was strikingly exhibited when he offered an example of a Riemann-integrable function 
having in nitely many discontinuities within the interval of integration. 

Fourier’s memoire also forced a revision in the prevailing notion of functions. The 
word “function” appeared in print for the rst time in articles by Leibniz in 1692 
and 1694, where it was used to designate a geometric object—tangent, subtangent, or 
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normal—connected with a point on a curve. He had introduced the terms “variable” and 
“constant” somewhat earlier. The new terminology was given a broader mathematical 
meaning by Euler; at the very outset, in the fourth paragraph of his in uential Jntroductio 
in analysin infinitorum (1748), we read the following: 


A function of a variable quantity is an analytical expression composed in any way whatever 
from that variable quantity and numbers and constant quantities. 


Euler furnished no explicit de nition of an “analytical expression,” enumerating instead 
the admissible operations by means of which it would be built up (the four usual algebraic 
operations, extraction of roots, raising to powers, along with trigonometric, exponential, 
and logarithmic processes). Further on he writes, “thus az — 4z*, az + bya? — 2?, c’, 
etc., are functions of z.” Euler also instituted the convention of using the letter f and 
parentheses for a function f(z). 

Euler, as well as other mathematicians of the time, maintained that a function should 
be represented over its whole domain by exactly one “analytic expression.” For them, a 
relation f(x) such as 


fx) —x forx <0 
Sa e— 

x for x >0 
would not be considered a genuine function, but as two functions juxtaposed. Underlying 
this view was a geometric image of a “nice” curve, free of jumps and having a tangent 
at each point. Euler’s own view of functionality gradually evolved, for in the preface 
to the Jnstitutiones calculi differentialis he suggested a totally different de nition of a 
function, one connected with a formal expression: 


If, therefore, x denotes a variable quantity, then all quantities which depend upon x in any 
way or are determined by it are called functions of it. 


Nor was Fourier’s conception bound by the constraints of the past. His “arbitrary func- 
tions” could be represented by a graph (curve) whose different parts might have no 
logical connection. He remarked in the Théorie Analytique, 


The function f(x) represents a succession of values or ordinates each of which is arbi- 
trary....We do not suppose these ordinates to be subject to a common law; they succeed 
each other in any manner whatever, and each of them is given as if it were a single quantity. 


Although vague and inexact, this formulation comes close to the modern view that a 
function is a relationship between variables. 

Another half century would pass before this tangle of dif cult questions raised by 
Fourier’s work was adequately answered. In an article for Crelle’s Journal of 1829, 
Dirichlet succeeded in giving a list of suf cient conditions for expansion of a function in 
a Fourier series: If f(x) is bounded in the interval —z < x < z, possesses only a nite 
number of maxima and minima, and is continuous except possibly at a nite number of 
points, then its Fourier series converges to f(x) at each point x of continuity. (Before 
Cauchy brought respectable de nitions to mathematical analysis in the 1820s there was 
no de nition of continuity at all, only a kind of verbal description about different parts 
of a curve “joining together without interruption.”) The same paper provides an example 
of a function that does not satisfy “the Dirichlet conditions,” namely, f(x) = c when 
x is rational and f(x) =d when x is irrational, c # d. This celebrated function is the 
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rst explicit illustration of a function not given by an analytic expression, or several 
such; moreover, there is no value of x for which it is continuous. We are also indebted 
to Dirichlet for formulating (1837) a very broad de nition of function that was almost 
universally accepted for the rest of the nineteenth century: 


y is a function of the variable x, de ned on the interval a < x < b, if to every value of the 
variable x in this interval there corresponds a de nite value of the variable y. Also, it is 
irrelevant in what way this correspondence is established. 


Cauchy’s standard of rigor was immeasurably in advance of his contemporaries— 
except for Gauss, whose insistence on careful deduction was the admiration of all. This 
standard also served as a model for a generation of mathematicians. “He is at present,” 
Abel said of Cauchy, “the only one who knows how mathematics should be treated.” 
Gradually, mathematicians came to realize that the last word in rigor had not been 
said and such verbal phrases of Cauchy’s as “approaches inde nitely,” “in nitely small 
increase,” and “as little as one pleases,” lacked meaningful exactness. The gure who 
towers above all others in the movement to make these ideas still more precise, whose 


name has become a synonym with rigor, is Weierstrass. 


The Father of Modern Analysis, Weierstrass 


Many great mathematicians give evidence of their unusual ability at an early age, but 
there are those who abandon other lines of work to turn to mathematics at a later period. 
Weierstrass is in this latter class; he showed his special aptitude along this line at an age 
greater than that ever reached by Galois, and at which Gauss had already completed the 
Disquisitiones Arithmeticae. Weierstrass was such a striking exception to the common 
notion that a mathematician of rst rank must make his mark early in life that it is worth 
taking a closer look at his career. 

Karl Weierstrass (1815-1897) was hindered in his early years by a domineering 
father, a customs of cer in a salt works, who tried to force his son into an uncongenial 
occupation. The young boy had made a brilliant record in secondary school, winning 
several prizes before graduation. At his father’s insistence, Weierstrass spent the four 
years from 1834 to 1838 at the University of Bonn, trying to master law and public 
nance; this was the usual course of study for one seeking an administrative post in the 
Prussian civil service. More attracted to a wild life of drinking and fencing, he came 
to shun all lectures and left the university without receiving a diploma. Weierstrass 
next enrolled (1839) in an accelerated program at the Academy of Miinster, which was 
near his home, to prepare himself for teaching mathematics at the secondary level. He 
was fortunate to attend the lectures of Christoph Gudermann (1798-1852), his only 
signi cant mathematics teacher, who aroused his interest in the newly created theory 
of elliptic functions of Abel and Jacobi. We are told that at the opening lecture of 
Gudermann’s course on elliptic functions there were 13 students, but for the rest of the 
semester the only one in attendance was Weierstrass. As part of his state examination, 
he was given a mathematical problem to be written and completed in six months’ time. 
Weierstrass, on his own request, was assigned a dif cult task by Gudermann, namely, 
to represent an elliptic function as the quotient of two convergent power series. The 
response, Weierstrass’s rst mathematical paper, was praised by Gudermann as an 
important advance. The article might have changed the course of Weierstrass’s life if it 
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Karl Weierstrass 


(1815-1897) 


(From A Concise History of Mathematics by Dirk 
Struik, 1967, Dover Publications, Inc., N.Y.) 


had appeared in print, but because Miinster awarded only teaching certi cates and not 
doctorates, publication of such papers was not customary there. (The paper remained in 
manuscript until 1894, when it was printed in Weierstrass’s collected works.) 

Weierstrass nally secured his teaching license at the age of 26, and from 1841 
until 1854 presented such subjects as science, writing, and gymnastics to youngsters in 
obscure secondary schools in Prussia. In the “unending dreariness and boredom of these 
years,” with no access to a mathematical library nor any colleague for mathematical 
discussions, he carried on a remarkable study of abelian functions. It is reported that 
when Weierstrass once missed the start of morning classes, the school principal went 
to his home only to nd that Weierstrass had worked the whole night through on his 
research, completely unaware of the approach of dawn. He asked the principal to excuse 
his tardiness, for he was on the verge of a discovery that would surprise the mathematical 
world. These ideas eventually appeared in the program of studies for 1848-1849 of the 
Catholic Gymnasium in Braunsberg, East Prussia, perhaps the last place that one would 
look for an epochmaking memoir. Had Weierstrass’s rst publication caught the eye 
of any professional mathematician, it would indeed have created a furor. As it was, 
the article was totally unintelligible to the students, parents, and teachers who read the 
program. 

Fortunately for mathematics, Weierstrass’s next memoir, Zur Theorie der Abelschen 
Functionen fared better, appearing in Crelle’s Journal for 1854. This research paper 
elicited enormous interest, with the mathematician Liouville calling it “one of those works 
that mark an epoch in science.” By 1855, the year of Gauss’s death, Weierstrass was just 
gaining the recognition his talent deserved. The University of Kénigsberg bestowed an 
honorary doctorate on the once obscure teacher, and the ministry of education granted 
him a year’s leave with pay to pursue his research. 

Germany at this time was not a unied nation but a collection of many small, in- 
dependent states. It had no educational center such as Paris or London; yet the various 
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states took pride in their institutions of higher learning and eagerly competed for great 
scholars. It was therefore not uncommon to nd German professors transferring from 
one chair to another with a higher endowment. Gauss’s death brought about several such 
shifts, thereby opening the way for Weierstrass. When Dirichlet was called to succeed 
Gauss at Gottingen, Dirichlet’s place in Berlin was occupied by his friend Kummer, who 
had been at Breslau. Determined not to return to the provinces of Prussia, Weierstrass ap- 
plied for Kummer’s vacant post at Breslau. Kummer recommended against Weierstrass’s 
application, apparently feeling that the publication of one paper, however noteworthy, 
was not a sufcient guarantee of future productivity. However, when Weierstrass was 
offered a special professorship at any Austrian university of his choice, Kummer changed 
his mind. He secured (1856) a position for Weierstrass at the Industrial Institute at Berlin 
(the forerunner of Berlin Technical University) teaching technical subjects, and at the 
same time an assistant professorship at the University of Berlin. Another 8 years passed 
before, in 1864, he was able to withdraw completely from the Industrial Institute and 
assume a full professorship at the University of Berlin, where he spent the last 40 years 
of his life. What is noteworthy is that Weierstrass did not begin his professional tenure 
at Berlin until an age when most mathematicians cease their creative work. 

Weierstrass, despite his late start, was the world’s greatest analyst during the last 
third of the nineteenth century—the “father of modern analysis.” It is dif cult to date 
his contributions, because he was slow in publication, preferring instead to make his 
ideas rst known to the general mathematical public through his lectures at the Univer- 
sity of Berlin. No one else offered the same subject matter, so these lectures enjoyed 
an increasing reputation and attracted the most gifted students from around the world. 
Weierstrass’s rare mathematical gifts were matched by his resourceful teaching. Each of 
his meticulously prepared lectures was a logically intricate exercise in creative work for 
himself and his students alike. He often took his listeners to the frontiers of research, 
for the content of his course was usually new mathematics in the making. Indifferent to 
fame, he endeared himself to students with the liberality with which he allowed them to 
develop results that he would not take time to write up. The lecture notes and personal 
research of such students as Georg Cantor (1845-1918), Sonya Kovalevsky (1850-1891), 
Gésta Mittag-Lef er (1846-1927), Carl Runge (1856-1927), Max Planck (1858-1947), 
Otto Holder (1859-1937), and David Hilbert (1862-1943) spread the new analysis cre- 
ated by Weierstrass. One of those who attended his lectures in 1872 on the calculus of 
variations, Oskar Bolza (1857-1942), went on to teach at the newly founded University 
of Chicago for 17 years; there he was in uential in building up a mathematics faculty 
of unusual strength. 


Sonya Kovalevsky 


Any discussion of Weierstrass’s life would be incomplete without mentioning his 
close personal relationship with the most talented of all his students, Sonya Kovalevsky. 
Unlike many of her female contemporaries, who were prohibited from developing their 
mathematical talents, Sonya Kovalevsky was fortunate in having the opportunity to attend 
a German university. But it cannot be said that Germany had any real sympathy for 
the higher education of women. Until the turn of the century, German universities had 
admitted no women students except those who came from abroad, and even when this 
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Sonya Kovalevsky 
(1850-1891) 


(The Bettmann Archive.) 


barrier had fallen, women in Prussia still did not have the right to qualify for higher 
degrees. 

Sonya Korvin-Krukovsky (1850-1891) later known as Sonya Kovalevsky, was born 
in Moscow to a family of the minor nobility, her father being an artillery general. 
Although regarded as the greatest woman mathematician before 1900, she did not receive 
a formal mathematical education in her own homeland. In her early childhood she was 
exposed to mathematics in a curious way. During the renovation of the family country 
estate, one of the children’s rooms was temporarily wallpapered with some lithographed 
notes of Ostrogradski’s lectures on differential and integral calculus. In her published 
recollections, A Russian Childhood, Kovalevsky recalled that she had “passed hours 
before that mysterious wall, trying to decipher even a single phrase and to discover the 
order in which the sheets ought to follow each other.” In St. Petersburg, at the age of 
15, when she took more rigorous private lessons under the tutelage of a professor at the 
naval academy, the teacher was astounded with the quickness with which she grasped 
the ideas “just as if I had known them before.” 

Russia’s dismal showing in the Crimean War (1853-1856) had been widely at- 
tributed to its scientic and technological backwardness, a backwardness for which a 
disproportionate emphasis on the classical university education was partly responsible. 
The immediate reaction was another reform of the university system; purchase of foreign 
textbooks was allowed without interference from the censors, students regained their 
traditional right to attend Western universities, and the decree against hiring foreign pro- 
fessors was dropped. Whereas a new university statute of 1863 opened the institutions 
of higher learning to all social classes, women were still barred by law; the lower urban 
classes and peasantry were still kept out by tradition and economics. The very idea of 
higher education for women seemed ludicrous, and there was strong resistance to any 
suggestion that they deserved the same secondary-school training as men. 

One of the few ways for a Russian woman to continue her education was to study in 
a foreign university. It was not unusual at that time for young Russian women to enter 
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into a marriage of convenience in order to gain the freedom to travel abroad, with or 
without the attendance of her husband. Sonya Korvin-Krukovsky followed this pattern; in 
1868 she married Vladimir Kovalevsky, a geology student at Moscow University. In the 
spring of 1869, the Kovalevskys left for Heidelberg University, one of Germany’s oldest 
institutions and then famous for such gures of modern science as Bunsen, Helmholtz, and 
Kirchhoff. Madame Kovalevsky obtained special permission to attend several courses, 
but the teaching of the mathematician Leo Konigsberger, one of Weierstrass’s rst pupils, 
made the strongest impression on her. At the end of 1870, she transferred to Berlin to 
learn from the master himself. (Her husband had already left for Jena to get a doctorate.) 

At the age of 55, the lifelong bachelor Weierstrass met the 20-year-old Russian 
student. Soon impressed by her ability, Weierstrass claimed that Kovalevsky had “the 
gift of intuitive genius” to a degree he had seldom found even among his older and 
more advanced students. But the University of Berlin was far more conservative than 
Heidelberg, and as a woman, Kovalevsky was not allowed to take courses. Weierstrass 
rst tried to get the university senate to admit her to his lectures, and when this was 
refused, he offered to work with her privately. For the next four years, he shared his 
lecture notes, his unpublished manuscripts, and his latest theories on analysis with a 
“refreshingly enthusiastic participant.” By 1874, Sonya Kovalevsky had completed three 
outstanding research papers: On the Theory of Partial Differential Equations, On the Re- 
duction of a Definite Class of Abelian Integrals, and Observations on Laplace’s Research 
on the Form of Saturn’s Rings. The strength of these works, along with Weierstrass’s 
strong recommendation, quali ed her (1874) for a doctorate without examination, and in 
absentia, from the University of Géttingen (which she never actually attended). She had 
petitioned that the oral examination be waived as her German was inadequate. This was 
the rst time that a woman had applied for a higher degree in mathematics, so Weierstrass 
requested that the university be particularly careful to uphold high standards in judging 
her. The degree was awarded summa cum laude. 

In spite of an advanced degree and strong letters of recommendation from 
Weierstrass, Kovalevsky was rebuffed in her attempts to get an academic position 
anywhere in Europe. For this reason, she returned to St. Petersburg. During her absence 
from Russia, the advocates of higher education for women had managed to institute 
a program known as Higher Courses for Women in several universities, Moscow in 
1872 and St. Petersburg in 1878. Kovalevsky’s mathematical training quali ed her to 
teach in the program, but Russian law made it impossible for a woman to occupy a 
university position, even if she met the highest standards of scholarship. For nine years, 
she occupied herself in various nonscienti c pursuits: writing newspaper articles, theater 
reviews, poetry, and even a small novel. Although Weierstrass’s letters kept her in 
touch with recent developments, she seemed on the verge of abandoning mathematics. 
At his suggestion, she read a paper on abelian integrals at a scientic congress in St. 
Petersburg in 1879 and another on the refraction of light through a crystalline medium 
at Odessa in 1883. Then in 1883, Kovalevsky was invited by Gésta Mittag-Lef er, 
another of Weierstrass’s distinguished students, to teach at the newly founded University 
of Stockholm. One newspaper greeted her arrival: 


Today we do not herald the arrival of some vulgar insigni cant prince of noble blood. No, 
the Princess of Science, Madame Kovalevsky has honored our city with her arrival. She is 
the rst woman to lecture in all of Sweden. 
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Though the “princess” was_ rst appointed a privatdozent, the lowest possible rank on the 
university scale, she was soon given a post that carried a salary with it. Mittag-Lef er 
managed to raise private funds to match the small sum the university offered. Over 
the succeeding four years, Kovalevsky taught (in imperfect German) 12 courses on the 
newest and most advanced topics in analysis. 

Sonya Kovalevsky’s prominence as a professional mathematician reached its peak 
in 1888, when she received the famous Prix Bordin from the French Académie des 
Sciences for her paper On the Rotation of a Solid Body About a Fixed Point. Voting 
unanimously in favor of the winning memoir, the selection committee, unaware that the 
winner was a woman since each entry was submitted anonymously, “recognized in this 
work not only the power of an expansive and profound mind, but also a great spirit 
of invention.” Because of the exceptional merit of the work, the monetary value of the 
prize was raised from 3000 to 5000 francs, a considerable sum of money at that time. 
Weierstrass rejoiced at this, writing to Kovalevsky, “Competent judges have now given 
their verdict that my favorite pupil, my ‘weakness,’ is not a frivolous marionette.” When 
word of her great success reached Russia, Kovalevsky nally received recognition in 
her own country. The St. Petersburg Academy of Sciences elected her a corresponding 
member, a purely honoric title, to be sure, but she was the rst woman so honored. 
In 1889, Stockholm appointed her a full professor of mathematics, the of ce she held 
until her untimely death. Less than two years later, at the height of her career, Sonya 
Kovalevsky died of inuenza, which at the time was epidemic. Mittag-Lef er wrote 
of her: 


She came to us from the center of modern science full of faith and enthusiasm for the ideas 
of her great master in Berlin, the venerable old man who has outlived his favorite pupil. 
Her works, all of which belong to the same order of ideas, have shown by new discoveries 
the power of Weierstrass’ system. 


The Axiomatic Movement: Pasch and Hilbert 


Weierstrass often demonstrated the need for a strictly logical approach to mathe- 
matics by constructing counterexamples to plausible and widely held notions. The prime 
instance of this is the case of the continuous, but nowhere differentiable, function. Charac- 
teristic of the rst half of the nineteenth century was the loose way in which geometrical 
and other intuitive ideas were used in analytic proofs. Although Cauchy had clari ed the 
foundations of calculus, nearly all the mathematicians of his era believed erroneously 
that a continuous function, pictured by their spatial intuition as a “smooth” curve, must 
be differentiable at most points. André-Marie Ampere, who is best known for his work 
in electricity, wrote a paper Recherches sur quelques points de la théorie des fonctions 
dérivées in 1806. In this work, he tried to prove (or so it seems, because the paper is 
confusing to read) the false proposition that every continuous function has a derivative 
except possibly for a few isolated places. His contemporaries realized that the demon- 
stration was wrong, but there was little doubt about the correctness of the assertion. 
The mathematical world was profoundly surprised when Weierstrass read a paper to the 
Berlin Academy of Sciences in 1872, presenting an example of a continuous function 
that has a derivative at no point of the real line—or what is the same, a continuous curve 
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having no tangent at any point. Weierstrass’s example, 
lo @) 
y \ a" cos(b"x), 0<a<l,ab>1+3, 
n=0 


is supposed to have been given in his classroom lectures as far back as 1861, although it 
was published, with his assent, for the rst time in 1874 by one of his pupils, Paul du Bois- 
Reymond. An equally rude shock to the geometric preconceptions of what a continuous 
curve ought to be was Giuseppe Peano’s discovery (1890) of a continuous curve that 
passes through every point of a square. Those mathematicians who considered the class of 
continuous functions coextensive with the functions pictured by their geometric intuition 
treated the examples of Weierstrass and Peano as pathological cases, quite outside the eld 
of orthodox mathematics. “I turn away with fear and horror,” wrote Hermite, “from this 
lamentable sore of continuous functions without derivatives.” But the real signi cance 

of the varieties of abnormal behavior that continuous functions can show is that it taught 
mathematicians caution about the deceptive evidence of quasi-geometric proofs. 

The critical reorganization of the foundations of analysis, fostered largely by Cauchy 
in the early 1800s, was continued with notable success in the second half of the century 
by Weierstrass and his followers. It is customary to say that Weierstrass “arithmetized 
analysis” (a phrase Felix Klein coined in 1895), by which is meant that he freed analysis 
from the geometrical reasonings and intuitive understandings so prevalent at the time. 
Among other things, this called for giving a more precise formulation of the idea of 
a limit. Cauchy’s de nition had used the expression “approaches inde nitely,” which 
suggests vague intuitions of continuous motion. With Weierstrass, the now-accepted €-é 
terminology became part of the language of rigorous analysis: 


A function f(x) has a limit ZL at x = xo if for any positive number e, there exists a 5 such 
that | f(x) — L| < € for all x in the deleted interval 0 < |x — xo| < 6. 


The unequivocal language and symbolism—the fearsome “epsilontics” of contemporary 
undergraduates—banished the older vague ideas and made really convincing proofs pos- 
sible. Along with the new de nition, Weierstrass also gave us the standard notation 
lim,-,, to express “the limit as x approaches x9.” In a paper of 1841, he wrote “lim,” 
and in 1854, “lim,—.5 dy = ©.” 

Another cause for concern was the lack of precise meaning in the phrase “real 
number.” Cauchy in his Cours d’analyse had regarded a real number as the limit of 
an in nite sequence of rational numbers. But this involves a serious error in reasoning. 
The limit, if irrational, does not logically exist until irrational numbers are de ned; 
that is, use of the de nition of limit requires the demonstrated existence of the very 
quantity whose de nition is being attempted. Aware of the shortcomings in Cauchy’s 
approach, Weierstrass advocated developing an algebraically self-contained notion of 
a real number that in no way presupposed their existence. The ultimate aim was to 
rest the majestic edice of analysis on the arithmetic of the positive integers alone, to 
“arithmetize analysis.” Weierstrass, Cantor, and Dedekind are credited with having made 
the basic advance in building rigorous theories of the real-number system. However, no 
account given by Weierstrass himself survives except in the lecture notes of his students. 
The Cantor-Dedekind theory of irrational numbers is perhaps a story best avoided, for 
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as one recent writer said, “The irrational number, logically dened, is an intellectual 
monster.” 

The interest in postulational methods generated by Weierstrass’s arithmetization of 
analysis carried over into attempts to give Euclidean geometry a strict axiomatic foun- 
dation. Euclid’s Elements contain the earliest attempt to axiomatize geometry, but it was 
far, very far, from being a perfect work. De nitions were given that did not de ne, 
axioms were implied but not stated, and certain proofs were needlessly complicated. 
One should not, of course, be too hard on Euclid. All writers on elementary geometry 
to almost 1900 used their visual imaginations freely, though usually unconsciously, in 
assuming further facts not guaranteed by Euclid’s axioms. Recognition of these de cien- 
cies nally forced mathematicians to reexamine the foundations of Euclidean geometry. 
The aim of geometers in the late 1800s was to give a set of independent axioms from 
which it would be possible to prove, with the rigor the times demanded, all the fa- 
miliar theorems of Euclid’s geometry. Unlike Euclid, who tried to give a description 
of the basic objects with which the axioms deal, these geometers understood that not 
all terms can be dened; the properties of the unde ned terms are specied solely by 
the axioms. 

The rst rigorous attempt to remove the hidden assumptions that marred Euclid’s 
work was made by Moritz Pasch (1843-1930) in his Vorlesungen Uber Neuere Geome- 
trie (1882). Pasch’s philosophy for choosing the axioms of his treatment of geometry 
was that the initial notions, the so-called nuclear concepts, should have a common basis 
of acceptance in human experience. He emphasized, however, that after the basic ax- 
ioms had been introduced, the logical deduction of the remaining propositions should 
proceed without regard to empirical signi cance. The term point was allowed, but not 
line, because no one had ever observed a complete straight line; instead, the notion of 
line segment was taken as unde ned. Likewise, planar surface, but not plane, was a 
primitive term. Pasch axiomatized the order of points on a line (or the relation of “‘be- 
tweenness”) for the rst time; Euclid never mentioned this notion explicitly but used 
it frequently. Pasch’s groundbreaking ideas were modied and elaborated by a_ our- 
ishing Italian school of geometers whose leaders were Peano, Veronese, Pieri, Padoa, 
Burali-Forti, and Levi-Civita. Guiseppe Peano’s (1858-1932) I Principii di geometria, 
logicamente espositi (1889) is largely a translation of Pasch’s treatise into the notation 
of a symbolic logic that he had invented. The highly complicated symbolism made the 
work dif cult to follow, even the subject of ridicule by some mathematicians. Mario 
Pieri (1860-1904), a pupil of Peano, proposed a novel system of axioms for Euclidean 
geometry based on the two unde ned concepts of point and motion. However, the pos- 
tulational development of the subject that clings closest to the Euclidean tradition, with 
its customary primitive notions of point, line, and plane, was attributable to the great 
German mathematician David Hilbert. Because of the clear simplicity of its axioms, 
stated with a minimum of unfamiliar and often unnecessary symbolism, his treatment 
came to receive the widest acceptance. Hilbert succeeded, as neither Pasch nor Peano 
had been able, in convincing mathematicians of the abstract and purely formal nature of 
geometry. 

David Hilbert (1862-1943), the son of a district judge, was born near K6nigsberg in 
East Prussia. In 1880, he enrolled at the local university, the University of K6nigsberg, 
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where Immanuel Kant had studied and taught. Although K6nigsberg was far removed 
from the center of things in Géttingen and Berlin, it had developed a distinguished 
mathematical tradition in the 1800s, beginning with the activity of Carl Gustav Jacobi 
in the early decades and maintained subsequently by Ferdinand Lindemann. Lindemann 
had proved the long-suspected transcendence of the number z, but accomplished little 
of note thereafter. Under his direction, Hilbert completed his doctoral thesis in 1885. 
Over the next few years, he rose in the academic ranks at Kénigsberg. He quali ed as 
a privatdozent in 1886, was appointed to an assistant professorship in 1892, and in the 
following year advanced to full professor, succeeding Lindemann. Then, on the initiative 
of Felix Klein, he was offered a chair in mathematics at Géttingen. Klein later recalled, 
“My colleagues had suggested at the time that I should call a comfortable young colleague 
for me, but I told them, ‘I shall call the most uncomfortable one.’” Hilbert arrived at 
G6ttingen in 1895, almost exactly one hundred years after the arrival of Gauss, to remain 
there until his of cial retirement in 1930. The world had looked on Gauss as the greatest 
mathematician living in the rst decades of the nineteenth century, the man who kept 
G6ttingen alive with creative impulse. Hilbert was to play a similar role in the new 
century. 

During the winter term of 1898-1899, Hilbert broke with his current interest in the 
theory of algebraic number elds to present a series of lectures on the postulational de- 
velopment of Euclidean geometry. These were edited in a slim volume Grundlagen der 
Geometrie (Foundations of Geometry) published as a memorial address in connection 
with the celebration at the unveiling of the Gauss-Weber monument at Gottingen in June 
1899, the same year in which Pieri’s studies were made available. Within a few months af- 
ter its printing, Hilbert’s 92-page book attracted attention all over the mathematical world. 
A French edition appeared soon after the German publication; and an English rendition 
was brought out in 1902 by E. J. Townsend, who received his doctorate under Hilbert. 
The work was gradually modernized and expanded through a sequence of appendixes 
that appeared in seven German editions in the author’s lifetime. An eleventh edition 
(1972), prepared by Paul Bernays, is twice as long as the original. Legendre’s E/éments 
de géométrie passed through more editions than Hilbert’s Grundlagen der Geometrie, but 
the two books are not comparable. Legendre’s work was a textbook addressed to school- 
boys, whereas Hilbert’s, though widely read by mathematicians, exerted no immediate 
inuence on elementary teaching. Yet next to the Elements itself, the Grundlagen der 
Geometrie has become the second most in uential work written on elementary geometry. 
The axiomatic method as it is understood today was initiated by Pasch, but it was the 
Grundlagen der Geometrie, backed by Hilbert’s great mathematical authority, that was 
most responsible for spreading the axiomatic viewpoint. 

Hilbert began his classic Grundlagen der Geometrie with the words, “Let us consider 
three distinct systems of things.” The “things,” his unde ned terms, are called points, 
lines, and planes; these are connected by three unde ned relations, which he chose to 
indicate by the familiar names incidence (on), order (betweenness), and congruence. The 
21 axioms of his treatment are given in ve groups dealing respectively with incidence, 
order, congruence, parallelism, and continuity. This arrangement illuminates how the 
logical dif culties in Euclid were overcome. For instance, his axioms of congruence 
remedy the defect of superposition—it was postulated that two triangles are congruent if 
two sides and the included angle of one are congruent with two sides and the included 
angle of the other. Hilbert’s second group of axioms built on the earlier work of Pasch 
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by including what is today called Pasch’s postulate: A line that intersects one side of 
a triangle, not at a vertex, must intersect another side of the triangle. The fourth group 
consists of just one axiom, Playfair’s form of the parallel postulate. The number of axioms 
may appear unusually large when compared with certain other treatments, such as that 
given by Oswald Veblen (in 1904), which contains the two primitive notions “point” and 
“between” and 12 axioms; but it is no worse than the axiomatization Pieri devised, with 
its 20 postulates. (Two of Hilbert’s axioms were later discovered to be implied by the 
others, so that in the original formulation, the axioms did not form an independent set.) 
Hilbert was chie y responsible for acceptance of the view that a system of axioms 
can have an intrinsic integrity independent of any physical reality, that is, by being a 
consistent system. He is often quoted as having urged: “It must be possible to replace 
in all geometric statements the words point, line, plane by table, chair, mug.” In other 
words, because one is not concerned with the nature of points, lines, and planes except so 
far as their properties are exhibited in the axioms themselves, one could as well call these 
unde ned_ things by any convenient names. Euclid believed the axioms to be self-evident 
truths derived from experience with actual space, but to Hilbert they were merely the 
ground rules from which one set out to develop the logical consequences. Physical reality 
had no place in his order of ideas; the discovery of non-Euclidean geometry had, after all, 
revealed the folly of relying too heavily on spatial intuition. Hilbert became the champion 
of the formalistic school of thought in which the whole of mathematics is conceived in the 
form of theorems, meticulously symbolized, and deduced from uninterpreted axioms. This 
attitude was summed up by Hilbert’s remark: “Mathematics is a game played according to 
certain rules with meaningless marks on paper.” Of course, not everyone subscribed to this 
position. When Hilbert rst came to the notice of the mathematical world in 1888 through 
his great work on the theory of invariants, Paul Gordan (1837-1912) proclaimed, “This is 
no longer mathematics, it is theology.” Later he conceded that “theology has its merits.” 
As long as Euclid’s geometry was considered a catalog of truths about the phys- 
ical world, the question of its consistency never arose. But once it had been reduced 
to an exercise in formal logic, the absolute necessity arose for a proof of freedom of 
contradiction. That the consistency of the non-Euclidean geometries had been shown to 
depend on the consistency of Euclidean geometry added further impetus to the prob- 
lem. By interpreting a point as an ordered pair of real numbers and a line as a linear 
equation, Hilbert constructed (1904) a model of Euclidean geometry within the domain 
of arithmetic; thus, any contradiction drawn from the axioms of the former would be 
recognizable in the latter. The logical admissibility of Euclidean geometry came down to 
granting the consistency of the arithmetic of the real numbers, which still left one with 
the basic problem of demonstrating the latter. The questions had been simply shifted 
from one branch of mathematics to another, but even this was a substantial achievement. 
With continuing optimism, Hilbert and his disciples set about trying to prove the con- 
sistency of arithmetic itself. Efforts to solve this problem revealed the astonishingly com- 
plex logical structure of mathematics. In 1931, a 25-year-old mathematician at the Univer- 
sity of Vienna, Kurt Gédel, published a paper that left little hope that a system of axioms 
wide enough to encompass arithmetic could be supported by a proof of consistency. Bear- 
ing the forbidding title Uber Formal Unentscheidbare Satze der Principia Mathematica 
und Verwandter Systeme (On Formally Undecidable Propositions of Principia Mathemat- 
ica and Related Systems), Gédel’s paper is a milestone in the history of modern logic 
and led to an invitation for him to join the Institute for Advanced Study in Princeton. 
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Gédel’s main conclusions were twofold. In the rst place, he showed that the formal- 
ization of logic contained in the famous Principia Mathematica (three volumes, 1910— 
1913) of Whitehead and Russell had inherent limitations when applied to suf ciently rich 
deductive systems. Speci cally, in any system with “enough” axioms—such as Hilbert’s 
treatment of Euclidean geometry or Peano’s axiomatization of arithmetic—there are cer- 
tain statements that can never be proved or refuted by using only the methods and con- 
cepts of that system; that is, undecidable propositions exist. It is natural to ask whether 
this defect could be remedied by adding further assumptions that would allow one to 
derive previously undecidable propositions. The answer is no, for Gédel also proved that 
new undecidable propositions could always be constructed in the enlarged system. In 
short, there is no system in which all mathematical questions expressible in the system 
can be decided on the basis of its axioms. This brings us to Gédel’s second surprising 
conclusion, namely, the statement that the system is consistent is one of the undecidable 
propositions. Consistency can never be established by operating within the system in 
question; it must involve essentially new methods that are incapable of formalization 
in the system. As Hermann Weyl (1885-1955), a leading mathematician of the recent 
era, so aptly put it, “If the game of mathematics is actually consistent, then the formula 
of consistency cannot be proved within this game.” Gédel’s incompleteness theorems 
dealt a mortal blow to the nal objective of Hilbert’s career, a formalized version of all 
classical mathematics. 

Solving specic problems has always been important in determining the lines of 
development in mathematics. The calculus of variations, for instance, owes its origin 
to John Bernoulli’s determination of the “curve of quickest descent,” and the attempts 
to solve Fermat’s Last Theorem extended the boundaries of number theory. Hilbert’s 
conviction that great individual problems were the lifeblood of mathematics (“If one 
seeks methods without any de nite problems in mind, his search is then mostly in vain.”’) 
led him to put forth a list of problems on which he believed mathematicians should 
concentrate their efforts. The rst International Congress of Mathematicians met at Zurich 
in 1897, and the second was held at Paris in 1900. At the Paris congress, Hilbert was 
invited to give one of the leading addresses. With the new century stretching before him, 
Hilbert felt it appropriate to review the basic trends of mathematical research at the end 
of the nineteenth century and to indicate what he thought the future might hold. He did 
this by formulating 23 unsolved problems, extending over all elds, that he felt should 
occupy the attention of mathematicians in the next era. (The second of these was to 
establish that the axioms of arithmetic are consistent.) Hilbert’s prestige induced many 
followers to consider the problems he had listed, and less than a year after the Paris 
congress two were solved by his own pupils. The list became something of a chart by 
which mathematicians measured their progress in the new century. “A mathematician,” 
wrote Hermann Weyl 40 years later, “who had solved one of them thereby passed on to 
the honors class of the mathematical community.” 

Lacking time, Hilbert was able to describe for the congress only 10 of the 23 
problems “behind which the future lies hidden.” Hilbert spoke in German, but a 
French summary was provided for the audience. The entire text of his now-famous 
address was published several months later by the Géttingen Academy under the ti- 
tle “Mathematische Probleme.” Mary Newson (formerly Winston) prepared an English 
translation to appear in the 1902 issue of the Bulletin of the American Mathematical 
Society. 
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11.3. Problems 


1. The fth volume of d’Alembert’s eight-volume 
Opuscles Mathematiques (1761) contains the “Ratio 
Test for Convergence”: 


A series of positive terms s; + s2 + 53 +::-, 


; , Sntl : 
with r = lim,_... ——, converges if r < 1 and 
S 


n 
diverges ifr > 1. If r = 1, no conclusion can 


be drawn. 


Use this test to examine the convergence of these 
series: 


1 2 3 4 
(a) atmptptmat 
1 1 1 1 


(b) ag ae et 


3 3 
ps ete 
1! 2! 3! 4! 
ptt tat 


(d) 


1 n 
[Hint: In part (d), recall that e = lim (: + =| | 
n 
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2. Obtain the Fourier series of the function f(x) of 
period 27 dened by 


fx) 0, if -a7<x<0O 
xy = 
‘ x, fO0<x<az7 
3. (a) Verify that the square-wave function f(x), 
where 
-=, if —2z7 <x <0 
fo=} _ 
l > if0<x<z 


f(x + 2m) = f(x) 
has the Fourier series representation 


sin3x  sin5x 


3 5 


(b) Justify Fourier’s assertion that the expression 
for f(x) in part (a) “contains the series of 
Leibniz,” that is, the series 


an ee on 
4 3 5 #7 ‘ 


f(x) =2 (sins + 


4. (a) Fourier’s Théorie Analytique de la Chaleur 
treats the periodic function f(x) = |x|, 


8. 
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—m <x <a. Establish that its Fourier series is 
given by 
4 


f@= 


Na 


cos3x  cos5x 
cos x + a a NRE 


(b) From the series in (a) obtain the in nite sum of 
reciprocals of odd squares 


x? 1 1 1 


Prove that the sequence s,, 52, 53,..., dened by 


eee Bde ee Nee n>1 

"10 100 1000 10" ~ 
converges to 1; hence, we may write 0.9999... = 1. 
(Hint: First con rm that 


Euler falsely claimed that the divergent series 
log2 — log3 + log4 — log5+--- 


had a sum ; log +. Show that he may have been led 
to conclude this by rst taking the square root of 
Wallis’s formula 

zw 22 4 4 6 6 


2°1335 5 7 
and then using logarithms. 


If Mercator’s series 


ee 
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is rearranged as 


show that the new series has sum + log 2. This 
indicates that operations for nite sums are not 
necessarily valid for in nite sums. 


Employ the series for log 2 to show that 


1 1 1 
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9. The sum of the convergent series 
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(b) Use the Cauchy convergence criterion to show 


1 1 l that the sequence 5), 52,53, ... given by 
nm=atatataotts lone 
1 5 2° 22° 32° a ne 
+ tt) on Anse converges to a limit. [Hint: For m > n, 
can be found by considering the array Gees oto ! a l 
oe n+1 m(m — 1 
1+ 5 +4 44 2+ ¢ 4 a : 
1 1 1 1 
g 0° GF gr ag aoe ~| 
I 1 1 nom 
g ‘hog or ae oF 
3 t 6 + 11. Using Weierstrass’s de nition of limit, verify the 


(a) First add the columns of the array and then add 
these results to obtain the series. 

(b) Now add the rows of the array and then add 
these results to obtain the sum. 


following: 
(a) The function f(x), dened by 
x if x is rational 
f(x) = 


0 if x is irrational, 


has a limit at the point x = 0. 


10. (a) Prove that the sequence s), 57, 53,..., where (b) The function f(x), de ned by 
fa oe aces, n>1, 1 if x is rational 
2) 3 n 


does not satisfy the Cauchy convergence 
criterion. [Hint: Deduce that sy, —s, > 4.] 


0 if x is irrational, 


reo= | 


has no limit at any value of x. 


11.4 Arithmetic Generalized 


By the early years of the nineteenth century, an exaggerated 
adulation of Newtonian methods had led English mathemat- 
ics to its lowest point; very little work of any great merit was 
being accomplished. With interests chie y restricted to theo- 
retical astronomy and applications of mathematics in physics, 
English mathematicians seemed completely unaware of the 
brilliant advances in analysis made by their Continental counterparts. It was said with a 
certain smugness that a student of French mathematics “may scarcely know a wheelbar- 
row from a steam engine.” The stimulus for reestablishment of mathematical ties with the 
rest of Europe nally came from a group of very talented undergraduates at Cambridge 
University: Charles Babbage, George Peacock, and John Herschel. 

Charles Babbage (1791-1871), the only son of a wealthy banker, entered Cambridge 
in 1810. Prior to starting his studies there he had read extensively in a number of 
mathematical texts including S. F. Lacroix’s Traité Elémentaire du Calcul Différentiel 
et du Calcul Intégral, probably the most modern treatise of its day. Babbage quickly 
discovered that he knew more about calculus than did his instructors, who were unable 
to answer his questions. His reading of the 7raité convinced him that England, in its 
adherence to uxional notation and its emphasis on geometric arguments, was out of 
touch with the Continental analysts. To “awaken the Mathematicians from their dogmatic 
slumber,” he joined with a dozen like-minded students—among them George Peacock 
and John Herschel—to form (1812) the Analytical Society. Its aim was to raise the level 
of mathematical instruction at Cambridge, starting with the replacement of Newton’s dot- 
notation for the calculus (regarded in England as sacrosanct) by Leibniz’s more elegant 


Babbage and the 
Analytical Engine 
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d-notation. Other than arousing a certain opposition and ridicule from the faculty, little 
was accomplished by way of change in the outworn curriculum. 

Perhaps aware that he could not compete successfully with his peers, Babbage de- 
clined in 1813 to take the Tripos, the mathematics honors examination. Herschel came 
out First Wrangler (best candidate), and Peacock, Second Wrangler. After leaving Cam- 
bridge, the three friends took their ambitious campaign further by publishing (1816) an 
English translation of Lacroix’s Traité. They followed it with a companion volume of 
problems with solutions, all in Leibniz’s notation. The joint work did a great deal to 
introduce the previously inaccessible Continental methods of analysis into the lecture 
rooms of British universities. Considering how long England’s intellectual isolation had 
dragged on, it is surprising how quickly Newton’s dot-notation was displaced. By 1820, 
the Leibnizian differential notation was well established on the Tripos examination, and 
by 1830 the calculus textbooks had abandoned their exclusive use of uxional symbol- 
ism. The original innovators of mathematical reform went on to achieve renown, but 
each in a different eld: Peacock as professor of mathematics at Cambridge, Herschel as 
an astronomer, and Babbage for his efforts to construct a “calculating engine.” 

On returning to London, Babbage devoted his youthful energies to a wide range of 
activities. He contributed two long papers on the calculus of functions to the Philosophical 
Transactions of the Royal Society (1815, 1816); and he was elected a Fellow of the Society 
at the age of 25. He found time in 1827 to publish a set of logarithmic tables, considered 
the most accurate of their day. To ensure their readability, Babbage had the same page 
printed on 151 different tints of paper, each in 10 different colors of ink; it was nally 
decided to use a black ink on rather bright yellow paper. During this same period he 
secured the Lucasian Chair of Mathematics, holding the position for 11 years although 
he seems never to have resided in Cambridge nor given a lecture. 

English science was still largely an amateur activity supported through private pa- 
tronage. Babbage was instrumental in the founding of a number of specialist societies, 
such as the Royal Astronomical Society (1820), the British Association for the Advance- 
ment of Science (1831), and the Royal Statistical Society (1834). He also created quite 
a stir by writing the highly polemical Reflections on the Decline of Science in England, 
published in 1830. In it he attacked the ruling clique of the established Royal Society 
for its insuf cient support of the new scientic ideas required in an industrial age, and 
called for “some direct interference by the government.” Another charge was that real 
scientists were but a small minority in the society, many of whose members were there 
merely through some aristocratic connection. Moreover, of the 760 papers appearing in 
the Philosophical Transactions between 1800 and 1830, only 44 were mathematical. 

None of the mentioned accomplishments eclipse Babbage’s reputation, as seen from 
our century, as the pioneer of the modern computer. Contemporary opinion was not so 
kind. The young Babbage was regarded as an eccentric genius, and later in his life, he 
was seen as an irascible crank and possibly somewhat deranged. In fact, the personal 
disasters he suffered in 1827 led Babbage to a breakdown; his father and two of his 
children died that year, and he lost his wife in childbirth. 

According to his own testimony, he conceived the idea that mathematical compu- 
tations could be mechanized while still a student. The story is that Babbage was in 
the rooms of the Analytical Society, staring at a table of logarithms, when a member 
asked, “What are you dreaming about?” He replied, “I think that all these tables might 
be calculated by machinery.” 
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Babbage started to build his Difference Engine, as he called it, in 1820. The machine 
was designed both to carry out calculations automatically and to prevent errors of hand- 
transcription by printing out numerical tables of various kinds. Two years later Babbage 
was able to demonstrate a small, hand-cranked model capable of handling six- gure 
numbers. The Royal Astronomical Society was so impressed that it awarded its rst 
Gold Medal to him. Aided by a government grant of £1500, he then began work on 
a much-enlarged Difference Engine that could treat 20 places of gures. But Babbage 
greatly underestimated the dif culties of the task and the expenses involved: current 
machine-tool technology could not produce components with the precision required. As 
he struggled with constant redesigns, he began to imagine a much more elaborate, steam- 
powered calculating machine, which he eventually called the Analytical Engine. All work 
was brought to a halt in 1833, while further grants were sought from the government. 
Incredibly, the bureaucrats delayed nine years before informing Babbage that they had 
decided to abandon nancial support. The nal accounting determined that the treasury 
had invested £17,000 in the project; Babbage is said to have spent an additional £20,000 
of his own fortune. The partially completed machine with its engineering drawings was 
given to the Science Museum in London. The rst full-sized Difference Engine ever 
built was unveiled by the museum in 1991, in time for the two-hundredth anniversary 
of Babbage’s birth: the massive contrivance weighed three tons and consisted of four 
thousand individual parts. 

Shortly after the 1833 collapse of the 10-year project to construct the Difference 
Engine, Babbage embarked on the design of his Analytical Engine. Where the earlier 
machine produced tables by repeated additions, he now envisioned a device that would 
mechanize the four basic mathematical operations, and hence be able to carry out any 
calculation. The operating instructions were to be fed into the machine in coded form 
on punched cards. (Punched cards were not Babbage’s invention; around 1800, Joseph 
Jacquard had developed an automatic loom that used a set of 20,000 such cards to 
weave predetermined patterns.) The Analytical Engine was conceived on a grand scale: 
it would store 1000 numbers of 50 digits each in its memory, and when additional values 
were required for a computation in progress, a bell would signal the attendant. Although 
partial assemblies were built, no working version of the Analytical Engine was ever 
completed. Only the government had the resources for such an ambitious scheme; and 
having committed public funds once with nothing to show for them, of cials were not 
inclined to repeat the attempt. 

Babbage himself left no complete description of the workings of the Analytical 
Engine. However, on a trip to Turin in 1840, he addressed a group of interested scientists 
and engineers. A young engineering of cer in the audience, L. F. Menabrea—later to 
become prime minister of a uni ed Italy—was inspired to write a general account of the 
machine. At Babbage’s suggestion, the paper was translated from French into English 
by his close friend Ada Lovelace, the mathematically gifted daughter of the poet Byron. 
The translation was complemented by such detailed explanatory notes that the published 
version was three times longer than Menabrea’s original article. Because in the mid- 
nineteenth century it was considered undignied for a woman to sign any scientic 
work, Lovelace contented herself with a modest initialing “A. A. L.” at the end of the 
notes. She was only 27 years old when the Sketch of the Analytic Engine appeared (1843); 
it provides a wealth of detail as to exactly how the machine was to function. 
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The reforms advocated by Babbage’s Analytical Society not so much modernized 
mathematical instruction in England as opened it up. Contemporary Continental ideas 
were now readily available, along with traditional material from the Elements and the 
Principia. One result was that pure mathematics experienced a modest comeback during 
the 1830s. Although analysis had suffered a decline in the post-Newton years, the neglect 
of algebra had been even more noticeable. The rising generation of mathematicians— 
George Peacock, Augustus De Morgan, George Boole, William Rowan Hamilton, and 
Arthur Cayley—sought a better understanding of the logical foundations of arithmetical 
algebra and the abstract nature of its operations. 


Peacock’s Treatise on Algebra 


Babbage’s old friend George Peacock (1791-1858) is usually viewed as breaking the 
ground for abstract algebra as we know it today. After nishing Second Wrangler behind 
Herschel, Peacock took Holy Orders—a precondition for holding a fellowship—and rose 
through Cambridge’s faculty ranks to become professor of geometry and astronomy in 
1837. He was appointed dean of Ely Cathedral two years later, and remained there the 
last 20 years of his life. Like Babbage, Peacock retained the sinecure of a university 
professorship, much to the annoyance of his academic colleagues. 

Peacock is best known for his textbook, A Treatise on Algebra, an early precursor of 
the formalistic approach algebra would take on later in the century. This minor milestone 
was published in 1830 and substantially ampli ed into a two-volume work in 1842-1845. 
It appears to have been written in response to the continuing objection to the validity 
of negative numbers within common algebra. Critics argued that the lack of a precise 
de nition rendered the symbol —b, where 4 is a positive integer, logically meaningless 
as a mathematical entity; they rejected as nonsensical attempts to legitimize negatives 
by analogy with debts and credits. But the crusade to banish these “quantities less than 
nothing” would also mean discarding imaginary numbers /—5, as well as negative and 
imaginary roots of equations. 

Peacock sought to free algebra from its arithmetic roots by distinguishing between 
“arithmetic algebra” and “symbolic algebra.” The former dealt with letters and symbols 
that represented arithmetic quantities, positive integers, and the ordinary operations 
to which they are submitted. In the latter purely formal system, the symbols are 
unconstrained, independent of any particular interpretation, and so could stand for 
negatives or imaginaries. Peacock proclaimed symbolic algebra to be “essentially a 
science of symbols and their combinations, constructed upon its own rules,” which ought 
to be approached as a logically deductive system in the manner of Euclid’s Elements. 
This attempt to elevate symbolic algebra to the standing of geometry earned Peacock 
the title of “the Euclid of Algebra.” Others were more wary of the symbolic treatment 
of algebra. To Augustus De Morgan, it appeared “like symbols bewitched and running 
around the world in search of a meaning.” 

To ensure in advance the applicability of his twofold system, Peacock chose to 
restrict the operations of symbolic algebra to those of arithmetic. As he explained it, 
“it [arithmetic algebra] necessarily suggests its principles, or its rules of combination.” 
The foundation of his “science of symbols” rested on what he called the Principle of 
Permanence of Equivalent Forms: 
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Whatever form is algebraically equivalent to another, when expressed in general symbols, 
must continue to be equivalent whatever the symbols denote. 


This fuzzy principle decreed that identities valid for positive integral values would also 
be applicable to nonintegral values. Typical is the expression 


a’ —b* =(a+b)(a— b) 


which was on solid ground in arithmetic algebra as long as a > b. Peacock’s principle 
allowed him to assert that it held for all integers without any restriction on the relative 
sizes of a and b. In a similar fashion, the “rule of indices” aa" = a"'" for positive 
integers m and n was rendered an equivalent form in symbolic algebra, and therefore ex- 
tendable to negative and fractional exponents. While a'/*, for instance, had no meaning in 
arithmetic algebra, it follows that a!/? - a!/* = a!/?+!/? = aq in symbolic algebra, whence 
a'/? can be interpreted to mean ./a. Furthermore, it was assumed to have meaning for 
all sorts of values of the base a outside the scope of ordinary arithmetic. 

Because Peacock’s advance in the direction of abstract algebra never assumed the 
mathematical freedom to depart from the rules of traditional arithmetic, it failed to realize 
its potential usefulness; for, in generalizing only symbols and not operations, it could 
not have described an algebraic system that lacked a commutative multiplication. His 
Treatise did, however, succeed in bringing the logical foundations of algebra to the 
forefront of English mathematical interest. One vexing problem that still remained was 
that of addressing the “mysterious terror” of imaginary numbers. 


The Representation of Complex Numbers 


As we saw in Section 7.3, Italian mathematicians of the sixteenth century intro- 
duced expressions for the square roots of negative numbers to satisfy the demand that 
all quadratic and cubic equations have solutions. But they displayed a skeptical attitude 
toward such expressions, referring to the mysterious quantities a + /—b as “impossi- 
ble” or “nonexistent” numbers. In 1637, when Descartes published his Géomeétrie, he 
contributed the term “imaginary” as a name. We read there: 


Neither the true nor false [negative] roots are always real, sometimes they are imaginary. 


The realization that the use of these new numbers enabled a polynomial of degree n to 
have n roots led to their reluctant acceptance. During the eighteenth century, imaginary 
numbers were used extensively in higher analysis, especially by Euler, because they 
produced concrete results. It was assumed not only that imaginaries exist for the ordinary 
purposes of analysis, but that they obey the same rules of algebraic operation as the 
familiar real numbers. Euler was the rst to employ the now-standard notation 7 for the 
imaginary unit /—1; in a memoir, De Formulis Differentialibus Angularibus, presented 
to the St. Petersburg Academy in 1777, he wrote: 


In the following, I shall denote the expression ./—1 by i so that 7i = —1. 


Before Euler, the symbol /—1, as distinct from ./—a, seldom if ever occurred. 
Although complex numbers (that is, numbers of the form a + bi with a and 5 real) 
were being admitted in formal calculations on an equal footing with real numbers, doubts 
concerning their precise meaning and nature continued to plague mathematicians. Evi- 
dence of Euler’s concern with their vague status appears in his celebrated Algebra (1770), 
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where he remarks that “such numbers, which by their nature are impossible, are ordinar- 
ily called imaginary or fanciful numbers because they exist only in the imagination.” It 
would be almost three centuries after their introduction before an adequate theory would 
be available to interpret them properly. In the absence of a full understanding of these 
new numbers, progress in justifying their logical foundation moved along two lines: one 
approach sought to anchor the complex numbers in a geometric interpretation, whereas 
the other called for expanding the number concept to a wider “eld of arithmetic” that 
would embrace them. 

John Wallis seems to have been the rst to attempt, although unsuccessfully, any 
graphic representation of the complex numbers. In his A/gebra of 1685, he suggested that, 
because ./—bc is the mean proportion between +5 and —c, the geometric interpretation 
of ./—bc could be obtained by applying the Euclidean mean-construction to two directed 
line segments representing +b and —c. But after touching on this notion, he did nothing 
of consequence with it. 

The geometric interpretation of a complex number as a point in the plane is a simple 
idea, but it took a long time to break through. When it nally came, it occurred at 
nearly the same time to three persons who had no connection with or knowledge of each 
other: a Norwegian surveyor and cartographer Caspar Wessel (1745-1818), a French- 
Swiss bookkeeper Jean Robert Argand (1768-1822), and that greatest of all German 
mathematicians, Carl Friedrich Gauss (1777-1855). 

Caspar Wessel’s fame is based on his only mathematical paper, one in which he es- 
tablished his priority in publication of the geometric representation of complex numbers. 
In 1797, he presented his ideas on the subject before the Royal Academy of Sciences 
of Denmark; the paper, written in Danish, was published two years later in the Philo- 
sophical Transactions of the academy. It speaks well for the academy that the members 
received Wessel’s work sympathetically, for he was neither one of its members nor was 
he considered a mathematician. Unfortunately, his account appeared in a journal that few 
European scholars were likely to read. It passed unnoticed for a century until, on the 
one-hundredth anniversary of its publication, the academy issued a French translation, 
Essai sur la représentation analytique de la direction. 

In the opening paragraph of this paper, Wessel indicated the objective of his study: 


This present attempt deals with the question, how we may represent direction analytically; 
that is, how shall we express right lines [line segments] so that in a single equation involving 
one unknown line and others known, both the length and direction of the unknown line may 
be expressed. 


His rst step was to give a de nition of addition of line segments (vectors), placing the 
initial point of one at the terminal point of another: “Two right lines are added if we 
write them in such a way that the second line begins where the rst one ends, and then 
pass a right line from the rst to the last point to the united lines; this line is the sum of 
the united lines.” He observed that for his addition, “the order in which these lines are 
taken is immaterial;” that is, the commutative law holds. 

Next, Wessel turned to the multiplication of line segments. His approach involved 
setting up two mutually perpendicular coordinate axes, with +1 denoting a unit in one 
direction and +e a unit perpendicular to it: 


Let +1 designate the positive rectilinear unit and +e¢ a certain other unit perpendicular 
to the positive unit and having the same origin; then the direction angle of +1 will be 
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equal to 0°, that of —1 to 180°, that of +e to 90°, and that of —e to —90° or 270°. 
By the rule that the direction angle of the product shall equal the sum of the angles of 
the factors, we have: (+1)(+1) = +1; (41)(-1) = -1; (DCE) = +1; 41)(+e) = +e; 
(+1)(-e)=-6; (-I)(-e)=+e; (+e)\Ge)=-1; (Fe)(-e)=4+1; (-e)(-e) = -1. 
From this it is seen that € is equal to /—I, and the divergence of the product is deter- 
mined such that not any of the common rules of operation are contravened. 


Thus, only in the course of computing the multiplication table for the four different units 
+1, —1, +e, —e€ does Wessel indicate that « = /—1. He stated that any line segment 
(Wessel called these “indirect lines”) can be represented by the expression a + be, and 
derived the rule 


(a + be)(c + de) = (ac — bd) + (ad + be)e 


for their multiplication. Because Wessel’s treatment anticipated not only the notion of a 
vector space but also an algebra, it was unfortunate for mathematics that it lay buried 
for a century. 


Jean Robert Argand’s contribution, Essai sur une maniére de représenter les quantités 
imaginaire dans les constructions géométriques, fared somewhat better than the paper of 
Wessel. It was privately printed in 1806, in a small edition that lacked the author’s 
name on the title page. The work might soon have been forgotten except for a peculiar 
chain of events in 1813 that rescued it from oblivion. Argand had shown his treatise 
to Adrien-Marie Legendre before its publication, and Legendre discussed the interesting 
deliberations of this unknown mathematician in a letter to the brother of J. F. Frangais. 
Francais saw the letter, and was so taken with the notions in it that he developed them 
further in a publication (1813) in the journal Annales de mathématiques. In the nal 
paragraph, he pointed out that he had taken some of his concepts from Legendre’s letter 
and expressed the hope that the anonymous “rst author of these ideas” would identify 
himself and publish his results. Hearing of Frangais’s paper, Argand responded with an 
article in the next issue of the Annales in which he summarized the main points of 
his original work. But, despite its great merit, this publication was without signi cant 
in uence. 

Wessel and Argand were little known; thus it was the authority of Carl Gauss that 
brought about the general acceptance of the interpretation that his two predecessors had 
failed to put across. Gauss seems to have been in possession of a geometric theory of 
complex numbers around the turn of the century. In his (1799) doctoral dissertation on 
the fundamental theorem of algebra, he employed the idea without explicit mention; in 
a letter of 1811 to Bessel, it is clearly outlined; and nally in 1831, in a commentary 
on his paper Theoria Residuorum Biquadraticorum it is publicly described. The novelty 
is that Gauss gives the representation of complex numbers as points in the plane, rather 
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than as directed line segments as did Wessel and Argand. That is, he replaced the number 
a+ bi by the point (a, b). With this approach, nothing remained of the mystic avor that 
was so long attached to these numbers. Gauss af rms: 


That this subject [of imaginary magnitudes] has hitherto been considered from the wrong 
point of view and surrounded by a mysterious obscurity, is to be attributed largely to an 
ill-adapted notation. If for instance, +1, —1, /—I had been called direct, inverse, and lateral 
units, instead of positive, negative, and imaginary (or even impossible) such an obscurity 
would have been out of the question. 


He added the opinion that his presentation “puts the true metaphysics of imaginary 
numbers in a new light,” and that all the difculties would now disappear. Just as the 
real numbers can be interpreted as representing a line, so the complex numbers can 
represent a plane. Gauss also introduced the technical term “complex number” for the 
quantity a + bi, as opposed to the phrase “imaginary number,” which he thought imputed 
some dark mystery to these numbers. 

As a result of these endeavors, we nd French mathematicians referring to the 
“Argand diagram” in texts, whereas Germans speak of the “Gaussian plane.” The Nor- 
wegians, with becoming modesty, refrain from such patriotics. 


Hamilton’s Discovery of Quaternions 


The climax of the attempt to establish the theory of complex numbers on a rm 
mathematical footing can be found in the work of the Irish scientist, William Rowan 
Hamilton (1805-1865). 

Hamilton was a child prodigy whose maturity ful Iled all that his early precocity 
promised. The fourth of nine children of a practicing attorney, he was born in Dublin, 
Ireland, and except for short visits elsewhere spent his whole life in that same city. 
Hamilton’s preliminary education, which was carried on at home, was mainly in lan- 
guages, classics, and mathematics. The young Hamilton was pro cient in Oriental as 
well as European tongues. By the time he was 7, he could read Latin, Greek, Hebrew, 
French, and Italian; at the age of 13 he had a working knowledge of as many languages 
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as the years he had lived. Hamilton’s artistic inclination was just as strong; often he said 
that, although he made his living as a mathematician, he was a poet at heart. 

The young Hamilton’s thoughts were turned to mathematics by a meeting in 1818 
with Zerah Colburn, an American youngster who gave a demonstration in Dublin as 
a “lightning calculator.” The meeting was in the nature of a public test of arithmetic 
skills, and Hamilton came out second best, which only induced this competitive youth to 
begin a furious study of the original texts of Euclid (in Greek), Newton (in Latin), and 
Laplace (in French). At 17 he detected an error in Laplace’s Mécanique céleste, which 
was communicated to the president of the Royal Irish Academy. After reading Hamilton’s 
article on the subject the president is supposed to have remarked, “This young man, I do 
not say will be, but is, the rst mathematician of his age.” 

In 1823, Hamilton took the entrance examination for Trinity College, Dublin, and 
came out rst ina _ eld of one hundred candidates. Once there, his progress continued 
to be brilliant. He achieved the previously unheard-of distinction of winning the highest 
possible marks in both mathematics and English verse. In 1827, he presented to the 
Royal Irish Academy his “Theory of Systems of Rays,” a long and technical treatise 
on geometric optics. This paper led to his appointment (while still an undergraduate) as 
professor of astronomy at Trinity College; it was understood that, except for 12 annual 
lectures, Hamilton would be left free to pursue his own lines of interest. His professorship 
automatically included the titles of Astronomer Royal of Ireland and Director of the 
Dunsink Observatory, positions that he retained until his death. Hamilton devoted himself 
entirely to theoretical studies and did almost nothing as a practical astronomer, perhaps 
because the research equipment was poor and inadequate and he himself lacked technical 
training. He was knighted in 1835 as an honor for his work in optics, particularly the 
result that light refracts in certain biaxial crystals in a conical con guration of rays. 

Although such ideas as negative and imaginary numbers appeared essential for alge- 
bra, Hamilton could not reconcile himself to the interpretations set forth in his day. Until 
these could be adequately de ned, algebra remained for him “obscure and doubtful.” 
Thus, in the early 1830s, Hamilton struggled to clarify the shaky logical foundations of 
the subject, hoping to create “a SCIENCE properly so called: strict, pure and indepen- 
dent; deduced by valid reasonings from its own intuitive principles.” In uenced largely 
by Kant’s Critique of Pure Reason, he concluded that the mental intuition of time is the 
rudiment from which such a science may be constructed. Hamilton maintained that since 
geometry is a science of space, and since time and space are “pure sensuous forms of 
intuition,” algebra must be a science about time. 

Hamilton’s Theory of Conjugate Functions, or Algebraic Couples: With a Preliminary 
Essay on Algebra as the Science of Pure Time is his attempt to found algebra on a set 
of axioms based on “order and continuous progression, or, as it might be called, PURE 
TIME.” Although points of this work had been read to the Royal Irish Academy as early 
as 1833, the entire treatise was rst published in the volume of the academy’s transactions 
for 1837. The paper is divided into three parts: 5 pages of General Introductory Remarks, 
the 95-page Essay on Algebra as the Science of Pure Time, and the 29-page Theory of 
Conjugate Functions, or Algebraic Couples. 

The middle section, the Essay, is one of the earliest efforts to list systematically the 
properties of the real number system. Starting with the intuitive notion of order in time, 
Hamilton builds up the natural numbers through a sequence of equal time steps taken 
from an arbitrarily chosen zero moment. This approach allows him to dene negative 
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numbers by a temporal opposite direction, steps backward in time. After obtaining the 
laws of rational numbers, his assumption of “continuous progression” from moment to 
moment in time permits a natural extension to properties of irrationals. The metaphysical 
efuent accompanying Hamilton’s presentation discouraged many from reading it, and 
it was to have little in uence on the arithmetization of analysis done in Germany during 
the last half of the century. 

In the Algebraic Couples portion of his paper, Hamilton develops the complex num- 
bers in terms of ordered pairs of real numbers in almost the same way it is done in 
modern texts. He begins by taking time steps to form, as he called it, a number couple. 
Considering possible ways of combining these couples, he arrives at the rules 


(b1, b2) + (a1, a2) = (b1 + a1, bz + aa), 
(by, bz) « (a1, 42) = (bya, — byap, bya, + yap), 


along with the inverse operations of subtraction and division. In introducing these de ni- 
tions, Hamilton felt it necessary to protect himself from the possible objection that they 
are wholly arbitrary, pointing out: 


Were these de ni tions even altogether arbitrary they would at least not contradict each other, 
nor the earlier principles of Algebra. It would be possible to draw legitimate conclusions 
by rigorous mathematical reasoning from premises thus arbitrarily assumed. But persons 
who have read with attention the foregoing remarks of this theory, and have compared them 
with the Preliminary Essay, will see that these de nitions are really not arbitrarily chosen. 
Though others might have been assumed, no others would be equally proper. 


Hamilton then goes on to demonstrate that addition and multiplication are commutative, 
and that multiplication distributes over addition. He misses the associative law, probably 
because it did not occur to him that there might be an algebraic system in which it would 
not hold. 

Hamilton believed that imaginary numbers had no real meaning. They could not be 
properly de ned and therefore had to be excluded from ordinary algebra. The advantage 
of his number couples was that they provided a means of expressing complex numbers 
that avoided any reference to imaginaries. To obtain the traditional form of a complex 
number, we note that 


(a,, a2) = (a1, 0) + (a2, 0)(0, 1) =a, +aav-l. 
Hamilton expanded on this idea by writing: 


In the THEORY OF SINGLE NUMBERS, the symbol /—I is absurd, and denotes 
an IMPOSSIBLE EXTRACTION, or a merely IMAGINARY NUMBER; but in the 
THEORY OF COUPLES, the same symbol J! is signi cant , and denotes a POSSI- 
BLE EXTRACTION, or a REAL COUPLE, namely (as we have just now seen) the 
principal square-root of the couple (—1,0). In the latter theory, therefore, though not in 


the former, this sign ./—1 may be properly employed; and we may write, if we choose, for 
any couple (a), a2) whatever, 


(4,,@2) =a, + a@V-1 eres 


In this way, “the metaphysical stumbling-blocks” that beset algebra were cleared away. 
For some historians of mathematics, the conception of complex numbers as number 
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couples is Hamilton’s greatest achievement in algebra, even more signi cant than his 
later discovery of quaternions. 

Hamilton closed his paper with the statement that “the author hopes to publish 
hereafter many other applications of this view; especially to Equations and Integrals, and 
to a Theory of Triplets.” The triplets that he sought were hypercomplex numbers that 
were related to three-dimensional space just as the usual complex numbers are related 
to two-dimensional space. His hopes were nally ful led in 1843 with his derivation of 
“quaternions” of four numbers, after a long and fruitless search for triplets. 

By analogy with the complex numbers, Hamilton wrote his triplets as a + bi + cj, 
where a, b, and c are real numbers and i* = j? = —1. Addition of such expressions 
presented no dif culty; they are added by adding their real or “scalar” parts and adding 
the coef cients of each of the units 7 and 7 to form new coef cients for these units: 


(at+bi+cj)+(a’+bi+cj)=(ata)+(b+b))i+(c4+c)j. 


But Hamilton was frustrated by his repeated failure to be able to de ne multiplication of 
triplets in a way that would preserve the properties of ordinary complex numbers. 

Now the modulus of a complex number a + bi is a? +57; and the “law of the 
moduli,” as Hamilton called it, states that the product of the moduli of two complex 
numbers equals the modulus of the product of the two numbers. Hamilton was particularly 
concerned that this law also hold for triplets. Consider the simplest case of the product of 
two triplets, the square of a + bi + cj. Assuming that the multiplication can be carried 
out and the terms collected, one gets 


(a+ bit+cjP =a? —b? —c? + 2abi + 2acj + 2bei;, 
which has modulus 
(a* — b? — cc’) + (ab)? + (ac? + (2beY = (a2 +b? +79 + (2bc)y’. 


The law of the moduli is not ful lled unless the ij-term is removed from the expansion 
of (a + bi+cj)*. The term must be either suppressed by setting ij = 0, or somehow 
included in one of the other three terms. Taking ij to be zero did not appear to Hamilton 
to be quite right: 


Behold me therefore tempted for a moment to fancy that ij = 0. But this seemed odd and 
uncomfortable, and I perceived that the same suppression of the term which was de trop 
might be attained by assuming what seemed to me less harsh, namely that ji = —ij. I made 
therefore ij = k, ji = —k, reserving to myself to inquire whether k was 0 or not. 


His thought here is that, if the order of multiplication is scrupulously respected, there 
are actually two terms involving the product of i and /; that is, 2bcij should be instead 
written as bc(ij + ji). The law of the moduli could be satis ed simply by assuming that 
ij + ji = 0, without taking either ij or ji to be zero separately. 

The next step was to “try boldly then the general product of triplets.” Under the 
supposition that i7 = —ji = k, Hamilton calculated 


(a+ bi+cj\(x +yit+z/) 
= (ax — by —cz)+i(ay + bx) + j(az + cx) + k(bz — cy). 
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Setting k = 0, he once again asked whether the law of the moduli is satis ed. In other 
words, does the equation 


(a? +b? +?)(x? + y? +27) = (ax — by — cz) + (ay + bx + (az + exp 


hold? Clearly the answer is no; for the left-hand side of the proposed equation exceeds 
the right-hand side by (bz — cy)’, which is the square of the coef cient of k in the 
expansion of the product. Thus, the assumption that k = 0 is untenable. However, letting 
k £0 is not entirely satisfactory either, because the product of two triplets should again 
be a triplet; and the product as indicated above contains four terms rather than three. 
For nearly 10 years, Hamilton was unable to get beyond this impasse. Every morning 
on coming downstairs to breakfast, his eldest son would ask him, “Well, Papa, can you 
multiply triplets?” And each time he had to confess ruefully, “No, I can only add and 
subtract them.” 

In a moment of insight, which occurred while he was strolling with his wife by the 
Royal Canal in Dublin in 1843, Hamilton suddenly realized that his dif culties would 
vanish if he multiplied expressions with four terms rather than three; that is, if he took 
k to be a third distinct imaginary unit in addition to i and 7. He described this dramatic 
event in a letter to one of his sons as follows: 


An electric current seemed to close; and a spark ashed forth, the herald (as I foresaw 
immediately) of many long years to come of de nitely directed thought and work, by myself 
if spared, and at all events on the part of others, if I should ever be allowed to live long 
enough distinctly to communicate the discovery. I pulled out on the spot a pocket-book, 
which still exists, and made an entry then and there. Nor could I resist the impulse— 
unphilosophical as it may have been—to cut with a knife on a stone of Brougham Bridge, 
as we passed it, the fundamental formula with the symbols i, /, k; 


Preyer ay.==); 


which contains the solution of the Problem, but of course as an inscription, has long since 
mouldered away. 


Hamilton called these new expressions “quaternions,” or fourfold numbers. They are 
hypercomplex numbers of the form g = a + bi + cj + dk, where a, b, c, and d are real 
numbers and i, j, and k satisfy i? = j* = k* = —1. Having already assumed that i* = 
j? =—land k =ij = —ji, it seemed clear to Hamilton that he should have 


P=) ==GO) =a f= =a 


To test the law of the moduli, he still needed values for ik and kj. Not yet sure that the 
associative law held for quaternions, Hamilton tentatively concluded: 


...that we had probably ik = —/j, because ik = iij, and i? = —1; and that in like manner 
we might expect to nd kj =ijj = —i. 


The associative law would also have provided the value for ki, since ki = (—ji)i = 
—ji? =(—/)(—1) = /. But Hamilton preferred to argue by analogy: 


...from which I thought it likely that ki = 7, jk =i, because it seemed likely that if 
ji = —ij, we should have also kj = —jk, ik = —ki. 
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Summarizing the multiplication “assumptions” (as Hamilton called them) for the 
quaternion units, we have 


P= pP=r=-1, ij =—ji=k, jk = kj =i, ki = —ik = j. 


These are the fundamental rules of calculation that Hamilton scratched on the nearby 
bridge with his penknife; they would preserve his priority until he could announce the 
discovery to the academy. They have long since faded, but today there is a commem- 
orative tablet on Brougham Bridge that reads: “Here as he walked by on the 16th of 
October 1843, Sir William Rowan Hamilton in a ash of genius discovered the funda- 
mental formula for quaternion multiplication i7 = j? = k? =ijk = —1 and cut it in a 
stone of this bridge.” 

Note that for two quaternions g and q’, the product gq’ does not in general equal q’q. 
Hamilton’s quaternion algebra obeys all the fundamental laws of traditional arithmetic 
except for the commutative law. That he was willing to abandon the commutative law 
while retaining the associative law is often regarded as a stroke of genius. To depart 
from long-established tradition and accept strange innovations was every bit as dif cult 
for the algebraist as it had been for the founders of non-Euclidean geometry. Just as 
the break with the parallel postulate had paved the way for all sorts of new geometries, 
this bold sacrice of the commutative law was to bring forth a host of new algebras in 
which the fundamental laws did not necessarily all apply. Indeed, within three months of 
Hamilton’s creation of the quaternions, John Graves arrived at the “octonions,” a system 
with eight unit elements; multiplication in this algebra was not only noncommutative but 
not even associative. 

Hamilton was convinced that in the quaternions he had found the right instrument 
to provide a mathematical description of our world of time and space. Time is a scalar 
and points in space are speci ed_ by three real coordinates; together four components are 
required, just as in a quaternion. For the remaining 22 years of his life, Hamilton’s scien- 
tic career was devoted almost exclusively to an elaboration of the theory of quaternions, 
applying them to dynamics, astronomy, and the theory of light. By the end of 1865, there 
had been 150 papers published on the subject, with 109 of them written by Hamilton. Ten 
years after his initial discovery, he brought out the Lectures on Quaternions, a massive 
work running to 736 pages plus an additional 64-page preface. The cumbersome Lectures 
proved to be unreadable to all save the most intrepid mathematicians. A colleague wrote 
Hamilton that the book would “take any man a twelvemonth to read, and a near lifetime 
to digest.” At the urging of friends, Hamilton began to write an introductory manual on 
quaternions, complete with examples and problems; but again his excessive verbosity 
carried him away, and this too expanded beyond reasonable bounds. The Elements of 
Quaternions, posthumously edited by Hamilton’s son in 1866, was even longer than the 
Lectures (762 closely printed pages.) 

The quaternions never ful Iled Hamilton’s expectation of becoming the mathematical 
language applicable to the physical world; few important physical discoveries were made 
by quaternion methods. As a basic tool for scientists, they proved to be simply too com- 
plicated for quick mastery and easy application. The rst and most pro table departure 
from Hamilton’s creation was made by the American Josiah Willard Gibbs (1839-1925). 
Using just the vector portion u = bi + cj + dk of a quaternion to represent physical quan- 
tities, Gibbs, in the early 1880s, built up a new system called three-dimensional “vector 
analysis.” In place of Hamilton’s single quaternion product, he introduced two different 
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types of multiplication: namely, the scalar or dot product of v and vu’ = b'i + c’'j + d'k, 
de ned by 


v-v' =bb'+cc' +dd', 
and the vector or cross product given by 
v xv = (cd' — c'd)i + (db! — b'd)j +(bc' — b'c)k. 


Despite the spirited advocacy of Hamilton’s devoted followers, the vector analysis of 
Gibbs eventually prevailed, replacing the quaternions for the practical purposes of physics 
and engineering. 

Although the failure of the quaternion cause tended to diminish Hamilton’s historical 
stature, the long view justi es his tremendous labor. Quaternions, with their abandonment 
of commutativity, were a key step in the development of modern algebra. They showed 
that it was possible consciously to construct new elements of algebra rather than nding 
them from elements of existing algebras. Once this possibility opened up, many people 
seized the opportunity—including Graves. Had Graves not left the matter of announcing 
the discovery of the octonions up to Hamilton, these numbers might be known today as 
the Graves numbers rather than the Cayley numbers. Unfortunately, Hamilton did not act 
right away, and in the meantime Arthur Cayley published a paper describing an algebra 
essentially identical to Grave’s octonions. 


Matrix Algebra: Cayley and Sylvester 


Arthur Cayley was born in 1821 at Richmond, in Surrey, during a short visit by his 
parents to England. His earliest years were spent in St. Petersburg, where his father was a 
partner ina rm of Russian merchants. In 1829 the family took up residence in England 
where Arthur, at the age of 14, was sent to King’s College School in London. There 
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the young Cayley gave such indications of mathematical genius that school of cials 
persuaded his father to abandon his intentions of bringing up the boy in the family 
business and to send him instead to Cambridge. Accordingly, Cayley began his university 
career at Trinity College, at the age of 17. In 1841, while still an undergraduate of 20, he 
published a work in the Cambridge Mathematical Journal, thus beginning an astounding 
series of papers that were to enrich various scientic periodicals. Cayley nished his 
student days in the following year by winning Cambridge’s two highest honors: Senior 
Wrangler in the Mathematical Tripos and the more dif cult Smith’s Prize for the essay 
of greatest merit on any subject in mathematics. 

At Cambridge from the 1780s onward, bachelor’s degrees were awarded on the ba- 
sis of a single examination called the Senate-House Examination. This consisted almost 
entirely of mathematics (including “arithmetic, algebra, uxions, the doctrine of in nites- 
imals and increments, geometry, trigonometry, optics, and astronomy’’) with some philos- 
ophy. Apparently the college tutors were interested in having a subject of instruction that 
lent itself to examination purposes. The heavy emphasis on mathematics so dominated 
the Cambridge curriculum “to the neglect of the classics” that in 1824 a second, clas- 
sical, examination was instituted; and from that date the old Senate-House Examination 
became known as the Mathematical Tripos. But the Classical Tripos was a voluntary ex- 
amination open only to those candidates who had already secured mathematical honors, 
a rule that must have excluded many. (The word “tripos” itself had a medieval origin; in 
the fteenth century the “ould bachelor” who conducted oral examinations at Cambridge 
sat upon a three-legged stool and was therefore called “Mr. Tripos.”) It was customary to 
publish the results of the Mathematical Tripos in strict order of merit, with the individual 
who earned the year’s highest score being given the title Senior Wrangler; after him 
came the other Wranglers, Senior Optimes, and Junior Optimes. After them were listed 
all the other students down to the last man, the Wooden Spoon. Nothing for a student 
in any university was deemed comparable to the distinction of being Senior Wrangler 
(‘the rst of the rsts”), for in its prime, the Mathematical Tripos was the most severe 
mathematical challenge ever devised; four days of tests, up to 10 hours a day. 

Stories were current in Cambridge of the equanimity with which Cayley treated his 
successes in the Tripos examination. A concluding feature of the examination was a three- 
hour paper consisting of problems representing the utmost range of the examiner’s fancy. 
On the evening after the grand competition, a friend rushed into Cayley’s room with 
disconcerting news of Cayley’s chief rival. “I’ve just seen Smith,” the friend announced, 
“and he told me that he did all the questions within two hours.” Cayley responded, 
“Likely enough he did: I cleaned up that paper in 45 minutes.” 

Upon completing his degree, Cayley was immediately elected a Fellow of Trinity 
College, at the youngest age for any student of that century. According to university 
statutes, a Fellow was required to take Holy Orders in the Church of England or to 
vacate the position after seven years. (Before the Oxford and Cambridge Act of 1877, 
it was contrary to the statutes to have a wife and a fellowship at the same time.) Many 
scholars held their fellowships for a brief period only, and then sought careers in which 
they would be free to marry; in such careers they would usually earn more than the 
modest stipends offered by their colleges. So it was with Cayley, who left the academic 
world at the end of three years to enter the legal profession. He practiced in London 
for 14 years as a conveyancer, drawing up deeds for the transference of property. But 
his real occupation was pure mathematics, and he wrote something approaching 300 
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mathematical papers, incorporating in them some of his most signicant discoveries. 
Determined to preserve a portion of his time for research, Cayley rejected much of the 
legal work that came his way, making a comfortable living but turning his back upon 
the opportunity to become wealthy. 

It was during this phase of his life that Cayley met James Joseph Sylvester, who was 
likewise dividing his allegiance between the law and mathematics. Thus began a lifelong 
friendship that would produce the theory of invariants (the two came to be known as 
“the invariant twins”) as well as important contributions to higher geometry, combina- 
torics, and the theory of matrices. Sylvester gratefully acknowledged that Cayley was 
the person “to whom I am indebted for my restoration to the enjoyment of mathematical 
life.” Between 1851 and 1855, when Sylvester left to accept a teaching appointment 
at the Royal Military Academy at Woolwich, both men established solid mathematical 
reputations. Often they would walk around the Courts of Lincoln’s Inn, with Cayley, in 
Sylvester’s words, “habitually discoursing pearls and rubies.” 

Around 1861, the Lucasian professorship of mathematics at Cambridge—the chair 
made illustrious by Barrow, Newton, and Cotes—fell vacant. Marking the growing accep- 
tance of science as part of a liberal education, the chair was _lled by the physicist George 
Stokes, who was Senior Wrangler and rst Smith’s Prizeman the year before Cayley. 
However, Cambridge wished to have Cayley also, and in 1863 created a new Sadlerian 
Professorship of Pure Mathematics and promptly offered it to Cayley. Sacricing any 
lucrative future in the legal profession for a modest provision in his truer vocation, he 
held this chair until his death in 1895. It is said the Cayley brought mathematical glory 
to Cambridge second only to that of Newton. 

In the spring of 1882, Cayley delivered an extended series of lectures at Johns 
Hopkins University where his friend and fellow worker, Sylvester, was then in charge 
of the mathematics department. The presence, at the same time, of two of the world’s 
most prominent mathematicians was said to have made Baltimore “the stronghold of 
mathematics in America.” Although he wrote but one extensive work, the Treatise on 
Elliptic Functions (1876), Cayley’s output of papers and memoirs was prodigious. His 
most productive period, the years between 1863 and 1883, saw the publication of 430 
papers. Many of these appeared in the Quarterly Journal of Mathematics, a periodical 
that he, Sylvester, and Stokes helped to found in 1855. As Cayley continued in creative 
activity until the end of his life, Cayley’s grand total of 966 articles ranks him third, after 
Euler and Cauchy, as the most prolic writer of mathematics. His Collected Mathematical 
Papers (1889-1898) ll 13 large volumes. 

It is customary to view Cayley as the creator of an algebra of matrices that did not 
require repeated reference to the equations from which their entries were taken. Cayley’s 
interpretation grew out of his interest in linear transformations of the form 

x’ =ax + by 
T; , 
y =cx+dy 
that may be viewed as transforming an ordered pair (x, y) into a pair (x’, y’). In search- 
ing for “an abbreviated notation for a set of linear equations,” Cayley symbolized the 
foregoing transformation by the square array 


[oe | 
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of its coef cients or “elements,” and called this a (square) matrix of order 2. Two such 
matrices are equal provided their corresponding elements are equal. If the transformation 
just given is followed by a second linear transformation, 
x” = ex! + fy’ 
Th 
yl" —_ gx! + hy’, 


then it is a simple matter to show that (x”, y”) can be obtained directly from (x, y) by 
using the single transformation 


x" =(ea+ fo)x +(eb+ fd)y 
y” = (ga +he)x + (gb + hd)y. 


TnT, 


This led Cayley to dene an operation of matrix multiplication by transferring the rule 
for computing the product of linear transformations to the matrices that represent them: 


e f a b ea+ fe eb+ fd 
g A c d|. ga+he gh+hd | 
Cayley was the rst to realize that square arrays themselves could be treated as 
algebraic quantities. He had a suf ciently clear idea of their various properties in the 


mid-1840s, but it was not until 1858 that he put forth his results in a paper called A 
Memoir on the Theory of Matrices. De ning addition of matrices by 


a b ef |_| ere bey 
edie el ete gee. 


Cayley showed that in the resulting system addition is both associative and commutative; 
and that multiplication is associative and distributive over addition. The matrix 


Blond 


is called the identity matrix (of order 2), because it leaves each second-order matrix xed 


under multiplication: 
a b a b a b 
l= =] ; 
c ad c d c d 


As with Hamilton’s quaternions, matrix multiplication is not commutative. This is seen 
by the following example: 


fo ulty olebe ol#Lt J-L) olle 


Also, the product of two matrices may be zero (that is, equal to the matrix all of whose 
elements are zero) even though neither factor is zero. 

The sole theorem contained in the 1858 Memoir (“a remarkable theorem,” said 
Cayley) is the famous Cayley-Hamilton theorem, which asserts that any square matrix 
“satis es its own characteristic polynomial equation.” To illustrate this situation, consider 
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an arbitrary second-order matrix A = 


; | If vertical lines represent the determi- 


nant of a matrix, then the polynomial 
a-x b 


(x)= |A-—xl|= 
e Cc d—x 


is the characteristic polynomial of A. The developed expression of this determinant is 
p(x) =x? —(a+d)x + (ad — be). 


When 4 is substituted into the polynomial, we get 
0 0 
p(A) = 47 —(a+d)A + (ad — be)I = 5 


It is in this sense that A satis es its characteristic polynomial. 

Cayley did not prove the Cayley-Hamilton theorem in general. He gave a computa- 
tional veri cation for matrices of order 2, noted that he had con rmed the same result for 
matrices of order 3, and concluded that it was unnecessary “to undertake a formal proof 
of the theorem in the general case of a matrix of arbitrary degree.” This re ects Cay- 
ley’s lack of interest in proofs where inductive evidence seemed convincing. Sylvester 
subsequently carried these investigations much further, and may be viewed as the main 
developer of the theory of matrices. Indeed, the introduction of the term “matrix” into the 
mathematical literature seems to be due to Sylvester (1848). Cayley, on the other had, 
came up with the notion of a square matrix between a pair of vertical lines to denote a 
determinant and used (1843) pairs of double vertical lines to indicate matrices. 

Cayley’s mathematical interest also touched on the emerging concept of an abstract 
group. The theory of groups is the oldest branch of modern abstract algebra. Its origins 
can be traced to the work of Lagrange, Rufni, and Galois, who studied the radical 
solution of equations by means of certain permutations of their roots. Galois introduced 
(1830) the word “group” as a technical term in mathematics. To him it meant merely a set 
of permutations closed under multiplication; for he wrote, “if one has in the same group 
the substitutions [permutations] S and 7, one is certain to have the substitution [permu- 
tation] S7.” It was Cauchy who pioneered the establishment of permutation groups as 
an independent area of investigation. In some three hundred pages, published in the two- 
year period 1844-1846, he introduced the familiar permutation notation—in which the 
arrangements are written one below the other, enclosed in a single pair of parentheses— 
along with a number of concepts such as a transposition, a cyclic permutation, and the 
order of a permutation. One of his most fundamental results, known today as Cauchy’s 
theorem, states: for every prime p dividing the order of a nite group, there exists an 
element of order p. Cauchy and the other early investigators worked wholly within the 
context of permutation groups, all the while developing basic ideas (cosets, normal sub- 
groups, quotient groups) that would later extend beyond that domain to abstract groups. 

In an 1854 paper, entitled “On the theory of groups as depending on the symbolic 
equation 6” = 1,” Cayley took the rst notable step in the evolution of the abstract 
view of a group. He starts with symbols 6, ¢,... representing general operations and 
lets 6@ denote their “compound operation” or product. It is stated that “The symbols 
0,@,... are in general such that 0- dy = 0¢- x.” Associativity is further emphasized 
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a few lines later when Cayley writes that “these symbols are not in general convertible 
[commutative], but are associative.” His de nition of a group is presented as: 


A set of symbols 1, a, 8,... all of them different, and such that the product of any two of 
them (no matter in what order), or the product of any one of them into itself, belongs to the 
set, is said to be a group. 


For Cayley, the symbol | “will naturally denote an operation that leaves the operand 
unaltered”; in other words, 1 serves as the identity element of the group. Despite the 
three-dot ellipses in his de nition, Cayley thought only in terms of nite groups. 

Cayley proceeds to introduce the idea of a multiplication table for a group (nowadays 
known as a Cayley table) and concludes that there are only two “essentially distinct” 
groups with four symbols 1, a, 6, and y, their tables being 


1 a Bp Y 1 a  £B Y 
1] 1 a B y Lt a | B Y 
ala B y 1 a}|a 1 Y B 
B| B Y 1 a B| B y 1 a 
y| 1 a | B y| y | BI @ 1 


In a modern terms these tables represent the cyclic group of order 4 (a group which 
Cayley says “is analogous to the system of roots of the ordinary equation x+ — 1 = 0”) 
and the Klein 4-group. As an example of the latter, Cayley cites four operations on the 
set of invertible matrices of a given order, writing them as 1, /, /.tr, and tr.J (where J 
indicates matrix inversion, and tr, transposition). He remarks in a subsequent article that 
the four quaternion units 1, i, 7, and k, along with their negatives, form a group of order 
8 under multiplication. 

Cayley’s advance toward abstraction went virtually unnoticed at the time. The math- 
ematical climate did not favor a formal approach to a concept having the theory of 
equations as its only signi cant application—matrices and quaternions being too new to 
be well known. As Cayley’s attention soon moved in other directions, “group” remained 
synonymous with “permutation group” for the next two decades. Leopold Kronecker in 
1870 set down the rst explicit axioms for an abstract group that is commutative, an 
unnecessarily restrictive condition that was removed in Heinrich Weber’s de nition 12 
years later; both mathematicians limited their attention to nite groups. The nite and 
in nite cases were included in W. von Dyck’s formulation, also in 1882. 

To quote Weber’s essentially modern de nition, from his paper in the early 1880s: 


A system G of h arbitrary elements 0), 62,..., 0, 1s called a group of degree / if it satis e s 
the following conditions: 


I. By some rule which is designated as composition or multiplication, from any two 
elements of the same system one derives a new element of the same system. In 
symbols 6,0, = 6. 

II. It is always true that (6,0, )0, = 0,(0@:0,) = 0,0;4,. 


III. From 06. = 006, or 6.0 = 0,0 it follows that 6, = 6,. 
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As the group concept became more prominent, various postulates appeared in the 
mathematical literature. These usually modied Weber’s condition III, so that the exis- 
tence of an identity and inverses were explicitly required. 

The middle of the nineteenth century saw a new general interest in discovering some 
symbolic calculus that would be capable of expressing geometric ideas in Euclidean 
spaces of dimension greater than three. One of the rst to succeed in this enterprise was 
Hermann Giinther Grassmann (1809-1877), a creative mathematician whom his con- 
temporaries virtually ignored. The son of a German schoolmaster in Stettin, Grassmann 
began his career in 1827 at the University of Berlin, where he studied theology and clas- 
sical languages for about six semesters. Around 1832, he began an intensive, independent 
study of mathematics; by 1834 he was hired to teach the subject at an industrial school 
in Berlin. He was not happy with either the city or the school and stayed there just over 
a year. Early in 1836, Grassmann returned to Stettin, where he would spend the rest 
of his life teaching mathematics, science, and religion in several secondary schools. He 
had no university training in mathematics and although he repeatedly sought a university 
position, he was never to hold one. 

In 1844, Grassmann brought out his highly original work, Die lineale Ausdehnun- 
gslehre, ein neuer Zweig Mathematik (The Theory of Linear Extensions, a New Branch 
of Mathematics). Its basic algebraic entity, which he called an extensive quantity, was a 
formal nite sum }° ae, where the coef cients a, are real numbers and @1, @2,..., en 
are a “system of units of rst rank.” The sum and difference of extensive quantities was 


given by 
o* opex HE > Beer = Sox + Buex 


and the multiplication of an extensive quantity by a real number a by 


a (> axer) = S\(wap der. 


In current language, Grassmann’s extensive quantities formed an n-dimensional vector 
space over the real numbers with basis e), e2,...,@,. Guided by geometric intuition, 
Grassmann proceeded to develop, essentially as is done today, such elementary notions 
as subspace, linear independence, and dimension. Another feature of the new theory was 
the (outer) product of two quantities 


3 axer) (> Bje;) = Y> ax Bjleres]. 


where the brackets indicate the product of units, called a system of second rank. These 
products of units were to conform to the rules 


lee;] = —lejex]J=exe;, LSk<j<n 


[exex] = 0, 1<k<n. 


The Ausdehnungslehre was very dif cult to comprehend. Besides endowing his “sci- 
ence of extensive quantitities” with an unfamiliar terminology and technical symbolism, 
Grassmann shrouded it with philosophical discussion. The result was a forbidding trea- 
tise that was accessible only with extreme dif culty. Indeed, it was almost unreadable. 
Essentially ignored by the mathematical community, this groundbreaking work had a 
tragic fate: it remained unsold, and the publisher disposed of the whole edition as waste 
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paper. In 1862, Grassmann brought out a revised version of the Ausdehnungslehre; he 
had three hundred copies printed, at his own expense, in his brother’s shop. This second 
edition followed its predecessor into temporary oblivion. 

Downhearted, Grassmann gradually abandoned mathematics and turned his talents 
to the study of Sanskrit and Indo-European languages. In this sphere he won public 
recognition and was awarded, in the last year of his life, an honorary doctorate from the 
University of Tiibingen. He had also completed the manuscript for yet another version 
of the Ausdehnungslehre, which was published posthumously in 1878. 

In 1888, Guiseppe Peano brought out a condensed elaboration of Grassmann’s ne- 
glected ideas under the title Calcolo geometrico secondo |’Ausdehnungslehre di H. Grass- 
mann (Geometric Calculus According to the Ausdehnungslehre of H. Grassmann). Its last 
chapter is notable for containing the rst explicit de nition of what Peano called a “linear 
system,” but is now known as an abstract vector space. His axioms were quite similar to 
Grassmann’s basic properties for extensive quantities. Where Grassmann’s publications 
were characterized by their abstruse nature, nothing could be clearer than Peano’s expo- 
sition. Nevertheless, his axiomatic formulation failed to in uence the mathematics of his 
day. The de nition of an abstract vector space did not become widely known until the 
early 1920s, when various writers (notably Norbert Wiener and Stefan Banach) realized 
its relevance to their own investigations. 


Boole’s Algebra of Logic 


We have seen that multiplication of quaternions, and also of matrices, broke the laws 
of ordinary algebra by being noncommutative operations. Once Hamilton had showed 
that the study of algebra need not treat just the real or complex numbers, new types of 
systems were rapidly created. A radically different system, one that differs fundamentally 
from traditional algebra even though addition and multiplication are both commutative, 
was developed by George Boole. He called the system an “algebra of logic,” whose 
general symbols could be interpreted either as sets or as propositions. 

George Boole (1815-1869) was born in humble circumstances in Lincoln, England, 
a shoemaker’s son. He went to elementary school and for a short time he attended a 
school for commercial subjects, but beyond this meager education he was entirely self- 
taught. Acutely conscious of England’s class distinction, the young Boole learned Greek 
and Latin, without outside help, in the hope that such knowledge would improve his 
social standing. This project led to his rst published work, a translation into verse 
of an ancient Greek ode; his father sent it to the local newspaper along with a note 
indicating that the author was 14 years of age. (The note’s bene t was mixed, for the 
town schoolmaster insisted that the translation could not be the product of an untutored 
boy.) Faced with the necessity of supporting his poverty-stricken parents, Boole took up 
teaching in elementary schools when he was 16 years old. Four years later, he opened 
his own day school in his hometown. 

The need to prepare his pupils in mathematics aroused Boole’s interest in the sub- 
ject. He mastered, again by his own unaided efforts, the works of the great mathemati- 
cians: Newton’s Principia, Lagrange’s Mécanique analytique, and Laplace’s Mécanique 
céleste. As Boole’s mathematical investigations proceeded, he began contributing a 
stream of original papers to the recently established (1837) Cambridge Mathematical 
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Journal and also to the Philosophical Transactions of the Royal Society. Boole’s reputa- 
tion was established by his essay on the calculus of operations, “On a General Method 
in Analysis”; it was published in the Philosophical Transactions in 1844 and awarded 
the Mathematical Medal of the Royal Society for the best paper in the most recent three 
years. A sideline pertains here: Boole’s manuscript had nearly been rejected by the Coun- 
cil of the Royal Society, but one member steadfastly argued that the author’s poverty 
and obscurity was no reason that his paper should be summarily dismissed. 

This award-winning paper was followed in 1847 by a slim 82-page pamphlet entitled 
The Mathematical Analysis of Logic, Being an Essay Towards a Calculus of Deductive 
Reasoning, Boole’s rst work on the subject in which he was to earn enduring fame. By 
a curious coincidence, it appeared almost simultaneously—at least in the same month, 
and some say on the same day—as his friend Augustus De Morgan’s book Formal Logic. 
Early in the following year, Boole penned The Calculus of Logic, giving some further 
developments of his system. 

In 1849, at the urging of De Morgan and others, Boole applied for the professorship 
in mathematics at the newly formed Queen’s College in Cork, Ireland. Although he 
was without a university education or degree, he secured the position on the basis of 
his research publication. While there he married Mary Everest, a niece of Sir George 
Everest, who surveyed the highest peak of the Himalayas. Boole remained in Cork until 
his premature death in 1864 from pneumonia, which he contracted after walking two 
miles in a drenching rain and, soaked to the skin, dutifully lecturing to his class. 

Boole was a member of the growing body of English mathematicians who liberated 
algebra from “common arithmetic” by suggesting that the rules that symbols obey are 
the important thing in algebra, and not so much the meaning that one may attach to the 
symbols. In particular, the symbols of algebra need not stand for numbers. In the opening 
section of The Mathematical Analysis of Logic, Boole writes: 


Those who are acquainted with the present state of the theory of Symbolic Algebra are 
aware that the validity of the process of analysis does not depend on the interpretation of 
the symbols which are employed, but solely upon the laws of their combination. 


This aspect of his work made Boole a pioneer in the evolution of modern abstract 
algebra. 

The idea of using algebraic symbolism not only to expedite reasoning about numer- 
ical quantities but to impart precision to the logical methods of reasoning can be traced 
to Leibniz in the seventeenth century. But it was not until the mid-1800s that symbolic 
logic began to emerge fully as a special branch of mathematics, and its early growth 
was primarily a consequence of the efforts of Boole and De Morgan. The signally im- 
portant contribution was Boole’s An Investigation of the Laws of Thought, on Which are 
Founded the Mathematical Theories of Logic and Probabilities (1854), which expanded 
and claried the content of his earlier pamphlet. The philosopher Bertrand Russell was 
later (1901) to assert that “Pure Mathematics was discovered by Boole in a work which 
he called The Laws of Thought....His work was concerned with formal logic and this 
is the same thing as mathematics.” As Boole says in the opening sentences of The Laws 
of Thought, his object is to show that the reasoning processes that are studied in logic 
can be formalized and carried out in an algebra of logic: 


The design of the following treatise is to investigate the fundamental laws of those operations 
of the mind by which reasoning is performed; to give expression to them in the language 
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of a Calculus, and upon this foundation to establish the science of Logic and construct its 
method. 


He goes on to construct an algebra of classes, now known as Boolean algebra, whereby 
logical problems can be solved by a process of formal calculation. 

The logical calculus that Boole described hewed closely to arithmetic algebra, in that 
he chose to use the symbols of arithmetic, and had only one property that departed from 
its usual laws (the novel algebraic law being x* = x, which is only true arithmetically 
if x is O or 1). The letters x, y, z,... represented various classes (sets) of things and the 
equality sign = between two classes indicated that they had the same members. Boole 
employed the notation x - y, or simply xy, to stand for the intersection of the two classes, 
that is, the class consisting of all those things common to x and y. In the same vein, 
x + y represented the union of x and y, but only if they were disjoint. The use of x + y 
to denote the exclusive sense of “or” led to dif culties; most of Boole’s successors in 
mathematical logic took x + y to mean the class of elements in x or y, or both. 

The symbols 0 and 1 had special meanings, with | designating the entire universe 
of things—the “universe of discourse” as Boole called it—whereas 0 stood for the class 
that had no members, the empty or null class. These interpretations accorded with the 
behavior of 0 and 1| in ordinary arithmetic: 


x Me x-0=0. 


The notation | — x, or for brevity x, indicated the class complementary to x; that is, the 
class of all elements in the universe that do not belong to x. Its use allowed Boole to 
represent the inclusive union of x and y (the inclusive “or”) as x + x - y. More generally, 
x — y stood for the class of things in x but not in y and was assumed to have meaning 
only if y is contained in x. 

Logical relations were built up from such symbols, so that x +x = 1 was inter- 
preted, for instance, as asserting that the universe is made up of elements that belong to 
the class x and those that do not. The resulting system was similar in many respects to 
traditional arithmetic. Boole assumed, either explicitly or implicitly, that the following 
familiar laws held: 


x+tyH=ytx, X-yH=y-x, 
X+VHt2=@t+y)+zZ, x-W-z)=O-y)-z, 
X-(V+z)=x-yt+x-z, xX-(y-Z)=xX-y—xX-z. 


But his new algebra differed essentially from the algebra of numbers in the properties 
expressed by the equations x -x = x and x +x =x. Another algebraic rule that had no 
counterpart in ordinary algebra was x - (1 — x) = 0, derived by Boole from x? = x. 

After Boole’s death, his pioneering work was developed much further by De Morgan 
and the American logician Charles Sanders Peirce (1809-1890). They independently 
enunciated the principle of duality: To every relation involving logical addition and 
multiplication there is a corresponding dual relation, which is obtained by an interchange 
of the signs + and -, as well as the symbols 0 and 1. (For example, dual to x - x =0 
is the relation x +x = 1.) De Morgan and Peirce also enriched the young science of 
mathematical logic with the discovery of what are still called De Morgan’s Rules; in 
Boole’s notation these read 


X-yH=xt+y, xX+y=H=x-y, 
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Augustus De Morgan (1806-1871) is a unique gure in nineteenth-century English 
mathematics: he made no startling discoveries, was perhaps more logician than mathe- 
matician, yet managed to exert considerable in uence on public education through his 
gifted teaching and prolic output of popular writing. It is said that he “scattered his 
energies so recklessly as to render notable success in any one eld impossible.” 

De Morgan was born in Madras, India, where his father was a colonel in the service 
of the East India Company (the Company was permitted to maintain its own army, 
composed of a few “European” regiments and a growing number of “native” regiments 
under English of cers). Brought back to England as an infant, he attended a series of 
private schools before entering Cambridge University at the age of 16. Although regarded 
as the ablest candidate by far, De Morgan placed a disappointing Fourth Wrangler in the 
all-important Mathematical Tripos examination of 1827; his low ranking, viewed as 
something of an intellectual disaster, was attributed to the wide mathematical reading 
that frequently drew him away from the prescribed path of examination review. 

Throughout his life, De Morgan never hesitated to take a stand on principle, re- 
gardless of personal sacri ce. By refusing to submit to the required Church of England 
religious test, he was prevented from proceeding to the M.A. degree or from obtaining 
a fellowship at Cambridge or Oxford. Thus, with an academic career apparently closed 
to him, De Morgan contemplated preparing for the bar, only to abandon legal studies 
when the secular London University—later renamed University College, London—was 
created. 

In 1828, he was appointed to the poorly paid position of Professor of Mathemat- 
ics and, when 22 years old, began lecturing. The tempestuous James Joseph Sylvester 
attended these lectures as a boy of 14, but did not last very long, being expelled for at- 
tempting to stab a fellow student with a table knife. De Morgan occupied his post, except 
for ve years in the 1830s, until his resignation in 1866 in protest over an abridgment of 
academic freedom. Characteristically, he refused to be a candidate for the Royal Society, 
viewing it as too like a social club. He also declined the honorary doctoral degrees that 
were offered to him. 

Never wealthy, De Morgan was forced to take up private tutoring as a means of 
increasing his income. One of his tutoring pupils was Charles Babbage’s youthful friend, 
Ada Lovelace, often remembered as an “inventor” of computer programming. He also 
turned to writing a stream—soon to become a ood—of textbooks, pamphlets, and ar- 
ticles to the general reading public. De Morgan wrote books on algebra, arithmetic, 
trigonometry, probability, logic, and calculus. His popular book The Differential and 
Integral Calculus (1842) was the rst English-language calculus text to be based on 
Cauchy’s theory of limits. Perhaps De Morgan’s most notable work is the Budget of 
Paradoxes, edited after his death by his widow. The Budget is an amusing satire of the 
fallacies of would-be circle squarers, angle trisecters, and their kind. He contributed no 
fewer than 850 articles to the famous Penny Cyclopedia and wrote regularly for at least 
15 periodicals. No topic in mathematics or its history was too insigni cant to receive De 
Morgan’s attention. 

De Morgan’s chief contribution to scholarship lay in his application of mathematical 
methods to formal logic, and to the subsequent development of symbolic logic. In a 
series of articles contributed to the Transactions of the Cambridge Philosophical Society 
(ve papers between 1846 and 1862), he attempted to generalize the traditional theory of 
syllogistic reasoning. His ideas on logic attracted little attention in his own time, owing to 
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11.4 Problems 


1. Verify that quaternion multiplication is 
noncommutative by computing the product of the 
quaternions gq =1+i+kandq’=2+j-—kin 


both orders. 


2. In the algebra of quaternions, if (b) 
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a complicated notation that he varied on different occasions. Equally unfortunate for him 
was the publication of Boole’s more readable Mathematical Analysis of Logic, which, 
as we have seen, appeared contemporaneously with De Morgan’s own treatise, Formal 
Logic. 

Sometime in 1864, a small group of De Morgan’s students proposed the formation 
of a society to which mathematical discoveries could be brought. In a bold extension 
of their more modest plan, the new organization changed its name from the University 
College Mathematical Society to the London Mathematical Society. It met for the rst 
time in 1865, with De Morgan taking the chair as its president. In his opening address he 
attacked—not surprisingly—Cambridge’s competitive examination system as inhibiting 
training for mathematical research. His efforts met with little success; in the next century, 
the eminent G. H. Hardy still complained that “cramming” for the Tripos was the root 
of England’s mathematical backwardness. 

The London Mathematical Society soon became a national rather than a local society, 
enlisting among its members the leading mathematicians of the country. J. J. Sylvester, 
then teaching at Woolwich, was the next president after De Morgan; and Arthur Cayley, 
Sadlerian Professor at Cambridge, was the third president. A permanent memorial to the 
active part De Morgan played in the successful establishment of the society is its award, 
every three years, of the De Morgan Medal. 


hence, there are in nitely many quaternions whose 
squares equal —1. 


4. Using Hamilton’s interpretation of complex numbers 
as “number couples” (that is, ordered pairs of real 
numbers), conrm the following: 


(a) Multiplication is both commutative and 
associative. 

The couple (1, 0) satis e s (a, b)- (1, 0) = (a, b) 
for any (a, b). 


q=atbi+tcj+dk and g=a-—bi-—cj —dk, 


show each of the following: 


(a) g=(a+bi)+(c+di)j. 

(b) gi =igq if and only ifc=d=0. 

(c) qi = ig if and only if g =0. 

(d) Ifq 0, then qg had an inverse g~! under 


multiplication; that is, gg~! = 1 = q~'q. [Hint: 


Take q>'! = q/(a? +b? +c 4+’). 
(e) gq and q are both roots of the quadratic 
polynomial 


p(t) =? —2at+ (a? +8? +c’? +4’). 


[Hint: Note that 
Gg = (a? = b? _ 


3. Establish that whenever b,c, and d are real numbers 
satisfying b* +c? + d* = 1, then the quaternion 
q = bi + cj + dk has the property that g? = —1; 


c? —d’) + 2abi + 2acj +2adk.] 


(c) If (a,b) 4 (0, 0), then (a, b) has an inverse 
under multiplication, in the sense that there 
exists a couple (x, y) satisfying 
(a, b)- (x, y) = C1, 0). [Hint: Consider (x, y) - 
(a/a? +b? — b/a* + b?).] 


. Given two ordered triples of real numbers, say 


X = (x1, X2,%3) and y = (1, 2, ¥3), dene their 
cross product x x y by 


XX Y= (X2y3 — y2xX3, —O1 V3 — Vis), XL ¥2 — 1X2). 
For ordered triples x, y, z, establish that 


(a) xx0=0x x =0, where 0 is the triple having 
all entries zero. 

(bl) xx y=-yxx=-(y x x). 

(c) («+y)xz=(x+2z)+( x z) and 
xx (y+z)=(% x y)+( Xz). 


Arithmetic Generalized 


(d) Ife: =(1, 0, 0), e = (0, 1, 0), e3 = (0, 0, 1), 
then e; X @& = e3, X=, 


€3 X €) = —@e2. 


6. Show that in general the associative law does not 
hold for the cross product. 


7. In the algebra of matrices of order 2, show the rl 
following: 
(a) b <=tl a a hence there exist 
1-1 =| 9 | hence there e 


nonzero matrices whose Squares are Zero. 


There is no matrix A for which 


0 1 
Dy oz 
e=[? }] 


(b) 


2 
1 1 tf a 
(c) E qe » jana 
1 1 1 Of; | 0 0 
0 0 -1 0]; | 0 OF 
(d) Ifvz is a positive integer, then 


fo i}=[o t] 


8. The transpose of a matrix A, denoted by A‘ is the 


matrix obtained from A by interchanging its 12. 


successive rows and columns; for example, 


fs a]-[5 «| 


For matrices of order 2, prove the following facts 
from Cayley’s Memoir: 


(a) (AY =A. 
(b) (AB) = BIA’. 
(c) If A has an inverse 4~! under multiplication, 


then (A7!)' = (A). 


9. Establish the Cayley-Hamilton theorem for matrices 
of order 2: if A is a matrix of order 2 and p(f) is its 
characteristic polynomial, then p(A) = 0 (the zero 


matrix of order 2). 13. 


10. At the end of his Memoir, Cayley writes, “If L, M 
are skew-convertible matrices, that is, LM = —ML 
of order 2, and if these matrices are such that 
L? =—I, M? = —I; then putting N = LM = —ML, 
we obtain 

L? =—-I,M? =-I, N* =-I, 
L=MN=-NM, M=NL=-LN, 


N=LM=-ML.” 
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Illustrate this situation using the matrices 


0 7 
ae y | and 


u=| . J@=-p. 


. In abstract algebra, a group is taken to be a set G of 


elements on which there is an operation * de ned. 
The operation satis e s the following four properties: 


(a) For all a,b, in G,ax*b isin G. 

(b) For all a,b,c, in G, (ax b)*c=ax(bxc). 

(c) There exists an element e in G such that 
axe=a=erxa foralla inG. 

(d) For each element a in G, there exists an 


element a~! in G such that 


: lea, 


axa =e=a- 
: a b 

Let G be the set of all matrices A = aa of 
order 2 for which the determinant 

|A| = ad — bc 4 0. Prove that G is a group in the 
sense of abstract algebra, where matrix multiplication 
plays the role of the operation «. [Hint: For A in G, 

d/|A|_  —b/|A| 


take 
At= 
—c/|A| a/|Al 


A subgroup H of a group G is a subset of G that is 
itself a group when the operation of G is restricted to 
H. If G is the multiplicative group of matrices of 
order 2 with real entries and nonzero determinants, 
verify that each of the following subsets of G are 
subgroups of G: 


(a) Matrices of the form | ‘ ° | where a # 0. 
atrices of the form > where a A 

b) Matrices of the f 4 : here ad # 0 

(c) Matrices of the form | : : | 

(d) Matrices of the form | ; : | where a # 0. 


In his 1854 article, Cayley proved that there are just 
two “essentially distinct” groups having six elements. 


(a) 


Show that the set of six matrices 


ee lees Wat ca. 
oT) hla A] 


is a group under matrix multiplication. 
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14. 


15. 


16. 
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(b) Establish that the set of six integers 
{0, 1, 2,3, 4, 5} is a group under addition 
modulo 6. 

(c) Use Cayley tables to conrm that the groups in 
(a) and (b) are examples of the distinct groups 
with six elements. [Hint: The elements of one 
group commute, and those of the other do not.] 


A Boolean algebra is a set B of elements on which 

there are operations V and A satisfying the following 

properties for all a,b, and c in B: 

(1)aVvbandaa bare in B. 

Qavb=bvaandanb=baa. 

(3)aVv(bVc)=(av b)Vc and 
an(bAc)=(anb)Ac. 

(4) av (bAc)=(av b)A (aVc) and 
aN(bVc)=(anb)V (adc). 

(5) There exist elements 0 and 1 in B such that for all 
ain B. 


aV0=a and aANl=a. 


(6) For each a in B, there exists an element a’ in B 
such that 


aVa=1 and ana =0. 


In a Boolean algebra B, prove that for all a,b 

in B 

(a) aVl=1andaA0=0. [Hint: 
l=avada=avVv(a'A1).] 

(b) aVa=aandaAa =a. [Hint: 
aVa=(aVa)Al=(ava)A(ava’).] 

(c) aA(av b) =a andavy (aA b) =a. [Hint: 
a=aAl=adA(bv1).] 

(d) a’ is unique; that is, ifa vb = 1 andaAb=0, 
then b =a’. [Hint b=bV0=bvV(aana)= 
(bv a)A(bva'y=1A(bva’).] 

(ec) (av by =a'Ab' and(anb) =a vb". 


For any a,b, and c in a Boolean algebra B, establish 
the following: 


(a) aVco=bVvcandavVc'=bVC’' together 
imply that a = b. 

(b) aAc=bAcandaAc=bAc' together 
imply that a = b. 


Let B be the set consisting of the integers 1, 3, 7, 
and 21. For a,b in B, de ne 


aVb= \cm(a, b) and aA b= gcd(a, b). 


Also put a’ = 21/a. Show that B with these 
operations in a Boolean algebra. [Hint: Notice that if 


17. 


18. 
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a =3'7 and b = 3"7", where r,s,u,v = 0 or 1, 
then gcd(a, b) = 3*7/, where k = min{r, u} and 
J =min{s, v}; also Ilem(a, b) = 3"7”, where 

n = max{r, u} and m = max{s, v}]. 


Let B be the set of all logical propositions, that is, 
declarative sentences that are either true or false but 
not both. 


(a) If p,q are in B, de ne pV q to be the 
proposition “p or g.” Assume p V q is true if at 
least one of p and gq is true. 

(b) If p,q are in B, de ne pq to be the 
proposition “p and qg.” Assume p A q is true if 
both p and q are true. 

(c) If p is in B, de ne p’ to be the proposition 
“not p.” Assume p’ is true when p is false, and 
false when p is true. 

(d) If p and g are propositions formed by 
combining propositions r,s,f,... with the 
connectives V, A, and’, write p = q provided 
p and q have the same truth value for every 
assignment of truth or falsity to any of 
TiS Tie 

(e) Let 1 represent a proposition that is always 
true, and 0 represent a proposition that is 
always false. 


Establish that B forms a Boolean algebra, the algebra 
of logical propositions. 


Suppose that B is the Boolean algebra of logical 
propositions. 


(a) By assigning truth values to the propositions p 
and q, verify the following: 
(PA py =1,(pAq)v (p'Nq) = 0, and 
(PYQV(pAg)=1. 

(b) Show that the cancellation law does not hold in 
B: that is, pVr=qvVr (or pAr=qAr) 
does not necessarily imply p = q. 

(c) Verify the results in part (a) by using the laws 
of Boolean algebra as given in Problem 14. 
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CHAPTER l 2 


Transition to the Twentieth Century: Cantor 
and Kronecker 


The solution of the difficulties which formerly surrounded the mathematical infinite is probably the 
greatest achievement of which our age has to boast. 
BERTRAND RUSSELL 


Unlike England, France, and Russia, where the academies 


The Emergence of American more than compensated for the meager development of 
Mathematics scientic thought in the universities, Germany had univer- 


sities that assumed responsibility for scientic and mathe- 
Ascendency of the matical research. “The education of German universities,” 
said one French writer, “begins where that of most nations 
in Europe ends.” The German university in its character- 
istic form was precipitated by the utter defeat of Prussia 
by Napoleon at Jena in 1806. The humiliating Treaty of Tilsit (1807) stripped the country 
of all its territories west of the Elbe—in all, about half the territory and population—laid 
upon it a heavy indemnity of 120,000,000 francs, and compelled it to support an army 
of occupation of 150,000 soldiers. One result of the treaty was that Prussia lost all its 
universities except for three along the Baltic coast; the loss of those at Géttingen and 
Halle was the most severe blow. The throne and people of Prussia turned to education, 
the only area in which the French left them free to act as the means to the moral and 
physical regeneration of their country. Said the king, Frederick William II: 


German Universities 


We have indeed lost territory, and it is true that the state has declined in outward splendor 
and power, and for that very reason it is my earnest desire that the greatest attention be paid 
to the education of the people....The state must regain in intellectual force what it lost in 
physical force. 


To carry out this aim, a series of laws was passed establishing a universal, compulsory 
system of state education that was to inculcate patriotism in the oncoming generation. 
Prussia became a nation of pupils and schoolmasters. Then with dramatic suddenness 
came a renaissance of spirit, out of which rose the Prussia that was later to unify the 
German states. When its carefully trained and completely equipped armies achieved the 
military defeat of France in 1871, the schoolmaster of Prussia was held to have triumphed 
at last. 
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At the time of Prussia’s deepest national despair, a new university was founded in 
Berlin (1810), given an annual money grant, and assigned a royal palace for a home. The 
University of Berlin was not intended to be a mere addition to the existing universities, 
but rather the embodiment of a new conception of higher education. The new university 
was intended primarily to develop knowledge; secondarily and perhaps as a concession, 
it was to train the professional classes. The main emphasis for both students and pro- 
fessors was on original research, not teaching skill and examining; therefore, positions 
were offered only to those who had proved themselves capable of advancing knowledge. 
The lecture hall took the place of classroom recitation, and the seminar, in which a small 
group of advanced students investigated a problem under the direction of a professor, be- 
came a prominent feature of every department. (Weierstrass, in a joint undertaking with 
Kummer in 1861, introduced the rst seminar in Germany devoted entirely to math- 
ematics.) Although the appointment of all professors lay in the hands of the minister 
of education, the university was given full liberty to manage its own affairs in regard 
to studies and administration. With it came the modern academic freedom to pursue the 
truth in whatever way seemed best; the sober search for truth, without reference to where 
the truth led, was the watchword of the University of Berlin. 

As the 1800s advanced, other universities were founded on the new model of Berlin: 
Breslau (1811), Bonn (1818), and Munich (1826). The unquestioned superiority of their 
libraries, laboratories, and scholars explained the scienti c renown that Germany enjoyed 
abroad. The German universities, with their combination of lecture, seminar, and lab- 
oratory, were felt to be about the only institutions in the world where a student could 
obtain training in how to do scientic and scholarly research. As might be expected, this 
reputation for academic leadership brought clients from all over the world. In particular, 
a steady stream of American students sought specialized training in Germany. 

The universities of nineteenth-century Germany probably had the greatest in uence 
on the development of the modern institution. When pains were taken to eliminate nepo- 
tism, favoritism, and seniorism in appointments, a new kind of professorate emerged. 
A premium was placed on teachers who could also publish, and whose publications 
were suf ciently signicant to draw attention to the university. One result was that a 
would-be faculty member found it increasingly dif cult to take up an academic career 
without tangible proof of scholarly accomplishment. This was usually evidenced by the 
philosophical doctorate, as it was more and more associated with research and writing— 
frequently publication—of a dissertation. A person with a doctorate and the promise of 
scholarly merit could apply to a university for a license to give accredited lectures as a 
privatdozent. The fastest-growing part of the teaching staff, this rank carried the prestige 
of a title but no remuneration, its ostensible function being to increase the size of the 
student body while holding down the number of full-salaried professors. The income de- 
rived from the position came through the modest lecture fees collected directly from the 
students. After passing a habilitation examination, the privatdozent hoped that ongoing 
scholarship would allow him to climb the academic ladder as openings occurred (nation- 
wide searches were an exception). Perhaps the chief bene t of this entry-level position 
was that it acclimated the neophyte professor to the “research ethic” that was the pride 
of the German university: the advancement of the frontier of knowledge through original 
research. Judging from the lively curiosity shown by American academic observers, the 
conception of a teacher-researcher would soon shake up the old habits and lethargy of 
higher education across the Atlantic. 
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American Mathematics Takes Root: 1800—1900 


Until the mid-1800s, there was hardly any advanced mathematical work in the United 
States worth mentioning. With little incentive to pursue original investigation, college 
faculty were content to transmit preexisting mathematics learned from European sources. 
Furthermore, it tended to be mathematics at a relatively nondemanding level: arithmetic, 
elementary algebra, plane and solid geometry, along with a smattering of trigonometry 
and conic sections. Few major colleges made so much as a stab (usually fatal) at offering 
calculus in the period prior to the Civil War. 

Most American colleges in the seventeenth and eighteenth centuries had been pat- 
terned after institutions in Mother England; like them, they were intended for the training 
of a literate and godly body of clergy. By the early decades of the nineteenth century, as 
the preoccupation with theology began to wane, college education had become more a 
“gentleman’s education”; its aim was to produce upright, re ned, and highly cultivated 
young men by means of a classical curriculum. (A liberal dose of Caesar, Plutarch, and 
Euclid was believed unsurpassed for disciplining the mind and forging the character.) 
But there were many signs of dissatisfaction with an educational system tailored almost 
exclusively to the interests of one social class and which served little to benet the 
economic fabric of a growing industrial nation. 

It is said that modern mathematical teaching in America began with John Farrar 
(1778-1853) and modern research with Benjamin Peirce (1809-1880). Farrar graduated 
from Harvard College in 1803 and returned four years later as professor of mathemetics 
and natural philosophy, or physics. He held the position until 1836 when ill health caused 
him to retire. Mathematics had been an essential part of the school’s classical curriculum 
from the outset, although its program was not very challenging; a knowledge of arithmetic 
was not a requirement for admission until 1803, nor was knowledge of algebra required 
until 1819. Farrar moved to modernize Harvard’s offerings by translating and editing 
portions from the most popular continental textbooks, especially those by French authors. 
Between 1818 and 1824, he placed in his students’ hands versions of mathematical 
works by Legendre, Lacroix, Bézout, and Euler. The most signi cant adaptation, the 
First Principles of Differential and Integral Calculus (1824), drew heavily upon Etienne 
Bézout’s Cours danalyse . It provided American undergraduates with their introduction 
to Leibniz’s notation for the calculus as opposed to the antiquated uxional symbolism 
of Newton. The availability of these new textbooks served to raise the level of instruction 
in mathematics not only at Harvard but also at the country’s other major institutions of 
learning. 

Robert Adrain (1775-1843) is regarded as America’s rst creative mathematician. 
Born in Ireland and receiving little formal education, he was still by the age of 15 
suf ciently versed in mathematics to support himself by teaching and tutoring in the 
subject. With his wife and infant daughter, Adrain ed to the United States after being 
wounded in the 1798 uprising against British authority. Settling in Princeton, he was soon 
hired as headmaster or teacher at a succession of educational institutions, among them 
Queens College (now Rutgers University), the University of Pennsylvania, and Columbia 
University, where he was appointed professor of mathematics and astronomy. 

In the opening decades of the nineteenth century, there were few outlets for the 
publication of original mathematical research, and those that did exist were short-lived 
for want of a suf cient audience. Adrain became an early and frequent contributor to 
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the country’s rst mathematical journal, the Mathematical Correspondent, which was 
founded in 1804. Although the Correspondent was largely taken up with submitted prob- 
lems and their solutions, Adrain on one occasion furnished a 50-page essay entitled “View 
of Diophantine Algebra.” When the Correspondent ceased publication in 1808, Adrain 
launched his own periodical, The Analyst, or Mathematical Museum; it too expired after 
only a single volume consisting of four issues. The second issue deserves special notice, 
for in the course of solving a posted surveying problem, Adrain, unaware of the investi- 
gations of Legendre and Gauss, gave a probabilistic derivation of the law of least squares. 
It is generally accepted today that all three men were independent discoverers of the law. 

The nation’s earliest avenue for the publication of articles of scientic interest was the 
Transactions of the American Philosophical Society, founded in 1771. Adrain’s concern 
with problems beyond mathematics is indicated in two papers in the Transactions of 
1818: one dealing with the mean diameter of the earth and the other with the force of 
gravity at different latitudes. He also made a further attempt at publishing a mathematical 
periodical, with the Mathematical Diary. This venture enjoyed more success than did its 
predecessor, appearing in 13 issues between 1825 and 1832. Although hindered by a 
lack of contact with contemporary European mathematicians and limited by inadequate 
library facilities, Adrain pursued his own research to a creditable degree. 

The African-American Benjamin Banneker (1731-1806), although not a mathemati- 
cian in the usual sense, is yet a remarkable gure of this period. He was born to freed 
slaves on their small farm about 10 miles from Baltimore, Maryland. His parents taught 
him to read, but his education outside the home consisted only of several winters’ at- 
tendance at a local Quaker school for boys. Banneker’s mechanical ability came to wide 
notice in his early twenties, when he had occasion to see the workings of a pocket watch. 
Studying its mechanisms, he constructed a wooden clock, carving its gears by hand. His 
clock chimed the hours for the next 20 years. 

In 1788, a prosperous neighbor loaned Banneker a telescope and three works on 
astronomy—two of them consisting entirely of tables. Without assistance he undertook 
to teach himself the practical aspects of astronomy and the mathematics necessary for 
carrying out its computations. Banneker made such rapid progress that he was able to 
predict the time of the solar eclipse of April 14, 1789, with greater accuracy than several 
well-established astronomers. 

This remarkable aptitude for calculation led Banneker to prepare a compilation he 
called Almanac and Ephemeris for the Year 1782. Its information included the locations 
of planets and bright stars, times of eclipses, phases of the moon, hours of sunrise and 
sunset, as well as a table of tides for Chesapeake Bay. Because it served the purpose of a 
household calendar, his almanac succeeded beyond expectations. It sold in large numbers 
throughout the mid-Atlantic states. Banneker sent a manuscript copy to Secretary of State 
Thomas Jefferson, who forwarded it to the Académie Royale des Sciences in Paris as 
indicative of the talents nature has given to “our black brethern.” Banneker continued to 
prepare consistently accurate yearly almanacs for each of the next 10 years. 

Although the developing society required an expansion of scientic instruction in 
general, with a strong mathematical component, it was technology rather than science that 
rst took root in American higher education. Specialized institutions such as Rensellaer 
Polytechnic Institute—essentially a school of civil engineering—opened in 1825, to be 
followed later in the century by the Polytechnic Institute of Brooklyn (1855) and the 
Massachusetts Institute of Technology (1865). Even Harvard and Yale sprouted separate 
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technical institutes in the late 1840s—the Lawrence Scientic School and the Shef eld 
Scientic School, respectively. But education, even in the oldest and most advanced 
colleges, remained centered around classical studies with a highly prescribed curriculum 
from which little deviation was allowable. Applied science was viewed as an inferior 
subject for inferior students whose degrees carried little prestige. At Yale, the “scienti cs” 
were kept segregated from the rest of the students in chapel. 

In the rst half of the nineteenth century, only a few Americans made any noteworthy 
contribution to higher mathematics. The mathematical talent of the country was diverted 
almost exclusively to the practical science of astronomy or to the various coastal and 
geodesic surveys. One recognizable exception was Nathaniel Bowditch (1773-1838). 
A largely self-taught mathematician, he translated four of the ve volumes of Laplace’s 
Mécanique Céleste between 1814 and 1817. Finally published in 1829 at his own expense, 
it was the rst great scientic work issued in the United States. To make the work a 
full introduction to Continental mathematics, Bowditch added to Laplace’s text nearly 
an equal amount of explanatory notes. He also wrote a classic work on navigation, 
New American Practical Navigator (1802), which is still in print today. Along with 279 
pages of instruction and 29 pages of tables, it contained corrections to the more than eight 
thousand errors in the standard British reference of the time. The Navigator soon became 
“the seaman’s Bible” without which no captain could sail. Bowditch continued as editor 
of the immensely popular Navigator, adding additional material to its constantly new 
editions until his death, when his son took over the project. These writings earned him 
membership in the Royal Society of London. Harvard College conferred an honorary 
M.A. degree on Bowditch and offered him a professorship in mathematics (which he 
declined)—quite an honor for a man whose formal education never advanced beyond 
elementary school. 

Until the middle of the nineteenth century, most American navigators relied on 
foreign tables to set their courses, mainly using the British Nautical Almanac (est. 1767). 
An act of Congress in 1849 authorized the United States Navy to prepare an annual 
American ephemeris and nautical almanac. Its rst volume, which was published in 1853, 
included the many tables necessary for the year 1855. The routine mathematical calcu- 
lations involved fell to a small group of “computers,” some of whom would later come 
to be regarded as the nation’s leading theoretical astronomers. Two who achieved such 
prominence were Simon Newcomb (1835-1909) and George William Hill (1838-1914). 

Simon Newcomb was born in Nova Scotia where he was schooled at home by his 
father, a country teacher. He came to the United States at the age of 18 and began his 
career by also teaching in rural schools. In 1857, he was appointed a “calculator” in the 
Nautical Almanac Of ce. At that time the of ce’s headquarters were in Cambridge, Mas- 
sachusetts, close to Harvard’s powerful telescope. Taking advantage of the opportunity, 
Newcomb enrolled in Harvard’s Lawrence Scientic School. He graduated with a Bach- 
elor of Science degree a year later, having studied under Benjamin Peirce. Newcomb was 
commissioned a professor of mathematics in the U.S. Navy in 1861 and assigned to the 
Naval Observatory in Washington. He subsequently rose (1877) to the position of Super- 
intendent of the Nautical Almanac Of ce, by then located in Washington, and directed 
its activities for the next 20 years until reaching the mandatory retirement age of 62. 
During this period, he also served (1884—1894) as a part-time professor of mathematics 
and astronomy at the recently founded Johns Hopkins University, as well as becoming 
the editor of its periodical, American Journal of Mathematics. 
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Newcomb was acclaimed internationally for a succession of scientic contributions. 
For instance, there were discrepancies in the moon’s calculated position in the sky: mea- 
suring and explaining them had become a notoriously complex question for astronomers. 
An 1846 estimate that the moon’s actual position deviated from its expected one by at 
most 21 arcseconds over 240 years was found to be unsatisfactory. In 1871, using the 
available tables and records as far back as 1675, Newcomb was able to improve the 
empirical correction to 17 arcseconds every 273 years. 

Between 1880 and 1882, he and Albert Michelson collaborated on a fresh determi- 
nation of the speed of light as being close to 186,000 miles per second. Another feat was 
his calculation (1891) of the earth’s distance from the sun as nearly 92,950,000 miles (the 
best current estimate being 92,955,860 miles). These noteworthy efforts led to honorary 
degrees from 17 universities and election to national academies or astronomical societies 
in just as many other countries. 

Another mathematical astronomer with a worldwide reputation was George W. Hill. 
He graduated from Rutgers College in 1859 and shortly thereafter joined the staff of the 
Nautical Almanac in Cambridge. A solitary person, Hill sought and obtained permission 
to carry out his calculations at the family farm in New York. However, he was called 
to Washington in 1882 when Newcomb became superintendent of the Almanac Of ce 
and remained there until his retirement 10 years later. In the initial (1878) volume of 
the American Journal of Mathematics, Hill published an article entitled “Researches on 
Lunar Theory” in which he developed an entirely new approach to the problem of three 
mutually attractive bodies. A version of the paper afterward (1886) appeared in the 
Swedish journal Acta Mathematica. France’s renowned mathematician Henri Poincaré 
rened Hill’s investigations in a prize-winning memoir of 1887. When Poincaré visited 
the United States and was introduced to Hill, his rst words were, “You are the one 
man I came to America to see.” In his later years, Hill was the third president of the 
edgling American Mathematical Society, and from 1898 to 1901 he lectured on celestial 
mechanics at Columbia University. 

Another outstanding gure of this time was Maria Mitchell (1818-1889), the rst 
American woman astronomer. She was educated by her father, a respected amateur 
astronomer on Nantucket Island. From an early age, she assisted him at their modest 
home observatory, making observations of star positions to aid the local whaling eet 
in navigation. Mitchell’s discovery in 1847 of a new comet brought her international 
celebrity and a gold medal from the king of Denmark. Hired by the Nautical Almanac 
Of ce in 1849, she spent the next 19 years working part-time at home on the positions of 
the planets. Mitchell was appointed professor of astronomy and director of its observatory 
when Vassar College opened in 1865. She remained at Vassar until her retirement in 1888. 

Benjamin Peirce (1809-1880), one of the 50 original incorporators of the National 
Academy of Sciences (1863), was generally regarded as the leading mathematician of 
his day. As an undergraduate at Harvard, he assisted Bowditch in revising and correcting 
the proof sheets of the Laplace translation, and, in doing so, became familiar with a level 
of mathematics considerably beyond that treated in America. Some 30 years later, Peirce 
dedicated one of his most notable works, A System of Analytic Mechanics, to “my master 
in science, Nathaniel Bowditch.” 

Peirce graduated from Harvard in 1829 and returned as a tutor two years later. On 
receiving his M.A. from the institution (1833), he was appointed—at the age of 24— 
to a professorship in mathematics and physics, which he retained until his death. The 
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amount of mathematical instruction then given at Harvard was small, with the teaching 
of calculus and everything beyond it falling on Peirce. When Arthur Cayley spoke of 
him as being the “Father of American Mathematics,” it was in acknowledgment that the 
advanced courses Peirce offered had never been available at any other American college. 
He was one of the earliest and most in uential advocates of Hamilton’s new system of 
quaternions, and took pains to interest his students in a subject that he believed was to 
have a fruitful future. (A poor prophet, he did not live to see the decline of quaternions 
after the introduction of vector analysis.) Mathematical research in America is often 
said to have begun with Peirce; before him it never occurred to anyone that scholarly 
research was one of the activities for which a mathematics department existed. His 
most outstanding work was a memoir Linear Associative Algebra, read to the National 
Academy in 1866-1870. It was printed only in lithograph form for private circulation in 
1870, and at long last published in the American Journal of Mathematics (1881). The 
memorable opening sentence states: 


Mathematics is the science which draws necessary conclusions. 


On page 5 is reference to the “mysterious formula” connecting z,e, and 7, namely, 
i~! = e*/*, which evidently had a strong hold on Peirce’s imagination. After proving it 
in one of his classes on analysis, he said to his students, “Gentlemen, this is surely true, 
it is absolutely paradoxical, we can’t understand it, and we haven’t the slightest idea 
what the equation means, but we may be sure that it means something very important.” 

A Yale University physical chemist, Josiah Willard Gibbs (1839-1903), wrote for use 
by his students a pamphlet, Elements of Vector Analysis, which was a great simpli cation 
and improvement of Hamilton’s vector calculus. The value of Gibbs’s work was so little 
appreciated by the authorities at Yale that for 10 years he served without pay, living on 
his inherited income. Only when Gibbs was invited to join the faculty of the new Johns 
Hopkins University was Yale persuaded to provide him with a salary. This was still, 
however, only two-thirds of what had been offered him in Baltimore. There is a story 
that the German mathematician and physicist Hermann von Helmholtz on his visit to Yale 
in 1893 expressed regret at missing an opportunity to talk with Gibbs. The university 
of cials, perplexed, looked at one another and said, “Who?” 

For most of the nineteenth century, there was no good understanding of mathematical 
research in the United States, much less the capacity for training research mathematicians. 
It was considered impossible for an American to enter the eld of advanced mathematical 
study without going abroad to seek ful IIment at rsthand under a European professor. 
Germany proved to be particularly attractive, partly because of the brilliance of its teach- 
ers and partly because of the relatively inexpensive German doctorate, which could be 
obtained fairly quickly by a well-prepared student. Then there was the matter of pres- 
tige; to have studied at a German university placed one in a superior class, and the actual 
attainment of a German degree was looked on as an infallible passport into American 
academic circles. One estimate about 1900 indicated that 20 percent of the mathematical 
faculty of American universities had at some time studied in Germany. Clearly, there 
was a signicant German in uence on mathematics in the United States. 

For study in Germany, Géttingen was the most frequent choice, with Berlin and 
Leipzig the next most popular. The Americans at the University of Gdéttingen were 
so numerous as to have their own letterhead, “The American Colony in Géttingen.” 
These American scholars returned from Europe with a new zeal for research in elds 
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they had not been aware of. They also brought back, for better or worse, a taste for 
abstract mathematics rather than for its applications. Thus, the early 1890s witnessed 
a great awakening in American mathematics, when many able and enthusiastic young 
men, largely trained in Germany, set about raising the subject to the same level as that 
pursued in Europe. 

At the same time, a notable number of German mathematicians joined American 
university faculties, where they rendered distinguished service. When the University 
of Chicago opened its doors in 1892, Eliakim Hastings Moore, an American who had 
studied in Berlin during 1885-1886, was appointed professor and acting head of the 
mathematics department. Moore persuaded the trustees to employ for him two unusually 
ne scholars, Oskar Bolza and Heinrich Maschke, both former students at Berlin and 
Ph.D.’s of Gottingen. The young and vigorous department at Chicago with its trio of 
leaders soon became a major American research center. Bolza’s lectures in particular were 
responsible for a highly productive American school in the modern calculus of variations. 

Alternatives to a German university experience began to appear shortly after midcen- 
tury. In 1861, Yale awarded the rst earned doctoral degrees in America; the University 
of Pennsylvania followed Yale’s lead 10 years later to become the second such institution. 
The existence of these programs prompted Harvard to establish its Graduate Department, 
which produced its initial doctorates in 1873. The elective system, pioneered by Harvard 
(1869), not only counteracted the dilution of courses but allowed professors to teach 
more advanced topics. Benjamin Peirce unabashedly championed electives in the hope 
that poor students would elect to stay out of his courses. (Few took the courses in any 
event, since Peirce was famous for his inability to make himself understood in class as 
he raced with religious fervor through abstruse mathematics.) 

At the beginning of the Civil War, there was no such thing as an American university 
in the European sense; there were only colleges, and a large number of them. But in the 
next decade the persistent rise of interest in more advanced studies made major changes 
in higher learning almost inevitable. The emerging university movement was to nd its 
inspiration in the spirit of the German educational system, rather than in the English or 
French examples. 

The real credit for the sudden outburst of mathematical energy in the United States 
during the late 1800s must go to its rst research-oriented university. Johns Hopkins was 
founded in 1876 through the benefaction of a Baltimore nancier of that name. (Hopkins 
left half of his $7 million estate to establish a university, the largest single bequest made 
to that day to an American institution of higher learning.) In a radical departure for the 
time, the trustees of Johns Hopkins decided to mold their new school on a model provided 
by the University of Berlin. The long-ignored investigative function of a university would 
be given new priority in their plan; a resident faculty of high caliber, supplemented 
by distinguished scholars brought in as visiting lecturers, would enrich their individual 
elds through original research and publication. Although initially intended as a wholly 
graduate institution, it was agreed that a collegiate section would also be maintained. 
But instruction at the lower level was regarded as a subordinate function, as re ected by 
the opening (1876) enrollment of 54 graduate students and 23 undergraduates. A system 
of graduate fellowships was instituted to assemble quickly a corps of well-trained Ph.D. 
candidates. The fellowship idea was not entirely new in American education, but no 
other program offered stipends so large or so numerous—22 were awarded in the rst 
year of operation—as those at Johns Hopkins. Many of these fellows eventually joined 
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the faculties of other institutions, spreading the word that research universities could and 
should exist in the United States. 

In 1876, the distinguished British mathematician James Joseph Sylvester was called 
to Johns Hopkins to take charge of the mathematics department at America’s newest seat 
of learning. Sylvester had been admitted to Cambridge University in 1831, becoming one 
of the best scholars in his class. Cambridge was controlled by the Church of England, 
and English law required signing articles of faith before a diploma could be conferred. 
Since Sylvester was Jewish, he was unwilling to submit to this religious test and so 
was barred from receiving a degree. Far worse, he was deprived of the fellowship to 
which his standing in class entitled him. For three years following 1838, he taught at 
University College, London, where his old teacher De Morgan was one of his colleagues. 
At the age of 37, Sylvester accepted a chair in mathematics at the University of Virginia. 
He resigned after a few months’ service, however, because he was dissatis ed over the 
faculty’s failure to sustain him in exercising his authority over an insulting student in 
his course. (The young man had been reading a newspaper while the lecture was under 
way, and when reprimanded had refused to leave.) After this, Sylvester left the academic 
world for a time, serving as an actuary and a lawyer from 1845 to 1855. He seems 
to have given some private instruction in mathematics, numbering Florence Nightingale 
among his pupils. In 1855, Sylvester began new duties as professor of mathematics at 
the Royal Military Academy in Woolwich, a position he held until he was 55, the age 
set by English military law for granting pensions. He was thus free to accept a position 
at Johns Hopkins at its organization. The annual salary, $6000, was extremely generous 
for those days. With British conservatism, Sylvester stipulated that it be paid in gold. 

Sylvester’s reputation and scholarship drew to him a small body of earnest students 
seeking his guidance. A graduate of Vassar College, Christine Ladd, obtained special 
permission to hear Sylvester’s—but only Sylvester’s—lectures, although Johns Hopkins 
did not of cially admit women at that time. As a “special student,” her case was not 
expected to set any precedent. Ladd was later allowed (unof cially, of course) to attend 
other mathematics classes, including those of the renowned logician and philosopher 
Charles Sanders Peirce. By 1882 as Peirce’s student, she had written a doctoral disserta- 
tion, “The Algebra of Logic,” but school of cials proved unwilling to grant her a degree, 
however brilliant the work. On nishing her graduate studies, Christine Ladd married 
Fabian Franklin, a young member of the mathematics faculty at Johns Hopkins. Although 
refused a lectureship in 1893, she held such a position in logic and psychology from 1904 
to 1909 at Johns Hopkins and later acted in a similar capacity at Columbia University. 
As part of the University’s ftieth anniversary exercise in 1926, Johns Hopkins offered 
to award Ladd-Franklin an honorary degree for her distinguished research in the theory 
of color vision. Instead, she insisted on receiving the actual Ph.D. for which she had 
done all the required work. 

Doctoral degrees in mathematics had been given by Yale since 1862 and by Harvard 
since 1873. The earliest such degree actually conferred on an American woman was that 
granted by Columbia to Winifred Edgerton in 1886. In 1888, a year after her marriage, 
Winifred Edgerton Merrill participated in the founding of Barnard College, serving on its 
original Board of Trustees. When in her forties, she established and became the principal 
of a girls’ school. 

The history of African-American women holding advanced degrees is more recent 
in the life of American mathematics. The rst doctorates to be awarded in mathematics 
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were to Marjorie Lee Brown from the University of Michigan and Evelyn Boyd Granville 
from Yale, both in 1949. Brown spent the next 36 years on the faculty of North Carolina 
Central University, heading the mathematics department from 1951 until 1979. After a 
stay at Fisk University from 1950 until 1952, Granville went to work in industry and 
government for 16 years before returning to teaching at California State College in Los 
Angeles where she remained until her retirement in 1985. It might also be mentioned 
that the rst year in which an African-American received a Ph.D. degree in mathematics 
was 1925, when Elbert Cox completed his doctoral studies at Cornell University. 

Winifred Edgerton’s counterpart in England was Charlotte Angas Scott of Girton 
College, Cambridge. Girton was one of the new women’s colleges whose students, be- 
ginning in 1878, were allowed to attend most Cambridge University lectures—at rst, 
on the condition that they be accompanied by chaperones. After 1881, Girton College 
undergraduates were formally admitted to Cambridge degree examinations (though they 
could not receive the resulting degree) on the same terms as male students. Just one 
year earlier, Scott had been allowed to participate in Cambridge’s highly prized Math- 
ematical Tripos, an informal arrangement at the discretion of the individual examiners. 
Although she was judged as standing “equal in pro ciency to the eighth Wrangler” she 
could neither attend the award ceremony nor have her name of cially read out; but when 
the eighth place was announced the students chanted “Scott of Girton, Scott of Girton!” 
(Ten years later the brilliant individual performance of Philippa Fawcett in the Tripos 
placed her “above the Senior Wrangler.”) Although Scott spent nine years at Cambridge, 
Cambridge gave no advanced degrees to women and indeed did not do so until 1948. 
Thus, while her graduate research was directed by Cayley, her doctorate in mathematics 
was granted (1885) by external examination from the University of London. Immediately 
after obtaining her degree, Scott assumed the chairmanship of the mathematics depart- 
ment at the newly founded Bryn Mawr College in Pennsylvania. She was the rst woman 
living in the United States to hold a Ph.D. in mathematics, and she later directed seven 
doctorates granted by Bryn Mawr. 

It is due to the enthusiasm and ability of Sylvester more than any other one man 
that mathematical science in America received its remarkable impetus in the late 1800s. 
During his few years at Johns Hopkins, he founded the rst mathematical research journal 
in the United States, the American Journal of Mathematics (1878). The purpose of the 
journal was to make accessible the papers written by Sylvester, his pupils, and other 
mathematicians trained in America. European contributors added to its prestige. Of the 
rst 90 writers submitting articles, 30 were from foreign countries, and a third of the 
rest were students of Sylvester. In the period 1878 to 1900, Johns Hopkins awarded 32 
doctorates in mathematics, compared with 15 from Yale and 9 from Harvard. 

Sylvester resigned his position at Johns Hopkins in 1883 to become a professor 
of geometry at Oxford University. The school’s administrator, “not wishing to lose the 
impulse already given to mathematical studies among us,” invited the young German 
mathematician Felix Klein to succeed Sylvester. After some hesitation, Klein refused the 
offer and went instead to Géttingen, where he succeeded in making it as prestigious and 
as great as Berlin—if not more so. 

With Sylvester’s return to England, Johns Hopkins lost its momentum as the cen- 
ter of advanced mathematical training on American soil. But the founding of the new 
school in Baltimore had launched a fundamental change in the academic climate. The 
existing graduate schools began to awaken, modernize their curriculums, and undertake 
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the responsibility of advancing, rather than merely diffusing, knowledge. Older faculty 
members with poor quali cations gave way to better-trained mathematicians who had 
known the rigors of a Ph.D. program. By the end of the century, a well-known French 
mathematician, C. A. Laisant, was able to write of the situation in the United States: 


Mathematics in all its forms and in all its parts is taught in numerous universities, treated in 
a multitude of publications, and cultivated by scholars who are in no respect inferior to their 
fellow mathematicians in Europe. It is no longer an object of import from the old world, 
but it has become an essential article of national production, and this production increases 
each day both in importance and in quantity. 


As a result of the sudden increase in the tempo of mathematical research, the number of 
journals grew and meetings for the reading of papers became more numerous and active. 

In 1887, a Columbia University graduate student, Thomas Fiske (1865-1944), spent 
six months visiting Cambridge University where he was invited to attend meetings of the 
London Mathematical Society. He returned to New York City wanting America similarly 
to foster a “feeling of comradeship among those interested in mathematics.” Toward the 
end of 1888, Fiske and ve friends at Columbia took the initiative and formed what they 
called the New York Mathematical Society. Because the new organization was based in 
the mathematics department, most of its regular monthly meetings took place there. It 
was soon decided (1891) to publish a new mathematical journal, the Bulletin of the New 
York Mathematical Society. By 1894, the Society was attracting participants from beyond 
New York City for a total membership of 250. When it became evident that the name 
failed to indicate its widening scope, the Society adopted its present title of the American 
Mathematical Society; simultaneously, its periodical became the Bulletin of the American 
Mathematical Society. 

A national name, however, did not necessarily imply an organization of national char- 
acter. To provide mathematical interaction for those residing in the Midwest, a Chicago 
section of the society was sanctioned in 1897 with E. H. Moore as chairman. The increase 
in the activity of the members of the society and the quality of the papers presented at 
its section meetings led to a new publication outlet, the Transactions of the American 
Mathematical Society. It began publication in 1900, and Moore served as editor-in-chief 
for the rst eight years. Later that year, his colleagues awarded Moore their highest honor 
by electing him president of the society. The early contribution of Thomas Fiske, who 
had by then been appointed to a professorship in mathematics at Columbia, was also 
acknowledged when he too became president (the seventh president from the founding 
of the New York Mathematical Society). 

The unanticipated growth of “graduate programs” in small colleges led to the es- 
tablishment of a journal catering more to the needs of those teaching mathematics than 
the research-oriented American Journal of Mathematics. The American Mathematical 
Monthly was brought out in 1894, not by an of cial organization but by a teacher, Ben- 
jamin Finkel (1865-1947). Contributions by the Chicago faculty to its early volumes 
helped raise the prestige and level of mathematics of the Monthly. 

During the nal decade of the nineteenth century and the rst decade of the twentieth, 
advanced training in mathematics took rm hold in the universities of the United States. 
The commitment to the discipline at the research level was the result of the efforts of 
more than one institution, but the academic presence that helped achieve international 
respectability for American mathematics was the newly formed University of Chicago. 
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Since it was richly endowed by John D. Rockefeller, conventional wisdom describes 
Chicago as a great university from the moment its doors opened in 1892. Chicago, 
unlike Johns Hopkins, which had a deliberate slant toward graduate studies, had nearly 
one-third of its 700 students enrolled as undergraduates, one-fourth graduates, with the 
remainder distributed between the Divinity School and unclassi ed students. Among the 
original faculty possessing doctorates, there were 14 German degree holders and 21 from 
the United States. 

The rst major undertaking of the edgling mathematics department involved the 
organization of an International Congress of Mathematicians—the so-called zero’th 
Congress—to be held in connection with the World’s Fair in Chicago in 1893 (com- 
memorating the 400th anniversary of the discovery of America). Invited papers were 
given by illustrious Europeans, among them Géttingen’s Felix Klein who served as the 
event’s keynote speaker. Chicago’s triumvirate of E. H. Moore, Oskar Bolza, and Hein- 
rich Maschke each presented his own new research at the congress, thereby establishing 
the reputation of their department as a center of original work. After the congress, Klein 
delivered a two-week series of lectures at Northwestern University, the rst colloquium 
of American mathematicians. A gift of $500 from Christine Ladd Franklin enabled Mary 
Winston, then a graduate student at Chicago, to attend the colloquium. She was the only 
woman participant. 

The University of Chicago had an immediate effect on American mathematics by 
conferring 10 doctoral degrees in the discipline between 1896 and 1900. One of the rst 
two Ph.D’s in mathematics, Leonard Eugene Dickson, deserves special mention. After a 
year’s study tour abroad that took him to Leipzig, Dickson accepted appointments to the 
University of California in Berkeley and then Texas. Invited back to Chicago in 1900, 
he remained on its faculty for the rest of his academic career. 

Re ecting the abstract interests of his thesis advisor, E. H. Moore, Dickson initially 
pursued the study of nite groups. By 1906, Dickson’s prodigious output had already 
reached 126 papers. He would jokingly remark that, while his honeymoon was a success, 
he managed to get only two research articles written then. His monumental History of the 
Theory of Numbers (1919), which appeared in three volumes totaling more than 1600 
pages, took nine years to complete. By itself this would have been a life’s work for 
an ordinary person. One of the century’s most prolic mathematicians, Dickson wrote 
267 papers and 18 books covering a broad range of topics in his eld. An enduring bit 
of legend is his barb against applicable mathematics: “Thank God that number theory 
is unsullied by applications.” (Expressing much the same view, G. H. Hardy is reported 
to have made the toast: “Here’s to pure mathematics! May it never have any use.’’) In 
recognition of his work, Dickson was the rst recipient of the Cole Prize in algebra and 
number theory, awarded in 1928 by the American Mathematical Society. 

The prize is named after the algebraist Frank Nelson Cole (1861-1926), who spent 
two years at Leipzig studying with Felix Klein before obtaining his Ph.D. from Harvard. 
Later, in 1892, Cole published the rst American research paper on the theory of groups. 

Under the chairmanship of Moore, the mathematics department of Chicago became 
the source of the rst generation of U.S.-trained mathematicians to attain international 
reputation. Four of Moore’s early students, who took off in different mathematical di- 
rections, were to become the brightest stars of the twentieth century: the algebraist L. E. 
Dickson (1874-1954), the geometer O. Veblen (1880-1960), the mathematical physicist 
G. D. Birkhoff (1884-1944), and the topologist R. L. Moore (1882-1974). Together these 
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four mathematicians published 30 books and over 600 papers in addition to directing the 
research of almost two hundred doctoral candidates. It is not surprising that many consider 
E. H. Moore to be the driving force who changed the United States from a mathematical 
backwater into a world leader in the eld. 

By the turn of the new century, the country had taken control of its own mathematical 
destiny and began to surpass Germany, at least in the number of doctorates granted— 
quality may be another matter. 


The Twentieth-Century Consolidation 


The success of the newly formed University of Chicago encouraged older institutions 
to develop more ambitious, well-rounded graduate programs. Harvard University was just 
coming into prominence in the late 1890s, with the able duo of William Fogg Osgood 
and Maxime Bochner directing its mathematically talented students. Under the leadership 
of Henry Burchard Fine and James Pierpont, respectively, the departments at Yale and 
Princeton Universities blossomed somewhat later in the new spirit of research. That the 
rst three of these scholars wrote their doctoral dissertations under Felix Klein is another 
indication of his in uence on America’s growing mathematical activity. 

Chicago’s L. E. Dickson and E. H. Moore were among the increasing body of 
American mathematicians whose scholarly pursuits were beginning to achieve interna- 
tional reputation. In 1901, Dickson, then 27 years old, revised and expanded his doctoral 
thesis into a research-level book entitled Linear Groups with an Exposition of the Galois 
Field Theory. Brought out by a distinguished German publishing house, Linear Groups 
is said to have signaled the arrival of the Americans on the mathematical scene. The 
early direction of Moore’s work was in abstract algebra, with the most important result 
being that any nite eld contains p” distinct elements for some prime p and positive 
integer n; this occurred in a paper read at the 1893 Chicago International Congress and 
subsequently published in 1896. In 1905, the young Scots algebraist Joseph Wedderburn 
(who spent the year 1904-1905 at Chicago) and Dickson sharpened Moore’s theorem 
by showing, at virtually the same time, that the assumption of the commutativity of 
multiplication followed from his other hypotheses—in technical terms: any nite integral 
domain is a eld. Wedderburn went on to teach at Princeton University from 1909 until 
his retirement in 1945. By 1900, Harvard’s W. F. Osgood had published 21 research pa- 
pers, 6 in German, while his colleague Maxime Bochner had produced 30. Both were to 
write landmark textbooks that became standard at home and abroad. Osgood’s Lehrbuch 
der Funktionentheorie rst appeared in 1907 and went through four more editions in the 
next 20 years. Béchner’s treatise, Introduction to Higher Algebra, also came out in 1907 
to be followed by a German translation in 1909. 

The “Hopkins experiment” of modeling the German university on American soil had 
far-reaching consequences for higher education, particularly graduate education, in the 
United States. The day of the learned amateur was all but over: where in 1850 there 
were but eight graduate students in the whole country, the total had increased to some 
5700 by 1900. The discovery and dissemination of knowledge was widely established as 
a major part of a university’s mission; with it came a growing acceptance of the doctoral 
degree as the standard credential for entry into an academic career. Fed by an unrelenting 
growth in student enrollments along with a higher level of instruction (made possible by 
the almost universal adoption of the elective system), there was increasing demand for 
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specialization of the faculty in their chosen elds. The output of Ph.D.s in mathematics 
nearly tripled in the years 1900-1910 and then doubled again in the next decade. Those 
mathematically inclined were no longer forced to look to the other side of the Atlantic 
for serious training, not even to Germany for postdoctoral study. 

The University of Chicago soon outstripped Johns Hopkins as the leading producer 
of future researchers. One-fourth of the mathematics Ph.D.s awarded in the opening 
quarter of the century were conferred by Chicago alone. Five institutions—Chicago, 
Johns Hopkins, Harvard, Yale, and Cornell—were responsible for more than half of the 
doctorates. In 1924, E. H. Moore proudly reported that by then his university had granted 
131 doctorates, and 52 of the recipients were already full professors in their respective 
institutions. 

In the same period, further periodicals and learned societies made their debut on the 
academic scene. The Annals of Mathematics was begun by Ormond Stone at the Univer- 
sity of Virginia in 1884, with the intent of printing papers of intermediate dif culty. It 
gradually evolved into a more research-oriented journal after moving to Harvard in 1899 
and nally to Princeton in 1911. On the last day of 1915, The Mathematical Association of 
America came into being as the second national organization in the discipline. The strug- 
gling American Mathematical Monthly, subsidized at that time by 12 colleges and univer- 
sities in the nation’s Middle West, gained wider support by becoming the of cial publica- 
tion of the new body. In a similar way, the lower-level Mathematics Teacher (founded in 
1908 as a much needed forum devoted to the teaching of mathematics in the high schools) 
was taken over by the National Council of Teachers upon its establishment in 1920. 

A new generation of American-educated mathematicians was beginning to produce 
research of a quality competitive with that of Europe. Two of the most prominent g- 
ures were George D. Birkhoff and Norbert Wiener. Birkhoff, who was acknowledged to 
be the country’s leading mathematician, made fundamental contributions to differential 
equations and the theory of dynamical systems. He received his doctoral training under 
E. H. Moore, then taught brie y at both the University of Wisconsin and Princeton. After 
achieving international recognition by proving a conjecture of France’s Henri Poincaré, 
Birkhoff joined the faculty of Harvard in 1912 as an assistant professor. He remained 
there the rest of his career, rising to a full professorship in 1919. 

Norbert Wiener (1894-1964) was groomed by an overbearing father, a professor 
of Slavic languages at Harvard, to be a “child prodigy.” At the age of 11, he enrolled 
at Tufts College, from which he graduated in 1909, not yet 15 years old. Wiener then 
entered Harvard Graduate School with the intention of concentrating in zoology. His 
interest turned elsewhere, and he chose to earn a doctorate in 1913 with a thesis on 
mathematical logic. The award of a traveling fellowship from Harvard enabled him to 
visit Cambridge University to study with the logician and philosopher Bertrand Russell. 
At Russell’s urging that he broaden his background in higher mathematics, Wiener took 
several courses from Cambridge’s distinguished analyst G. H. Hardy. In the spring of 
1914—-shortly before the outbreak of the Great War—he went on to work with David 
Hilbert at Géttingen. He joined the ballistics laboratory at the Aberdeen Proving Ground 
when America entered the con ict. 

In 1919, Wiener accepted a position at the Massachusetts Institute of Technology 
(MIT) as an instructor. The institute had begun its instruction in Boston, but in 1912 
had moved across the Charles River into Cambridge, not far from Harvard University. 
The mathematics department was not particularly distinguished when Wiener arrived. It 
seemed to view itself as merely a service department, whose major role was to teach 
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calculus to the budding engineers. Wiener’s appointment was the rst step in the trans- 
formation of what was essentially an engineering college into a research institution of 
national rank. He would go on to publish over two hundred articles on various aspects 
of pure and applied mathematics. His many-sided interests extended to such topics as 
Brownian motion, generalized harmonic analysis, postulational theory, stochastic pro- 
cesses, and information theory. With the presence of Birkhoff at Harvard and Wiener at 
MIT, Cambridge soon eclipsed Chicago as America’s center of mathematical activity. 

Like Birkhoff, Oswald Veblen was one of E. H. Moore’s early doctoral students. 
His dissertation (1903) provided a system of axioms for Euclidean geometry different 
from those in Hilbert’s recently published Grundlagen der Geometrie: where Hilbert 
used three unde ned elements (point, line, and plane) Veblen employed just two, “point” 
and “between.” In 1905, Veblen presented the rst rigorous proof of the Jordan curve 
theorem. Stated by Camille Jordan (1838-1922) in his Cours danalyse , it asserts that 
any simple closed curve—one that does not intersect itself—separates the plane into 
two disjoint regions, the interior and exterior of the curve. Veblen’s career was spent 
in Princeton, teaching at the university from 1905 until 1932 and as a member of the 
newly created Institute for Advanced Study from 1932 until 1950. He produced a number 
of groundbreaking papers and books connected with geometry. The rst volume of the 
two-volume Projective Geometry (1910, 1918), written with J. W. Young, established the 
axiomatic foundations of several geometries including projective geometry. The Analysis 
Situs (1922) was the rst work in English to deal with what is now called algebraic 
geometry. This was followed by Foundations of Differential Geometry (1932), written 
jointly with J. H. Whitehead. In 1928, Veblen and Birkhoff were both invited to give 
hour addresses at the International Congress of Mathematicians held in Bologna. 

By the end of the 1920s, research-oriented mathematics had become rmly estab- 
lished in the United States. During that decade, 351 American doctorates were granted. 
The custom of traveling to Europe for advanced training in mathematics was no longer 
seen necessary. Although it was still generally felt that Harvard, Princeton, and Chicago 
had the leading departments, the programs of a number of other institutions were be- 
coming competitive. At the University of Texas, for example, the former Chicagoan R. 
L. Moore (a student both of E. H. Moore and of Oswald Veblen) had been making 
point-set topology a major area of research since 1920. His unconventional approach to 
teaching—which came to be called the Moore Method—allowed students to discover 
the proofs of theorems by themselves. During his long career, R. L. Moore was able to 
supervise some 50 doctorates in topology. 

Compared with the European mathematical community, the American community 
had a substantial presence of women. The nature of college education was transformed 
with the founding of women’s colleges (some 120 by the year 1900) and the expansion 
of the land-grant universities in the three decades after the Civil War. The proportion 
of women among the nation’s college graduates climbed upward in the years 1900- 
1929 from about a_ fth to two- fths. Against this background, the leading graduate 
schools slowly relaxed their policies and opened their programs to both sexes on equal 
terms. Among them were several conservative institutions in the East, which still refused 
to enroll women in their undergraduate departments. The University of Chicago was a 
champion of coeducation at all levels from its very beginning. The rst woman to receive 
a Ph.D. in mathematics from Chicago obtained her degree in 1903; more striking is that 
18 of the 67 doctoral dissertations guided there by L. E. Dickson would be written by 
women. 
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Work the following problems, found in nineteenth-century 
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The prestigious eastern women’s colleges were an early source of research-level 
training. From its opening in 1885, Bryn Mawr College offered small doctoral programs 
in several elds. The English-born and educated Charlotte Angas Scott served as advisor 
for seven mathematics dissertations, the rst being written in 1894 by Ruth Gentry. 
Gentry had also been the rst woman to attend lectures at the University of Berlin, but 
was not allowed to enroll for a degree there. 

By the late 1920s, roughly 15 percent of the doctoral recipients in mathematics were 
women. With the onset of the Great Depression and the American entry into World War 
II, this historical level of participation was not reached again until 1979, having declined 
dramatically to a low of 5 percent of the doctorates in the 1950s. 

England’s renowned universities, Oxford and Cambridge, reacted to the efforts to 
secure female admission but did so by devices that did not provide full educational privi- 
leges. Their solution was to incorporate women’s colleges—Girton College af liated with 
Cambridge University in 1873, for instance—as a separate part of the university struc- 
ture. The colleges were permitted to confer their own degrees, a device that still did not 
signify recognition by the universities. (This compromise of having coordinate women’s 
branches was adopted in the United States when Harvard University started Radcliffe 
College, known as the “Harvard Annex,” in 1879; Columbia University followed suit 
in 1894 by opening Barnard College.) The barriers to formal academic recognition of 
English women were overcome with time. Interim concessions included a certi cate 
of pro ciency setting forth a candidate’s success in university examinations at Oxford 
or Cambridge and, later, an offer of a diploma (in no way equivalent to a degree) 
to those who had advanced through the university’s undergraduate curriculum. Oxford 

nally granted full membership, with access to the titles of all its degrees, in 1920. 
Cambridge followed in 1948 when Queen Elizabeth, the mother of Queen Elizabeth II, 
became the rst woman to receive a Cambridge degree. 


4. Three men A, B, and C built a house that cost 500 
dollars, of which A paid a certain sum. B paid 10 
dollars more then A, and C paid as much as A and B 
both. How much did each man pay? 


American arithmetic and algebra textbooks: 


1. 


A schoolmaster being asked how many scholars he 
had said, “If I had as many more as I now have, half 
as many, one-third, and one-fourth as many, I should 
then have 148.” How many scholars had he? 

A purse of 100 dollars is to be divided among four 
men, A, B, C, and D, so that B may have 4 dollars 
more than A, and C 8 dollars more than B, and D 
twice as many as C. What is each one’s share of the 
money? 

A man driving geese to market is met by another 
who said, “Good morning, master, with your 100 
geese.” Says he, “I have not 100; but if I had half as 
many as I have now and 2 geese and a half, beside 
the number I now have already, then there would be 
100.” How many geese did he have? 


The sum of the ages of a father and son is 100 years. 
Also 1/10 of the product of their ages, in years, 
exceeds the father’s by 180. How old are they? 


In a certain family, 11 times the number of children 
is greater by 12 than twice the square of the number. 
How many children are there? 


One man and two boys can do in 12 days certain 
work that could be done in 6 days by three men and 
one boy. How long would it take one man to do it? 


A man walking from town A to another town B at 
the rate of 4 miles an hour, starts one hour before a 
coach that goes 12 miles an hour, and is picked up 
by the coach. On arriving at B, he observes that his 
coach journey last two hours. Find the distance from 
A to B. 


Counting the Infinite 673 


France had been the center of European math- 


(pay Counting the Infinite ematics in the early part of the nineteenth 
century, but around 1840 lost its dominant 
The Last Universalist: Poincaré position to Germany. In the closing decades 


of the century, the country again showed great 
mathematical strength through the contributions of such scholars as Henri Poincare, 
Emile Picard, Jacques Hadamard, Emile Borel, René Baire, and Henri Lebesgue. Al- 
though the research achievements of this illustrious group tended to concern the theory 
of functions of a real variable, they left a rich inheritance in many elds. Hadamard, 
for instance, is more often remembered today for his 1896 proof of the Prime Number 
Theorem. 

A signi cant role in French mathematical rejuvenation can be attributed to the Ecole 
Normale Supérieure, founded in 1808 by Napoleon to train the academic elite. In many 
ways, the prestigious institution came to resemble a German university of the late nine- 
teenth century. A highly competitive entrance examination served to select the enrollment 
from across the entire nation. The Ecole Normale Supérieure was the only school where 
subject matter was taught in a conférence, a type of seminar, rather than through formal 
lectures. The instructors themselves were frequently faculty members of the University 
of Paris. All of the above-mentioned mathematicians, save for Poincaré, experienced this 
stimulating atmosphere as students. 

At the close of the nineteenth century, Henri Poincaré was viewed as the greatest 
mathematician of his time. His French compatriots claimed that the country had borne no 
equal in a hundred years, not since d’Alembert and Laplace. Poincaré grew up in Nancy, 
France, where his father was a successful physician. The family had a distinguished tra- 
dition of serving the government in various posts. A cousin, Raymond Poincaré, became 
president of the Republic in the years of the Great War, while another cousin, Lucien 
Poincaré, rose to minister of public instruction. 

The youthful Henri Poincaré was not precocious, or at least not recognized as 
being so. A bout of diphtheria, which he contracted at the age of ve, weakened his 
health and left him unt for physical exercise. This kept him out of active military 
combat service during the Franco-Prussian con ict, although he accompanied his father 
on ambulance rounds. Poincaré’s talent became apparent when in 1873, he placed rst 
among all French applicants on the entrance examination to the Ecole Polytechnique. 
He followed the mathematics courses without ever taking a note, for he was extremely 
nearsighted and, although ambidextrous, wrote badly with either hand. But Poincaré 
read very rapidly, and, blessed with a phenomenal memory, he managed to retain almost 
everything that passed before his eyes. 

Upon graduating from the Ecole Polytechnique after two years, Poincaré proceeded 
to study engineering at the Ecole des Mines. He worked brie y as a mining engineer 
while at the same time writing a doctoral dissertation on differential equations; the 
degree was granted by the University of Paris in 1879. Immediately thereafter, Poincaré 
began his academic career as an instructor at the University of Caen. In 1881 he was 
appointed professor of mathematical physics at the University of Paris, where he spent 
most of the remainder of his life. Poincaré was elected a member of the Académie des 
Sciences at the early age of 33 and, in 1905, was awarded the Bolyai Prize—named in 
honor of John Bolyai—by the Hungarian Academy of Science for contributions to the 
progress of mathematics. He died an untimely death at the full height of his mathematical 
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powers, suddenly stricken with an embolism after having almost completely recovered 
from a surgical operation a few days earlier. 

Poincaré’s poor eyesight and absentmindedness—such as forgetting his breakfast 
and packing hotel towels in his briefcase—were the subject of numerous anecdotes. On 
one occasion, he arrived at his apartment only to realize that he was holding a wicker 
birdcage that he had apparently removed from an open market stall enroute. 

Poincaré was a mathematical universalist—probably the last one—in the sense that 
he worked in, or at least followed, the developments in almost every area of the subject. 
He made important contributions to virtually all branches of pure and applied mathe- 
matics as well as physics, astronomy, and the philosophy of science. His writing output 
was enormous. Altogether, Poincaré produced more than 30 books and a deluge of 500 
articles, many of a less technical nature directed toward the general public. (Weierstrass, 
whose work was never quickly published, was moved to remark it was a pity that 
Frenchmen brought out their results in a succession of little papers.) Poincaré did not care 
to linger long in one eld; a contemporary remarked of him, “He was a conqueror, not a 
colonist.” 

An early success of Poincaré was his devlopment of the main theory of automorphic 
functions. He called these “fuchsian functions” in honor of the German mathematician 
Lazarus Fuchs whose poineering work drew his attention to the area. Poincaré’s subse- 
quent investigations on curves and surfaces in higher dimensions resulted in his memoire 
Analysis Situs (1895), which was to determine the future course of algebraic topology. 
This pioneering work led him to raise a celebrated question (1904) known today as the 
Poincaré Conjecture. In rough terms, he asked: if a three-dimensional body shares certain 
speci ed topological properties with a sphere, is the body itself a sphere, or a deforma- 
tion of a sphere? In spite of much effort over the years, the question resisted all attempts 
at solution until 2003, when the Russian mathematician Grigori Perleman veri ed the 
conjecture. 

Among Poincaré’s other contributions was an easily comprehended model of 
Lobachevskian geometry. Here, Poincaré’s universe consists of the interior points of 
some ordinary circle C in the Euclidean plane. The role of “lines” is played by either 
diameters of C, minus their endpoints, or by arcs of circles that intersect C orthogonally, 
again without endpoints. Because in this model angles of intersecting circular arcs are 
measured by the ordinary Euclidean angle between tangents, it can be shown that the 
angle-sum of any “triangle” will always be less than 180°. 
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Poincaré devoted signi cant attention to the solution of differential equations that 
arise in celestial mechanics. Of persistent concern was the shape of a rotating homo- 
geneous uid mass held together by the force of gravity. Where Colin Maclaurin had 
shown years earlier that the mass could adopt the form of an ellipsoid of revolution, 
Poincaré succeeded (1885) in proving that a pearlike gure was also possible. 

In 1887, King Oskar II of Sweden and Norway sponsored a prize of 2500 crowns 
for a solution of the vexing three-body problem: Given three bodies (above all, sun- 
moon-earth) of known mass, velocity, and position, determine the subsequent position of 
each of them if they are acted upon by a mutual gravitational attraction only. Poincaré 
concentrated instead on a simpli ed form of the problem, which he called the “restricted 
three-body problem,” with one of the bodies being so small that its own gravitational 
effect on the others can be ignored. While he did not completely solve this case, he made 
suf cient progress as to merit the coveted award. The prize-winning essay appeared in 
the recently founded (1882) Scandinavian journal Acta Mathematica. 

On reading the printed version of the paper, Poincaré discovered a major error in the 
work. Further distribution of the periodical was halted and readers were asked to return 
the limited number of copies already in circulation. Poincaré agreed to use his award 
money, along with a further 1000 crowns, to cover the expense of printing the suppressed 
submission. Publication of the prize-winning entry was delayed for over a year as the 
substantial alterations were carried out. The considerably enlarged revision, entitled Sur 
le problém des trois corps et les équations de la dynamique, took up 270 pages of the 
December 1890 issue of Acta Mathematica. It became the cornerstone upon which the 
modern theory of dynamical systems was built. 

Poincaré’s continuing interest in planetary trajectories led to his monumental three- 
volume Les Méthodes Nouvelles de la Méchanique Celeste (1892, 1893, 1899). The trea- 
tise was hailed as a worthy successor to Laplace’s Mécanique Céleste in opening up 
entirely new ground in the subject. Poincaré had shown the solutions of the differential 
equations used to describe motion in a system of three or more bodies incorporated not 
only regular and periodic movement, but also unexpected erratic behavior over the long 
term. The series solutions depended in a very sensitive way on the initial conditions 
chosen. His analysis had profound implications for the question of the stability of the 
solar system. In 1963, it was established that the solar system, despite strong numerical 
evidence of future chaotic behaviors, will nevertheless survive roughly in its present form 
for millions of years. 

Poincaré in his later years developed a side interest of writing for a wider audience. 
Among his popular books on science and mathematics are La Science et [Hypotheses 
(1902), La Valeur de la Science (1905), and Science et Méthode (1908). The last work 
seeks to explain the process of mathematical discovery and invention, stressing the role 
of the subconscious mind. For Poincaré, the essence of discovery is the “intuition of 
mathematical order” in which useful already-known information is placed. This is hidden 
from conscious awareness, only to leap forth unexpectedly in “sudden illumination,” 
perhaps after prolonged incubation. He wrote, “Logic and intuition each has its necessary 
role. Each is indispensable. Logic alone, which can give certainty, is the instrument of 
demonstration; intuition is the instrument of invention.” 

Toward the end of 1911, Poincaré had a premonition that he might not live much 
longer. He asked the editor of a mathematical journal if, contrary to established practice, 
he would accept an incomplete article putting forth a topological result that Poincaré had 
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tried in vain to prove. Poincaré felt that the solution would lead its researchers “on a new 
and unexpected path” toward extending his results on the three-body problem. The article 
was accepted and appeared the next year. Shortly thereafter, what is known as “Poincaré’s 
last geometric theorem” was established by the 28-year-old Chicago-trained George 
D. Birkhoff (1884-1944), an American who had never studied in Europe. It greatly 
enhanced the international prominence of the emerging American school of research 
mathematics. 

In the waning decades of the nineteenth century, the mathematical world looked 
toward Germany for inspiration and guidance. This was a tribute to its scholarship and 
an expression of con dence in its practitioners, for a now-united Germany had more and 
better universities than most of its neighbors. Although Lord Palmerston derided Germany 
as “a land of damned professors,” students from abroad ocked there for instruction. 
After the retirement of Weierstrass, Géttingen displaced Berlin as the nation’s preeminent 
research institution. Klein joined Gottingen’s faculty in 1886, and Hilbert arrived in the 
mid-1890s. 


Cantor’s Theory of Infinite Sets 


This is not the place to recount in detail the events whereby Prussia reached as- 
cendancy in a united Germany. Suf ce it to say that the long-desired uni cation of the 
German states, with the exception of Austria, was accomplished as a result of the Franco- 
Prussian War in 1871. The new German Empire was a union of the governments of 26 
states of various sizes and one administrative territory, the conquered provinces of Al- 
sace and Lorraine. The rights enjoyed by the member states were not equal, because they 
were all in one way or another subject to Prussia. The constitution gave Prussia a degree 
of prominence and power that was consistent with its territory, population, and military 
prowess in bringing about the stunning victory over France. Not only was the hereditary 
leadership of the empire vested in the king of Prussia with the title German Emperor, but 
the minister-president was nearly always the imperial chancellor, the head of the federal 
government. Thus, the new empire did not re ect the submergence of Prussia in Ger- 
many but represented the extension of Prussian in uence to the whole nation. Although 
Germany was to become the most powerful of the continental countries, it was, like the 
old Prussian kingdom, an autocratic military state. 

People have come to look back on the last third of the nineteenth century in Germany 
as a golden age of mathematical scholarship; and they are not unjustied in doing so, 
as even a short list of its university professors will indicate. Although mathematical 
generations inevitably overlap—as their ideas do also—the great names on the scene 
after 1870 are Georg Cantor, Richard Dedekind, Paul Gordan, Eduard Heine, David 
Hilbert, Otto Hélder, Adolf Hurwitz, Felix Klein, Leopold Kronecker, Ernst Kummer, 
Ferdinand Lindemann, Rudolph Lipschitz, Hermann Minkowski, Moritz Pasch, and Karl 
Weierstrass. One consequence of this galaxy of brilliance was that a state of intense rivalry 
and sometimes of bitter enmity existed continually in German mathematical circles. This 
was particularly manifest in the loudly voiced doubts over one of the most disturbingly 
original contributions to mathematics in 2500 years, Cantor’s theory of in nite sets. 
Whether the violent opposition was brought on more by the strangeness of the idea of 
the “actually in nite” or more by the forceful personalities of the individuals involved 
is hard to say. The result was the same. 
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Georg Cantor 
(1845-1918) 


(By courtesy of Columbia University, David Eugene 
Smith Collection.) 


Georg Ferdinand Cantor (1845-1918), although he was born in St. Petersburg and 
lived there until 1856, should properly be ranked among the German mathematicians, 
because he was educated and employed in German universities. His stockbroker father 
had urged him to study engineering, a more pro table pursuit than mathematics, and 
with this intention Cantor began his university studies at Zurich in 1862. The elder Can- 
tor nally agreed to allow his son to follow a career in mathematics, so that after a 
semester at Zurich he moved to the University of Berlin. There he attended the lectures 
of the great triumvirate, Weierstrass, Kummer, and Kronecker. In 1867, he received his 
Ph.D. from Berlin, having submitted a thesis on problems in number theory, a thesis 
that in no way foreshadowed his future work. Two years later, Cantor accepted an ap- 
pointment as privatdozent at Halle University, where he remained until his retirement 
in 1913. 

In uenced by Weierstrass’s teaching on analysis, Cantor’s initial research dealt with 
trigonometric series. A sequence of ve articles issued between 1870 and 1872 culminated 
in showing that the uniqueness of the representation of a function by a trigonometric 
series holds even if convergence is renounced for an in nite set of points in the interval 
[0, 27]. Because Cantor’s uniqueness proof depended heavily on the nature of certain 
point sets in the real line, and only to a lesser extent on trigonometric series, it was only 
natural for him to explore the consequences of the former. The birth of set theory can 
be marked by Cantor’s next published paper, Uber eine Eigenshaft des Inbegriffes aller 
reellen algebraischen Zahlen (On a Property of the System of all the Real Algebraic 
Numbers), which is found in Crelles Journal for 1874. Over the next two decades, 
the need for comparing the magnitudes of in nite sets of numbers led Cantor, almost 
against his will, to his notion of trans nite numbers, and to immortality. Growing out 
of specic problems posed by trigonometric representation, and reaching full articulation 
in Cantor’s lengthy survey Beitrage zur Begriindung der Transfiiten Mengenlehre_ of 
1895 (translated into English in 1915 under the title Contributions to the Founding of 
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the Theory of Transfiite Numbers ), set theory gained an autonomy as a mathematical 
discipline. 

The year 1872 was signi cant for mathematics in many ways. Cantor laid the outline 
of an entirely new eld of research. There was also Klein’s celebrated inaugural address 
when he became professor at Erlangen. The year also saw Weierstrass’s presentation 
to the Berlin Academy of an example of a continuous nondifferentiable function. And 
Dedekind published Stetigheit und irrationale Zahlen, in which he constructed the irra- 
tional numbers in terms of his famous “cuts.” The problem of irrational numbers had 
existed from the time of the Pythagoreans, but until 1872 no successful attempt had been 
made to give them a precise mathematical meaning. They “existed” as decimal approx- 
imations, and the logical basis of, say, 7 was no more sure than its approximation to 
707 places by William Shanks in 1853. (It took him 15 years of calculation, and later an 
error was found in the 528th place.) Cantor’s attention was directed toward these matters 
when he realized that an understanding of the nature of the elusive irrationals lay at 
the root of his proof of the uniqueness of the trigonometric representation. In his paper 
of 1872, the year of Dedekind’s construction, Cantor devised a rigorous formulation of 
the irrational numbers by means of what we should today call Cauchy sequences. Thus, 
during the 1870s, Weierstrass, Dedekind, and Cantor all succeeded in developing alge- 
braically self-contained theories of the irrational numbers, but they substituted an appeal 
to set-theoretic intuition for the limit concept. 

Cantor, in the rst sentence of his great synoptic work of 1895, tried to dene what 
he meant by a set (Menge, in the German). The words are not novel now, although they 
were then: 


By a set we are to understand any collection into a whole M of de nite and distinguishable 
objects of our intuition or our thought. These objects are called the elements of M. 


Although “collection into a whole” is at best a paraphrase of the notion of set, the terms 
defiite and distinguishable had a clear meaning to Cantor. The intended meaning of 
the former was that given a set M, one should be able to decide whether any particular 
element would belong to M; the attribute distinguishable is interpreted as meaning that 
any two elements of the same set are different. The implication is that a set is determined 
solely by what is in it, that is, by its elements. 

Cantor conceived of the notion of set in as general a way as possible. There was 
no restriction whatever on the nature of the considered objects nor on the way they 
were collected into a whole. Because his de nition was not precise enough to prohibit 
him from considering such things as the “set of all sets,” it ultimately led to some 
famous paradoxes concerning the in nite. (Paradoxes are apparently contradictory results 
obtained by apparently impeccable logic.) These paradoxes, which threatened the very 
foundations of logic and mathematics, necessitated the renement of Cantor’s naive 
concept of “set.” The attempted improvements in the de nition were so unsuccessful 
in identifying the notion that today we nd it convenient to take set and element as 
unde ned terms. 

It should be emphasized that Cantor was not the only one, or even the rst, to be 
interested in the properties of in nite sets. Galileo noticed the curious circumstance that 
part of an innite set could, in a certain sense, contain as many elements as the whole 
set. In his Dialogue Concerning the Two Chief World Systems (1632), he made the telling 
observation: “There are as many squares as there are numbers because they are just as 
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numerous as their roots.” He asked which of the two sets, squares or natural numbers, 
could be the larger one. Seeing in this discovery only a puzzle, he abandoned the subject 
because it was not amenable to reason. 

Cantor gave a precise meaning to “as many” by interpreting the phrase to require 
that there exists a one-to-one correspondence between the two sets in question: 


Two sets M and M’ are equivalent (equipotent, equinumerous), written M ~ M’, if there 
exists a one-to-one correspondence between their elements. 


It is clear that two nite sets are equivalent provided that they have the same number of 
elements. But Cantor’s de nition of equivalence does not use the notion of niteness in 
any essential way. It depends only on the idea of one-to-one correspondence, which can 
be applied to all sets, nite or not. In Galileo’s example, the set of natural numbers is 
equivalent to the set of perfect squares via the mapping that sends a natural number n to 
its square n?. This shows that a set may be equivalent to a subset of itself. 

Up to now the terms fiite set and infiite set have been used in an informal way, 
but they can be given a precise meaning through the notion of equivalence. Because 
everyday experience involves encounters with nite sets only, the usual custom is rst 
to dene a nite set in the positive sense, and then to take an in nite set as one that is 
not nite: 


A set M is nite if either it is empty or there exists a natural number n such that M ~ 
{15.235 seas n}; otherwise, M is in nite . 


The rst positive steps toward a theory of sets were taken in the mid-nineteenth cen- 
tury by Bernhard Bolzano (1781-1848), a Bohemian priest who was dismissed from his 
post as professor of religion at the University of Prague for heresy. Although Bolzano was 
concerned mainly with social, ethical, and religious questions, he was attracted by logic 
and mathematics, especially analysis. Unfortunately, most of Bolzano’s mathematical 
writings remained in manuscript form and did not attract the attention of his contempo- 
raries or directly in uence the development of the subject. (Many were published for the 
rst time in 1962.) Bolzano’s small tract Paradoxien des Unendlichen (Paradoxes of the 
In nite), which was published three years after his death by a student he had befriended, 
contains many interesting fragments of set theory; in fact, the term set made its initial 
appearance here. Familiar with Galileo’s paradox on the one-to-one correspondence be- 
tween natural numbers and perfect squares, Bolzano expanded the theme by giving more 
examples of correspondences between the elements of an in nite set and a proper subset. 
What had perplexed Galileo and what Bolzano had regarded as a curious property of 
in nite sets was elevated by Dedekind—who earned his doctor’s degree under Gauss—to 
the status of a de nition of the in nite. In 1888, Dedekind published a small pamphlet, 
Was sind und was sollen die Zahlen (The Nature and Meaning of Numbers), in which he 
proposed a de nition of infiite that had no explicit reference to the concept of natural 
number: 


A set M is innite if it is equivalent to a proper subset of itself; in the contrary case, M is 
nite . 


This was adopted by Cantor, whose work developed along a direction parallel to that of 
his personal friend, Dedekind. 

Cantor spent considerable effort defending himself against the opposition of many 
mathematicians who regarded the infiite more as a description of unbounded growth, 
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expressed by some symbol like oo, than of an attained quantity. According to the tradi- 
tional conception, the in nite was something “increasing above all bounds, but always 
remaining nite.” As it appeared in the work of Cantor, it was “ xed mathematically by 
numbers in the de nite form of a completed whole.” What disturbed the critics most was 
that an “‘actually in nite” set was an abstraction to which there could be no correspond- 
ing physical reality—there was no evidence that in nite collections of physical objects 
existed. Whose views carried more authority than those of the Prince of Mathematics, 
Carl Friedrich Gauss? The in uence of this monumental gure most surely set the tone 
of the mathematical world up to almost the end of the century. In a famous letter to 
Schumacher, written in 1831, Gauss posed his horror of the in nite: 


As to your proof, I must protest most vehemently against your use of the in nite as something 
consummated, as this is never permitted in mathematics. The in nite is buta gure of speech; 
an abridged form for the statement that limits exist which certain ratios may approach 
as closely as we desire, while other magnitudes may be permitted to grow beyond all 
bounds....No contradictions will arise as long as Finite Man does not mistake the in nite 
for something xed, as long as he is not led by an acquired habit of mind to regard the 
in nite as something bounded. 


Not satis ed with merely de ning in nite sets, Cantor proposed something even more 
shocking and impious—endowing each set with a number representing its plurality. This 
would allow him to distinguish in nite sets by “size,” and to show, for example, that 
there are “more” real numbers than there are integers. Some mathematicians of the day 
could accept, albeit reluctantly, Cantor’s in nite sets, taking an attitude that has been 
compared with that of a gentleman toward adultery: better to commit the act than utter 
the word in the presence of a lady. It was an actually in nite number that was forbidden, 
and its use forced Cantor to live the rest of his life within a storm. 

Cantor’s attempt to measure sets led him to introduce the notion of cardinal numbers. 
In his earliest papers, he found it prudent to adopt a neutral attitude toward cardinal 
numbers, saying what they are supposed to do and not what they actually are: 


Two sets have the same cardinal number or have the same power if they are equivalent. 


Thus, for Cantor, a cardinal number is “something” attached to a set in such a way 
that two sets are assigned the same cardinal if and only if they are equivalent. In his 
nal (1895) exposition of their theory, he tried to remove this vagueness by means of a 
de nition “by abstraction”: 


If we abstract both from the nature of the elements and from the order in which they are 
given, we get the cardinal number or power of the set. 


The cardinal number of the set M was thus taken to be the general concept common 
to all sets equivalent to M. The process of double abstraction, or disregarding both the 
special properties of the elements and any ordering within the set, is the origin of the 


double bar in Cantor’s symbol M for the cardinal number of the set M. The modern 
notation o(A) for the cardinal number of A will serve our purposes quite well. 
Cantor’s “de nition” of cardinal number can hardly be regarded as satisfactory, and 
various attempts were made to formalize the concept. The logician Gottlob Frege in his 
Grundlagen der Arithmetik of 1884 suggested a de nition that did not become widely 
known until Bertrand Russell, who had arrived at the same idea independently, gave 
prominence to it in his Principles of Arithmetic (1903). This so-called Frege-Russell 
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de nition is beautiful in its simplicity: The cardinal number of a set A is the set of all 
sets equivalent to A. (Unless logical precautions are taken, of course, there may not exist 
a set that contains all sets with a given property.) On the other hand, John von Neumann 
(1928) selected a xed set from the set of all sets equivalent to A to serve as the cardinal 
of A. Whatever a cardinal number is is perhaps immaterial; all we need assert is that 
it is an object shared by just those sets that are equivalent to each other. The cardinal 
number of a nite set is said to be a_ nite cardinal, whereas the cardinal number of an 
in nite set is called a “trans nite cardinal.” 

In building up an arithmetic of trans nite numbers analogous to ordinary arithmetic, 
Cantor became a mathematical heretic. The outcry was immediate, furious, and extended. 
Cantor was accused of encroaching on the domain of philosophers and of violating 
the principles of religion. Yet, in this bitter controversy, he had the support of certain 
colleagues, most notably Dedekind, Weierstrass, and Hilbert. Hilbert was later to refer 
to Cantor’s work as “the nest product of mathematical genius and one of the supreme 
achievements of purely intellectual human activity.” 


Kronecker’s View of Set Theory 


Cantor’s former professor, Leopold Kronecker (1823-1891), became the focus of 
Cantor’s troubles, a sort of personal devil. Kronecker had made important contributions 
to higher algebra, but in matters pertaining to the foundations of mathematics he did 
little more than openly criticize the efforts of his contemporaries. The son of a wealthy 
businessman in Liegnitz, Prussia, Kronecker was provided private tutoring at home until 
he entered the local gymnasium, where Ernst Kummer happened to be teaching. (Because 
no university position was open at the time Kummer was awarded his Ph.D., he taught 
for 10 years in his old gymnasium.) While still at Leignitz, Kronecker became interested 
in mathematics through Kummer’s stimulation and encouragement. In 1841, he enrolled 
at the University of Berlin, then the mathematical capital of the world, where he studied 
with Dirichlet, Jacobi, and Eisenstein. 

The German student of that day was free to attend the lectures of his choice or even to 
move from one university to another, restricted by no formal curriculum and responsible 
only, in the end, to his examiners. Following this custom, Kronecker spent the summer 
of 1843 at the University of Bonn, then migrated to Breslau for two semesters. There his 
former teacher Kummer was then a professor. Subsequently Kronecker returned to Berlin 
to write a thesis on algebraic number theory under Dirichlet. Temporarily obliged to leave 
the academic world in order to manage the prosperous family business, Kronecker was 
for 11 years unable to return to Berlin and to his hobby, mathematics. By this time, 
the University of Berlin was beginning to experience a new owering in mathematics, 
brought on by the arrival of both Kummer and Weierstrass. On Kummer’s nomination, 
Kronecker was elected a member of the Berlin Academy of Sciences in 1860; this 
position entitled him to deliver lectures at the university and he regularly availed himself 
of the prerogative, beginning in 1861. Because the wealthy Kronecker could afford to 
teach without holding a chair, he refused the professorship in mathematics at Gottingen 
held successively by Gauss, Dirichlet, and Riemann. Feeling the onset of a decline in 
productivity, Kummer suddenly decided to retire in 1882. Kronecker was then called 
upon to succeed his old mentor, thus becoming the rst person to hold a position at 
Berlin who had also earned a doctorate there. 
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Leopold Kronecker 
(1823-1891) 


(By courtesy of Columbia University, David Eugene 
Smith Collection.) 


Kronecker was a tiny man, who was increasingly self-conscious of his size with 
age. He took any reference to his height as a slur on his intellectual powers. Making 
loud voice of his opinions, he was venomous and personal in his attacks on those whose 
mathematics he disapproved; his opinions relative to the new theory of in nite sets were 
ones of ire and indignation. As Cantor’s bold advance into the realm of the in nite was 
based largely on nonconstructive reasoning, Kronecker categorically rejected the ideas 
from the start. He asserted dogmatically, “De nitions must contain the means of reaching 
a decision in a nite number of steps, and existence proofs must be conducted so that 
the quantity in question can be calculated with any required degree of accuracy.” Any 
discussion of in nite sets was, according to Kronecker, illegitimate since it began with 
the assumption that in nite sets exist in mathematics. 

Kronecker not only objected strenuously to Cantor’s uninhibited use of in nite sets, 
but to most of contemporary analysis. His principal concern was with the new formula- 
tions of irrational numbers, Dedekind’s by his device of “cuts,” Weierstrass’s by classes 
of rational numbers. Kronecker felt that these produced numbers that could have no ex- 
istence. Returning to the ancient Pythagorean vision, Kronecker gave loud voice to the 
view that all mathematics must be built up by nite processes from the natural numbers. 
This counterrevolutionary program is revealed in his oft-quoted dictum, “God created 
the natural numbers, and all the rest is the work of man.” It is not too surprising that 
Kronecker found Weierstrass’s analysis unacceptable, lacking as it did constructive pro- 
cedures for determining quantities whose being was merely established by the free use 
of “theological existence proofs.” One day he reduced the distinguished old man to tears 
with an abrasive remark about “the incorrectness of all those conclusions used in the 
so-called present method of analysis.” Seeing in these words an attempt by Kronecker to 
tear down a whole life’s work, Weierstrass severed all ties with his erstwhile colleague. 

Although Kronecker’s notion of mathematical existence angered and embittered 
Weierstrass, it was the high-strung Cantor who was wounded most seriously by such un- 
compromising skepticism. Cantor had hoped to obtain a professorship at the University 
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of Berlin, possibly the highest German distinction that could be secured during the period 
of his productivity. But the opposition to his work was growing, especially for its use 
of the “actually in nite.” In Berlin, the almost omnipotent Kronecker blocked Cantor’s 
every attempt to improve his position; and when a professorship at Géttingen was to be 
made, Cantor was passed over in favor of lesser candidates. All Cantor’s professional 
career, some 44 years, was spent at Halle University, a small school without partic- 
ular reputation. The temperamental Cantor suffered deeply under what he considered 
Kronecker’s malicious persecution, with the tragic outcome that he sustained a complete 
nervous breakdown in 1884. Although he recovered from this crisis within a year, mental 
illness was to plague him through the remainder of his life. 

In the hostile intellectual world, Cantor found an in uential friend in Gésta Mittag- 
Lef er, who had studied under Weierstrass in Berlin. Mittag-Lef er’s wife was a mil- 
lionaire, so that he was nancially able to establish a new mathematical journal, Acta 
Mathematica. Hoping to make a noteworthy start, he proposed issuing French transla- 
tions of the most important papers that Cantor had so far published. No doubt he had in 
mind the good fortune of Crelle, who began his journal with a plentiful supply of work 
by Abel. These translations, which appeared in volume 2 (1883) of Acta Mathematica, 
contributed to the spread of Cantor’s ideas on set theory. 

Even so sympathetic a supporter as Mittag-Lef er failed to appreciate the revolu- 
tionary character of Cantor’s research. He asked Cantor to withdraw a comprehensive 
account of the properties of ordered sets that was intended for the seventh volume of 
Acta Mathematica (1885-1886). Mittag-Lef er suggested that because the paper did not 
contain the solution of some important problem, it would be better not to publish it but 
to allow the results to be rediscovered—in say one hundred years’ time—when it would 
be found that Cantor had possessed them much earlier. Aggrieved at being told that 
his paper was “one hundred years premature,” Cantor nevertheless complied with this 
unfortunate request. As events fell out, the corrected page proofs of the rejected article 
were indeed rediscovered among Cantor’s surviving papers. The paper was published, 
85 years later, in the 1970 Acta Mathematica. 

Cantor had become exhausted in the hard struggle to gain recognition for his work. 
Beginning with the rst of his attacks of depression in 1884, the rest of Cantor’s life 
was punctuated by bouts of mental illness that would force him to spend time in various 
sanitoria. His intervening periods of clarity were more often devoted to Elizabethan 
scholarship and religious writings than to mathematical activity. Apart from a short 
article in 1892 setting forth the “diagonal argument’ for the uncountability of the real 
numbers, Cantor published little on set theory until his comprehensive Contributions to 
the Founding of the Theory of Transfiite Numbers (in the Mathematische Annalen for 
1895 and 1897). This two-part memoir was less a collection of new ideas than a nal 
statement of many of the most important results going back to 1870. 

Only by the 1900s, when Cantor had ceased his research, did his ideas at last 
begin to receive some recognition. Initial distrust by the mathematical world turned into 
appreciation and even admiration. Of the various awards and honorary degrees belatedly 
bestowed on Cantor, the Sylvester Medal of the Royal Society of London (1904) is worth 
particular mention because it is so rarely given. The rst comprehensive textbook on set 
theory and its applications to the general theory of functions, The Theory of Sets of Points, 
was published in 1906 by William Henry Young and his wife Grace Chisholm Young. 
Initial homage came mainly from abroad, and as late as 1908, Cantor complained to the 
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Youngs of the lack of importance attached to his work by the Germans “who do not seem 
to know me, although I have lived among them for fty-two years.” In 1915, an event 
of international importance was planned at Halle to celebrate his seventieth birthday; but 
because of the war, only a few close German friends were able to gather to pay him 
honor. Cantor died in a psychiatric clinic in Halle in 1918. 

From our present vantage point, we see that Cantor won from the next generation of 
mathematicians the recognition that most of his contemporaries denied him. Although the 
discovery of the paradoxes of the in nite were to force the modi cation of many of his 
ideas, the main concepts of set theory survived to become cornerstones in the foundations 
of many other branches of mathematics. Kronecker, on the other hand, despite his great 
authority, failed to gain supporters for his “mathematical nihilism.” Faithful adherence to 
the position that existence statements are meaningless unless they contain a construction 
for the asserted object would result in the abandonment of much of modern mathematics. 
Kronecker was contending against the unquestionable fact that proofs of pure existence 
often produce the most general results with the least effort. An in exible advocate of 
his convictions, Kronecker in his violent opposition of Cantor’s work succeeded only in 
curbing its early development for two decades. 


Countable and Uncountable Sets 


As Cantor turned away from the established traditions of mid-nineteenth-century 
analysis and focused on linear point sets, a new era in mathematics opened. In 1874, he 
published his rst purely set theoretic work, On a Property of the Collection of All Real 
Algebraic Numbers, in which he made a distinction between two types of in nite sets on 
the real line. Dedekind, writing in his memoir Stetigkeit und die Irrationalzahlen (1872), 
had already perceived this distinction: “The line L is in nitely richer in point-individuals 
than is the domain R of rational numbers in number-individuals.” The term “countable” 
was later used by Cantor to describe the simplest kind of in nite, one that has the power 
of the natural numbers. 


Definition 


A set A is said to be countable (denumerable, enumerable) if there is a one-to-one 
correspondence between it and the set N of natural numbers. Infiite sets which are 
not countable are called uncountable (nondenumerable). 


This de nition affords us a certain convenience. If a set A is countable, then for a partic- 
ular one-to-one correspondence between A and N that element of A associated with the 


natural number n may be labeled a,,. This allows us to write A = {a, a2, 43,...,n,---}; 
with the elements of A listed in the form of a sequence. The converse is also true: A set 
that can be designated {a), a2, 43, ..., dy, .-.} 1s countable. 


The rst thing to notice about countability is that it is the smallest kind of in nity. 


Every infiite set contains a countable subset. 


Proof. Let an in nite set A be given. Choose an element of A and call it a}. Because A 
is anin nite set, A — {a,} isin nite also; choose one of its elements and call it ay. Then 
A — {a,, a2} is an in nite set; indeed, after k repetitions of this selection process, A — 
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{a,, d2,..., a} is still in nite, and we can choose from it a next element a,,;. Because 
there is no stage of this process at which we lack a successor to the elements already 
chosen, this selection scheme produces a countable subset {a),dz,..., ay, ...} of A. 


| 


The next theorem uses in its proof the well-ordering principle for positive integers: 
Any set of positive integers has a smallest element. 


THEOREM A subset of a countable set is either fiite or countable. 


Proof. Let B be a subset of the countable set A = {a , a2,...}. We shall list 

the elements of B in the order in which they occur in A, calling b; the element of B 
that has the least subscript when viewed as an element of A. If B — {b,} is not 
empty, it too has an element with the least subscript when viewed as an element of 
A; call this element b>. 


If after k repetitions of this process, it is found that B — {b,, bo, ..., by} is 
empty, then clearly B = {b,, bo,..., bg} is a nite subset of A. 
If on the contrary, B — {b,, bo, ..., bg} remains nonempty for each positive 


integer k, then we can always choose b;+, as its element with the least subscript in 
A. In this way it is possible to construct a countable subset {b;, bo,...} of B. 

To see that each element b of the set B is a member of the countable subset 
{b,, bz, ...}, recall that since D is in A, then b = a, for some n. At some stage in the 
construction of {b;, b2,...}, certainly by the nth step, a, must have been that 
element of B with the least subscript when viewed as an element of A. Then b = a, 
belongs to the countable set {b,, b2,...}; indeed it is one of its rst nm elements. 
Hence B = {b,, bo, ...} is a countable subset of A. 


The springboard from which to prove many results on countable sets is the following 
theorem. 


THEOREM The union of a countable number of countable sets is a countable set. 


Proof. We consider a countable collection {A;, Az,..., An, ...} of sets A; each of 
which is itself countable. Thus, for each 7, the set A; can be displayed in sequence 
form as A; = {dj}, Gj2,..., Gin, ...}. The theorem will become evident when a 

sequence is constructed in which all the elements of all the set A; appear. Because 


_ 7o™N “™s ; 
Ay = (411, 412, 413, .- +, Ay, ...}; 


i 
Ay = {a91, 297, 4935--+-, A n»-- ao 


gully ya 
4 =(1431, 432, 233... + Aap. 


— fs 
Ay = 1s Cx Gigyec< 5 Bigs P55 
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we may use the back-and-forth diagonal path indicated to list all the aj; 
systematically as a sequence: 


11, 412, 421, 431, 422, 413, 414, 423, 432, 441,...- 


Every element of each set A; will eventually be encountered somewhere on the path; 
hence, a one-to-one correspondence between the set of a;; and the set N of natural 
numbers is implied (any element that is repeated can be deleted from the list when 
we come to it a second time). 

It is not hard to write down the actual formula giving the one-to-one 
correspondence from UA; to N being used in this process. In fact, one can easily 
show that 

(G+ j—-1)G@+j-2) 
2 
G+j-D)E+j—2) 
2 
but the description of our listing is so simple that there is no need to use this explicit 
correspondence. 


+j ifi+ 7 is even 
f (aij) = 
+i ifi+ 7 is odd 


——— ss 


With this theorem we acquire several useful corollaries. 


The union of a fiite number of countable sets is countable. 


Proof. If countable sets A,, A2,..., An are to be considered, we may use the 
theorem on taking Ay+1, Ani2,... to be the empty set. 


a 


The set Z of integers is countable. 


Proof. Certainly the set N of natural numbers is countable. We display the set of all 
the integers 


Za NW Ss 3 oo ad 


to see that Z is a union of two countable sets. 


a 


The set Q of rational numbers is countable. 


Proof. First consider the positive rational numbers, classifying them by 
denominator: 
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That is, all positive rational numbers with denominator | appear in A;, those 
with denominator 2 appear in A>, and so forth. The union UA; is the set Q, of 
positive rational numbers, and just as in the proof of the theorem, we see that it is 
countable. With the obvious modi cations in the sets A;, it can also be inferred that 
the set Q_ of negative rationals is countable. Because 


Q=Q,U{0}UQ-, 
an appeal to Corollary 1 completes the proof that Q is a countable set. 


a 


A more “visual” proof of the countability of Q4 is obtained by writing out the 
positive rationals as the array 


a ™“ a ™“ —™ 
1 2 3 4 5 
1 1 Ll 
ul 5 
2 2 2 2 2 
CH dl a 5 
pe 3 3 
1 2 3 4 5 
ia 4 4 4 
C 2 3 4 5 
5 5 5 5 5 


All that is left to do is to start counting down the diagonals in the manner of the last 
theorem: that is, we traverse the diagram as shown by the arrows, discarding rational 
numbers like Z, z, 3, and 4, which are equal to numbers that have been previously 
passed. The enumeration according to our procedure begins with 
eae 2 ee, 
2. 3 2345 
In this way, we get an in nite sequence in which every positive rational number occurs 
exactly once. 
By what we have just proved, we must conclude that the set Q of rational numbers 
and the set Z of integers are of “equal size,” despite the inclusion of the second set in 


1,2 


ee Te 
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the rst. You will probably want to object vigorously. Doesn’t this contradict the famous 
principle formulated by the Greeks that the whole is greater than any of its parts? Cantor 
realized that in dealing with in nite sets this principle, which holds for nite sets, must 
be abandoned. The sets Q and Z have the same number of elements because they are 
equivalent; their elements can be matched against each other. If we accept “equivalent” as 
the criterion for comparing the sizes of in nite sets, then we must put aside the traditional 
way of thinking and agree that “the part has the power of the whole” in this instance. 
This is the essence of Galileo’s paradox. 

Because all the in nite sets considered so far have had the same power, one might 
get the impression that all in nite sets are equivalent. Cantor’s theory would be trivial 
if there were no uncountable sets, no kinds of in nity beyond the countably in nite. 
Cantor gave two proofs that the set of real numbers cannot be arranged in one-to-one 
correspondence with the natural numbers. The rst, which appeared in the pathbreaking 
paper On a Property of the Collection of All Real Algebraic Numbers (1874), involved 
taking a nested sequence of closed intervals and claiming the existence of a limiting 
number contained within all these intervals. We shall describe Cantor’s second proof 
(1891), this being both simpler in form and more general in application. It uses what is 
today known as Cantor’s diagonal argument. 


The set R of real numbers is uncountable. 


Proof. Let us assume that the theorem is false, so that the real numbers form a 
countable set. Each in nite subset of the real numbers must then be countable, by an 
earlier theorem. The subset we want to consider is the set of all real numbers x 
satisfying 0 < x < 1. Every such real number has a nonterminating decimal 
expansion 0.x%1x2x3...X,..., where each x; represents a digit in the expansion; that 
is, 0 < x; < 9. Not only is there an expansion of this form for each real number 
between 0 and 1, but some numbers have two such expansions. The confusion is 
caused by decimals in which all the digits assume the value 9 after a certain point, 
numbers like 0.36999 ..., which is no different from 0.37000... = 0.37. To 
eliminate this ambiguity, let us rule out those expansions ending in an in nite string 
of zeros. In other words, we shall not identify the real number — by 0.37000..., 
but rather by 0.36999.... 

The set of real numbers between 0 and 1, being countable, can be displayed as 
{a|, 42, 43,---,Gy,---}, and each a; has an in nite decimal expansion. Let us say 
that 0.a;14;24)3 ...@jn ... is the decimal expansion of a;. We can then write the 
elements of the set under consideration as the array 


A; = 0.0;1 4;24j3 *** Gin **s 


where 0 < aj; < 9 for all i and j. 
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We intend to construct a real number b, with 0 < b < 1, which appears nowhere 
on our list. The desired contradiction is established and the theorem proved when 
such a number b is produced. Our assumption that the set of real number is 
countable has led to a claim that any real number between 0 and | must be one of 
the listed a;. 

Looking down the diagonal of the preceding array, we shall form b as an in nite 
decimal, digit by digit, so that it disagrees at some decimal place with each of the 
a;. For a given value of i, if the “diagonal digit” a;; = 1, then we put b; = 2, and if 
a;; 4 1, then we put b; = 1. To illustrate: if a; = 0.31429....,a) = 0.81621,..., 
a3 = 0.58207... happened to be the rst three numbers in our list, then the decimal 
expansion of b would start off as b = 0.121.... (We have avoided the possible 
ambiguity that might arise from an in nite sequence of zeros by making sure that 
the decimal expansion of b doesn’t have any.) 

This procedure de nes a new number b that equals none of the a; just listed, 
since b; 4 a;;; that is, the decimal expansion of b differs from the decimal 
expansion of a; at least in the ith place. But b is surely a real number between 0 and 
1, which contradicts our assertion that the set of all such numbers is 
countable. 


| 


Although this last theorem confounded hopes of a certain tidiness of the in nite sets, 
it also meant that the varieties of in nity are richer than rst expected. The distinction 
between countable sets and uncountable sets is not an empty one. Cantor, somewhat at 
a loss for new symbols for trans nite cardinals, called the power of the natural numbers 
No, where 8 (aleph) is the rst letter of the Hebrew alphabet. The countable cardinal 
Xo is the rst of the trans nite cardinals. By showing that the set of real numbers was 
uncountable, Cantor demonstrated that their cardinal number c (for “continuum’’) strictly 
exceeded Xo. 

Although any countable set has cardinality Xo, it does not follow that any uncount- 
able set must have cardinality c. The test for whether a given set has cardinality c is 
to see whether it can be placed in one-to-one correspondence with the set of all real 
numbers. For instance, the set of real numbers can be regarded as the union of the 
rational numbers and the irrational numbers, where the rationals have been shown to 
be countable. Were the irrational numbers to form a countable set also, a reference to 
an earlier theorem would convince us that the reals are countable also. We have just 
proved that this is not the case, so we must conclude that the set of irrational num- 
bers is not countable. Nothing that has been said so far, however, convinces us that 
the set of irrational numbers has the same power as the continuum does. And from 
another point of view, were we to nd a set whose power exceeds c, it would be a 
noncountable set. 


Transcendental Numbers 


For the moment we have been considering a division of the real numbers into two 
subsets: those that are rational and those that are not. It is also possible to think of 
another partition of the real numbers, into those that are roots of equations and those that 
are not. 
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Definition 


A number, real or complex, is said to be an algebraic number if and only if it is the 
root of the algebraic equation 


agx" + ajx""! +++ +a, 1X +a, =0, ay #0 


where n is a natural number and each aq; is an integer. 


A moment’s pause will show that any rational number is algebraic; if r is a rational 
number, say r = s/t where s and ¢ are integers, then r is the solution of the equation 
tx —s =0. On the other hand, not all algebraic numbers are rational; the irrational 
number V2 is algebraic, because it is a solution of x? -2=0. Similarly, 573, as a root 
of the equation 8x* — 3 = 0, is an algebraic number. 

Let us use the letter A to denote the set of real algebraic numbers. Then in the 
hierarchy of subsets of real numbers, we have an ascending chain of sets: 


FEO ECACR 


The question naturally arises whether the set A constitutes the totality of all real numbers, 
or whether there are real numbers that are not algebraic. Such numbers are called “tran- 
scendental,” for as Euler said, “They transcend the power of algebraic methods.” Just as 
it was by no means obvious that irrational numbers exist, so it was by no means obvious 
that transcendental numbers exist. No satisfactory answer to the question of their exis- 
tence was found until 1844, when the great French analyst Joseph Liouville (1809-1882) 
obtained a whole class of such numbers, namely, those de ned by the in nite series 


ee) 

—n! 
) an 10’, 
n=1 


where each a, is an arbitrary integer between | and 9. These so-called Liouville numbers 
are characterized by increasingly long blocks of zeros interrupted by a single nonzero 
digit, as with 


= 107 10 + 10 + 10-19 es 
= 0.1100010000000000000000010. . . . 


The proof of their transcendence was eventually set forth (1851) by Liouville in his own 
periodical, Journal des mathématiques, in a memoir under the title “On a very extensive 
class of quantities which are neither algebraic nor reducible to algebraic irrationals.” 

For quite a time, Liouville’s transcendental numbers were the only ones known. The 
situation changed radically when Cantor, in his 1874 paper, used relatively simple set- 
theoretic methods to show that “almost all” real numbers are transcendental, not algebraic. 

To develop the proof of the countability of the set of algebraic numbers, we begin 
by considering an arbitrary algebraic equation 


Gt” Pax” 4 tae +a eo, 


where do, d|,...,@, are integers and dj > 0. The height of this equation is de ned to 
be the integer 


h =n+adao+ lay] +--+ + lan—1| + lanl. 
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Because the integers ao and n are at least one, the height # > 2. Thus, for instance, 
3x2 — 2x + 1 = 0 has height h = 2+3+2+1=8. Forany xed height h, the integers 
N, A, 4),-++;A4n—1, 4, can be specied in only a nite number of ways, thereby leading 
to a nite number of equations; each such equation can have at most as many different 
roots as its degree. Thus, there are just a nite number of algebraic numbers arising from 
equations of a given height. By grouping the algebraic equations according to height, 
starting with those of height 2, then taking those of height 3, and so on, one can write 
down the set of algebraic numbers in a sequence. This is what Cantor actually did. 

When an equation’s height is 2, we must have n = ay = | and a, = 0, so that the 
equation must be x = 0. Then 0 is the sole root of this equation, hence the rst algebraic 
number. 

Let us arrange equations of height 3 (and also equations of larger heights) rst 
according to the increasing degrees of the equations; then let us arrange all equations of 
the same degree according to the size of their initial coef cients, those with the same 
rst coefcient according to the second, and so on. This enumeration scheme yields: 


Equation Roots 
x+1=0 =) 
2x =0 0 
x7=0 0 


We shall be explicit about one more step by considering equations that satisfy the 
condition n + do + |a;| +--+ + |a,| = 4. There are various possibilities, as listed here: 


Equation Roots 
ea 2 =0 +2 

Ix t1=0 +3 
3x =0 0 
wrtx=0 0,+1 
etl =0 1 ti 
2x? = 0 0 

xe =0 0 


We are trying to list distinct, real algebraic numbers, so let us at each stage discard 
imaginary numbers and any numbers that are roots of an equation of lower height. Where 
there are several real roots of the same equation, we shall order the numbers according 
to their increasing magnitude. With these conventions, we get the following sequence of 
algebraic numbers: 


il <i a a2 
OG =1;1.=9 9. 3.3: Oe ne 


Pay 2 
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An algebraic number is a root of an algebraic equation and every such equation has a 
height; we cannot “miss” any algebraic number with this scheme. Thus, our argument 
proves a theorem. 


The set A of (real) algebraic numbers is countable. 


The initial half of Cantor’s 1874 paper established the countability of certain sets, 
such as the algebraic numbers, which scarcely seemed at rst glance to possess this 
property. But it was the second half that contained the more profound result and the rst 
great triumph of set theory: The set 7 of transcendental numbers is uncountable. The set 
of real numbers, R = A UT. If it happened that T were countable, then R as the union 
of two countable sets would itself be a countable set. This contradicts an earlier theorem, 
so the transcendentals must be uncountable. The remarkable thing about this argument 
is that it demonstrates the existence of an uncountable set of real numbers (to wit, the 
transcendental numbers), no member of which had been constructed or exhibited in any 
way. To Kronecker, such unconstructive existence proofs were sheer nonsense, without 
any hope of redemption. 

Cantor’s proof of the uncountability of the transcendentals has only a theoretical 
character and is not of much use in determining whether certain specic numbers are 
actually transcendental. The problem of the transcendence of the classical constants e 
and z attracted mathematicians as soon as Liouville had justi ed the distinction between 
algebraic and transcendental numbers. Because the numbers e and z are closely connected 
by the Euler equation e7/ + 1 = 0, the investigation of their nature was carried on at 
much the same time. (Felix Klein once observed that Euler’s celebrated formula “is 
certainly one of the most remarkable in mathematics,” relating ve important symbols, 
each with its own history.) The irrationality of e had been demonstrated earlier by Euler 
(in 1737, published in 1744), and Liouville had shown in 1840 that neither e not e” 
could be a root of a quadratic equation with integral coef cients. This was the rst 
step forward in verifying that e cannot be classed among the algebraic numbers. But 
more than 30 years passed before Charles Hermite (1822-1901), in 1873, published a 
memoir entitled Sur la fonction exponentielle, in which he succeeded in establishing 
the transcendental character of e. Although Hermite’s paper marked the beginning of a 
prosperous period in the recognition of speci c transcendental numbers, he himself turned 
his attention elsewhere. He expressed his view in a letter to a former pupil, Carl Wilhelm 
Borchardt: 


I shall risk nothing on an attempt to show the transcendence of z. If others undertake it, no 
one will be happier than I at their success, but believe me, my dear friend, this cannot fail 
to cost them some effort. 


Within a decade, Ferdinand Lindemann was able to conrm in an article Uber 
die Zahl mw (1882) that w is transcendental, modeling his proof on Hermite’s. 
Lindemann’s argument required the theorem that for any distinct algebraic numbers 
Q1,Q,...,@,, real or complex, and any nonzero algebraic numbers £1, B2,..., B,, the 
expression 


Bre fre poesia 
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must always be nonzero. Because the complex number 7, as a root of the equation 
x*>+1=0, is algebraic, and since e7! + e° = 0, it follows that the number zi and 
therefore a cannot be algebraic. 

Lindemann’s victory over the obstinate z left the doubters still skeptical, and loud 
among them rose the voice of Kronecker. In a conversation with Lindemann, he com- 
plained, “Of what use is your beautiful investigation regarding 7? Why study such 
problems when irrational [hence, transcendental] numbers do not exist?” 

The proof of the transcendence of z was far more exciting than the proof for e, 
because it put an end to the ancient dream of “squaring the circle,” that is, constructing 
with straightedge and compass alone a square that equaled a given circle in area. This 
requires the construction of a line segment whose length is ./77, which can be accom- 
plished if a line segment of length z is constructible. The construction of a segment of 
specied length is possible only if that length is a root of a special algebraic equation. 
But Lindemann showed that z is not a root of any algebraic equation, whence a segment 
of length z is not constructible by Euclidean tools (nor is any transcendental length). A 
man distinguished more by industry and determination than by mathematical brilliance, 
Lindemann achieved greater fame than Hermite for this discovery based on Hermite’s 
work. 

To prove that some specic real number is transcendental is usually dif cult, and 
only recently have such numbers as 2v? and e7 = (—1)7 been disposed of. At the 
International Congress of Mathematicians held in Paris in 1900, Hilbert asked, as the 
seventh of his 23 outstanding unsolved problems, whether a is transcendental for any 
algebraic number a # 0, | and any algebraic irrational 6. Later in a number theory lecture 
at Gottingen (1919), he speculated that the resolution of the problem lay further in the 
future than a proof of Fermat’s last theorem, and that no one present in the lecture hall 
would live to see it successfully concluded. Modern progress has been more rapid than 
Hilbert anticipated: for the desired transcendence was established, independently and by 
different methods, in 1934 by A. O. Gelfond in Russia and T. Schneider in Germany. 
Still the best efforts of mathematicians have succeeded in proving the transcendence of 
only a relatively limited class of numbers. Such numbers as e°, 7°, 2°, 77, and 2” have 
not yet been classi ed as algebraic or transcendental. 

Before ending this digression, we should observe that although the matter of its tran- 
scendence was put to rest, there was still a concern with obtaining an accurate numerical 
value for wz. In 1853, the Englishman William Shanks (1812-1882) used the in nite 
series for the arctangent function 


arctan x =x —x°/34x°/5—---, |x| <1, 
together with the formula 
mz = l6arctan(1/5) — 4 arctan(1/239), 


to hand-calculate the rst 607 purported digits in the decimal expansion of z. He later 
returned to his computations and by 1874 had worked out a total of 707 digits. For the 
next 70 years this approximation stood as the accepted decimal value of z. The error in 
Shank’s evaluation was not caught until 1945, when it was observed that his expansion 
for a seemed to disfavor the digit 7; that is, 7 appeared noticeably less frequently than 
did any of the other nine digits. A check on his accuracy revealed erroneous gures 
from the 528th decimal place onward (the 528th digit should be 4, but Shanks called 


694 


Chapter 12. Transition to the Twentieth Century: Cantor and Kronecker 


it 5). Within the next four years, the decimal representation had been extended to 1120 
correct digits, using a desk calculator. 

With the advent of computers, the evaluation of the digits of z proceeded with a 
frenzy. The rst such determination in 1949 produced 2037 digits in 70 hours elapsed 
time. By 1961 at least 100,000 decimal places were available. The number of known 
digits increased rapidly as larger and faster machines came on the scene. A million 
decimal gures were reached in 1973 with a calculation that required only 24 hours 
running time; and, by 1987, the 100 million digit mark was passed in roughly a day and 
a half of work. 

Certainly such accuracy far exceeds utilitarian concerns: a value for z correct to 
20 decimal places is suf cient for any imaginable application. But in recent years the 
computation of its decimal expansion has become a popular check on the reliability of 
new supercomputers and their software. The latest feat carried the approximation of z 
beyond the fty-billion-digit barrier; this number, if printed out, would stretch across the 
United States. Because z-records are made to be broken, many billions of digits more 
will no doubt soon be aroused from their deep slumber. Probably the most signi cant 
mathematical motivation for these large-scale calculations is to investigate whether 
the digits in the decimal expansion of z are “statistically random”; that is, whether 
the expansion shows no preponderance of any one of the 10 digits 0 through 9. The 
billions of digits now known suggest that this is the case, but questions concerning the 
distribution of decimal digits of particular numbers such as 2, e, and even /2 appear 
beyond the scope of current mathematical techniques. 


The Continuum Hypothesis 


Cantor, having succeeded in proving the existence of in nite sets with the same 
“power” and with different “powers,” went on to attack new and bolder problems. In 
the paper A Contribution to Manifold Theory submitted to Crelles Journal in 1877 
and published the following year, he showed that the points in a square, “clearly two- 
dimensional,” can be put in one-to-one correspondence with the points of a straight 
line segment, “clearly one-dimensional.” Quite unprepared for this paradoxical result, 
which seemed to cloud the concept of dimension, Cantor tried to discuss it with fellow 
mathematicians, but they treated the whole idea as absurd, even with contempt. Cantor 
himself found the result so odd that he wrote to Dedekind, “I see it, but I do not 
believe it,” and asked his friend to check the details of the proof. (Because Dedekind 
recognized immediately that Cantor’s one-to-one mapping was not continuous, he did not 
read the same signi cance into Cantor’s counterintuitive discovery that Cantor himself 
had.) Publication of the paper was postponed time and time again in favor of manuscripts 
submitted at a later date. The presence of the skeptical Kronecker on the editorial board 
seemed to impede its progress. After Dedekind intervened, the dif culties were eventually 
resolved, but Cantor never again permitted his work to appear in Crelles Journal . 

To establish that the unit square S, de ned by 0< x <1 and 0< y < 1, and the 
interval [0, 1] are equivalent, it is suf cient to match their points in a one-to-one fashion. 
Give a point (x, y) of S, let us represent x and y as innite decimals, 


xX = 0.x1x2x3..., yY=O.yiy2y3... 
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with the usual proviso that in any decimal terminating in an in nite string of 0s, the Os 
are replaced by 9s (with the single exception of the number 0 itself). By taking digits 
alternately from x and y, it is possible to form the decimal representation 


Z = 0.x 1 X292X3Y3-.- 


of a number z in the interval 0 < z < 1. Conversely, a knowledge of the decimal expan- 
sion of z permits us to reconstruct x and y by “unlacing” the digits of z. The obvious 
approach then is to pair the point (x, y) of the square with the point z of the interval. 
Indeed, this was the strategy that Cantor used when he sent his rst “proof” to Dedekind. 

There is one drawback to all this, which Dedekind quickly pointed out to Cantor; 
although to each (x, y) there corresponds a single z, there exist values of z that arise 
from no (x, y) by the previous procedure. If, for example, z = 0.3404040404..., then 
unlacing the digits leads to 


x = 0.300000...,  y=0.44444.... 


And because x consists exclusively of zeros after the rst digit, it is not written in 
admissible decimal form. Moreover, if the trailing 0s are replaced by 9s, then the in nite 

decimal form for this number x does not correspond to the specied number z. The 
dif culty can be remedied by breaking z into blocks of digits, instead of single digits, each 
block ending with the rst nonzero digit encountered. The transition from z back to (x, y) 
can be accomplished by alternating the blocks when forming the decimal expansions of 
x and y. Thus, for instance, 


Z = 0.2|7|03|009|4|06] .. . 
would be paired with the point of the square having 
x = 0.2034..., y = 0.700906.... 


By treating blocks as single digits for purposes of interlacing, we can also go from (x, y) 
to z in this manner. Thus, the one-to-one correspondence between geometric gures of 
different dimensions is established: 


The set of points in the unit square has cardinal number c. 


The publication of Cantor’s 1878 paper destroyed the feeling that the plane is richer 
in points than the line and forced mathematicians to take a fresh look at the concept of 
dimension. Because Cantor’s argument did not involve continuous mappings from one 
dimension to another, there was a_urry of activity to show that the result failed under the 
additional assumption that the mapping between the spaces should be continuous. Not 
until the appearance of an article by L. E. J. Brouwer in 1911, however, would there be 
established a rigorous proof of the invariance of dimension under continuous one-to-one 
mappings. 

Cantor’s most important investigations into set theory are spread across a series of 
six papers entitled Uber unendliche lineare Punktmannichfaltigkeiten (On In nite Linear 
Point Sets) published in the German journal Mathematische Annalen in the period 1879— 
1884. These brilliant papers, some of whose French translation appeared in Mittag- 
Lef er’s Acta Mathematica, constitute the acme of Cantor’s lifework. The underlying 


696 


Chapter 12. Transition to the Twentieth Century: Cantor and Kronecker 


concept is that of an accumulation point (a point p is an accumulation point of a set P if 
every neighborhood of p contains points of P). The fundamental theorem is the so-called 
Bolzano-Weierstrass theorem, rst proved by Weierstrass in his lectures at Berlin in the 
1860s and known to Cantor from these; this result states that every bounded in nite set 
of points in Euclidean n-space possesses at least one accumulation point. From this basic 
idea and theorem owed a host of new types of sets that today lie at the foundations of 
the theory: closed sets, perfect sets, sets of rst and second category, dense sets, and so 
forth. 

Cantor’s use of trans nite cardinals to compare sizes of in nite sets created a storm 
in the camp of the orthodox. Cantor dened an order relation in which the cardinal 
number of a set A is “smaller” than the cardinal number of a set B provided that A is 
equivalent to a subset of B. A precise de nition is given: 


Definition 


Let a and B be two given cardinals and A and B sets with a = o(A) and B = o(B). We 

write a < B if there exists a one-to-one mapping from A into B. We also write a < B 

ifa < B, but nota = Bp. 
This de nition conrms our intention that o(A) < o(B) whenever A C B, because if A 
is a subset of B, then the inclusion mapping i: A > B dened by i(a) =a is one-to- 
one, so that we have o(A) < o(B). In general, to show that o(A) < o(B), it must be 
demonstrated that there exists a one-to-one mapping from A into B, but no one-to-one 
mapping of A onto B. A note of caution—this is not equivalent to the statement “There 
is a one-to-one mapping of A that is into B, but not onto B.” 

We are now in a position to tie up two loose ends. We have observed that the set 
of irrational numbers and the set of transcendental numbers are both uncountable, but 
we have assigned a cardinal number to neither. Cantor showed, in fact, that whereas 
the algebraic numbers are countable, the transcendental numbers have cardinality c, the 
power of the continuum. Because every transcendental number is irrational, the irrational 
numbers must then also have cardinality c; for using T and / to denote the sets of 
transcendental numbers and irrational numbers, we have a chain of set inclusions 


TCICR. 
This leads to a corresponding inequality involving cardinals: 
c=0(T) < 0) < o(R) = c. 


There is no alternative but to conclude that o(/) = c. 

The inequality &o < c raises the question whether there are any sets with cardinality 
between Xo and c. On the face of the matter, there seems no reason why some uncount- 
able set should not have cardinality less than c; yet all attempts to discover a set of 
real numbers that is in nite and uncountable, but whose power is less than that of the 
continuum, have been unsuccessful. The conjecture that there is no cardinal number a 
satisfying No < @ <c is customarily known as the continuum hypothesis. This hypoth- 
esis may also be stated: Every in nite subset of R has cardinal number either Xo or c. 
It is frequently alleged that Cantor’s emotional breakdown in 1884 was caused by the 
strain of his prolonged, futile efforts to nda set of the desired intermediate cardinality. 
First he thought that he had succeeded in proving that the continuum hypothesis was 
true; the next day, he asserted that he could demonstrate its falsehood. Then he withdrew 
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this claim and announced a new proof of the conjecture. In the end, an embarrassed 
Cantor must have seen that all his purported proofs of the intransigent hypothesis were 
invalid. 

When Hilbert delivered his famous address of 1900 on outstanding problems awaiting 
solution by future mathematicians, the continuum hypothesis headed the list. Yet for 
decades the question eluded all efforts at resolution, and it is today “settled” in an 
unlooked-for sense. Kurt Gédel announced in 1938-1939 (and published in 1940) that 
the continuum hypothesis was consistent with the current axioms of set theory and hence 
could not be disproved. Twenty-four years later, Paul Cohen (1963) demonstrated that 
the continuum hypothesis was independent of the other axioms of set theory, thereby 
showing that it could not be proved within the framework of these axioms. Thus, on 
the basis of our present understanding of sets, Cantor’s conjecture remains in a sort of 
limbo, as an undecidable statement. It is ironic that so specic a problem as the rst on 
Hilbert’s list—that ambitious program put forward in such a spirit of optimism—should 
have its status changed from “unknown” to “unknowable.” 


7. Let C be the set of all circles in the Cartesian plane 
that have rational radii and centers at points whose 


. Prove that the following sets are countable: 


(a) {2,27,23,...,2")...}. 


are countable sets. [Hint: Show that 
Ax B=U(A x {b}), where A x {b} ~ A for any b 
in B.] 


. If S is the set of all right triangles whose sides have 
integral lengths, then S is a countable set. Prove this 
statement. 


coordinates are both rational. Show that C forms a 
countable set. [Hint: Consider the mapping 

f:€ +> QxQx Q dened by f(C) = (x, y, z), 
where (x, y) is the center and z the radius of a circle 
C inC.]) 


(b) {1,4,4,...,1/n,...}. 
(c) {5,10,15,...,5n,...}. 8. Prove the following: 
(d) 12 3 n (a) If Z,[x] is the set of all polynomials of degree 
VRA Cnt sy n with integral coef c ients, then Z,[x] is 
. Verify that the set N. of all even natural numbers and patios ie eat ye ae 
the set N,, of all odd natural numbers are countable; Ss a ae ene 
do the same for the sets Z, and Z, of all even and f(anx" +++++ a,x + ap) 
odd integers. = (Gieines fli, 
. Prove, by conrming that the function f:Q — N Mote that Pie Be sense Rowe 
dened by f(m/n) = 23" is one-to-one, that the set Gis RN ONG 
rc ce Hoes rational seas is countable. [Hint: (b) ‘The set Z[x] of all’ polynomials with intestal 
ence thE Se Mal coef cients is countable. [Hint: Show that 
. Use the theorems in this section to show that the set Z[x] = UZ, [x].] 
f pri bers i table set. : 
Rode ene n eaten gear y oe 9. Use Cantor’s diagonal argument to show that the 
. Establish that the Cartesian product A x B [that is, set of all in nite sequences of 0s and Is (that is, 
the set of all ordered pairs (a, b) with a in A and b of all expressions such as 11010001...) is 
in B] of two countable sets A and B is countable; in uncountable. 
panviculat conclude ea ay A ean 10. Determine whether each of the following sets is 


countable or uncountable: 


(a) The set of all numbers of the form m/2", where 
m is an integer and n is a natural number. 

(b) The set of all straight lines in the Cartesian 
plane, each of which passes through the origin. 
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(c) The set of all straight lines in the Cartesian 
plane, each of which passes through the origin 
and a point having both coordinates rational. 

(d) The set of all intervals on the real line having 
both endpoints rational. 

(e) Any innite set of nonoverlapping intervals on 
the real line. 


Prove that the set L of Liouville numbers, and hence 
the set of transcendental numbers, has cardinality c. 
[Hint: Consider the function f : L — [0, 1], which is 
de ned by sending )~~., a,10-"' to 0, aiana3....] 


n=1 


12.3. The Paradoxes of Set Theory 


The Early Paradoxes 


12. 


13. 


Transition to the Twentieth Century: Cantor and Kronecker 


Verify that the function f : R > [0,1] dened by 


1 1 x 
fo = 5 ( +i) 


is one-to-one, so that R ~ [0, 1]. 


Establish that any nondegenerate interval in R has 

cardinality c. [Hint: Show that [0, 1] ~ [a, b] via the 

function f(x) = a+ (b—a)x and [0, co] ~ [0, 1] via 
x 

f= | 


It has been said that the relation between Cantor’s theory 
of sets and mathematics was like the course of true love, 
never running smooth. About 1900, just when Cantor’s 
ideas were beginning to gain acceptance, a series of en- 
tirely unexpected logical contradictions were discovered 


in the fringes of the theory of sets. Curiously, these were at rst called “paradoxes,” 
rather than at contradictions, and regarded as little more than mathematical oddities. 
The feeling was that the conceptual apparatus of set theory was not yet quite satis- 
factorily constituted and that some slight alteration of the basic de nitions would set 
things right. Then, in 1902, the British philosopher, mathematician, and social reformer 
Bertrand Russell (1872—1970) offered a paradox in which Cantor’s very de nition of set 
seemed to lead to the contradiction. The simplicity and directness of Russell’s paradox 
shook the very foundations of logic and mathematics, and the tremors are still being 


felt today. 


This most notorious of the modern paradoxes appeared in Russell’s Principles of 
Mathematics, published in 1903. Before we examine Russell’s paradox let us observe 
that some sets are members of themselves, and some are not. The set of all abstract ideas, 
for example, is an abstract idea, but the set of all stars is not a star. Most sets are not 
elements of themselves; those that are tend to be far-fetched in description. 

With this in mind, we can formulate Russell’s paradox quite simply, using the bare 
notions of set and element. If one naively accepts the Cantorian view that every condition 
determines a set, then it is obviously possible to consider the set of all sets that have the 
property of not being elements of themselves, that is, the set 


S = {AA is a set and A ¢ A}. 


Here, we have used the symbol ¢, standing for the phrase “does not belong,” for the rst 
time. The use of € to denote set membership was initiated by Peano in his Arithmetices 
principia (1889); it is abbreviation of the Greek word eo 1, meaning “is.” 

Now S itself is a set, so that by the law of the excluded middle, which says that 
every proposition is either true or false, S € S or S ¢ S is a true assertion; but, it is easily 
seen, each of the two cases leads to a contradiction. For if S € S, it follows that S must 
be one of the sets A described in the condition; hence, S ¢ S is an impossible situation. 
On the other hand, if S ¢ S, then S satis es the property by which one determines which 
sets are elements of S; thus § € S, which is equally impossible. Because either case leads 
to contradiction, the paradox is apparent. 
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Russell’s paradox jolted the mathematical world and proved to be a personal disaster 
for the logician Gottlob Frege. Frege had labored for more than 10 years in the production 
of the second volume of his treatise on the logical foundations of arithmetic, Grundgesetze 
der Arithmetik. In the preface, Frege had written as follows: “The whole of the second 
part is really a test of my logical convictions; it is impossible that such an edice could 
be erected on an unsound basis.” At the very time the volume was coming near the end 
of its printing before being offered to the public, Frege received a personal letter from 
Russell announcing his discovery of the paradox mentioned earlier. Frege barely had 
time to compose an appendix to the Grundgesetze, which said in part: 


A scientist can hardly meet with anything more undesirable than to have the foundation give 
way just as the work is nished. I was placed in this position by a letter from Mr. Bertrand 
Russell as the printing of the present volume was nearing completion. 


Frege’s immediate reaction, as shown in his prompt reply to the young British logician, 
was one of consternation: “Arithmetic has begun to totter.” On the other hand, many 
distinguished mathematicians rejoiced in the paradox. The elder statesman of French 
mathematics, Henri Poincaré, who had little faith in mathematical logic, was overjoyed 
that the carefully constructed logical foundation was insuf cient to bear the weight of 
arithmetic. He exclaimed, “It is no longer sterile, it begets contradictions.” This may also 
have been a play on the saying, “Logic is barren, where mathematics is the most proli c 

of mothers.” 

Russell’s paradox was not, to be sure, the rst paradox noted in set theory. The 
earliest of the paradoxes—one based on the consideration of the “set of all ordinal 
numbers”—was published in 1897 by the Italian mathematician Cesare Burali-Forti, 
and known to Cantor at least two years earlier. Then, in 1899, Cantor discovered a 
paradox that had to do with his theory of cardinal numbers. This was based on another 
far-reaching theorem of his (1883), which said that for any set A, its power set had a 
larger cardinal number than that of A. By the power set of A was meant the set of all 
subsets of A; our notation for the power set of A is P(A). 

The result that needs to be proved rst in deducing Cantor’s theorem may be stated 
as a theorem. 


For any set A, there does not exist a function mapping A onto its power set P(A). 


Proof. The general plan of attack is to show, by an indirect argument, that no such 
function exists. Therefore we start with the assumption, to be refuted in the end, that 
there is a function f:A — P(A) that is onto P(A). This means that to each element 
a € A, there is assigned a subset f(a) of A. Let us consider the set B dened by 


B={acAlag f(a)}. 


The set B is a subset of A, so that B = f(b) for some b € A [recall that f maps 
onto P(A)]. We now ask an innocent question: Is the element b a member of the 
set B? 

If b € B, then by the very de nition of the set B, we must have b ¢ f(b) = B, 
which is impossible. On the other hand, b ¢ B = f(b) implies that b satis es the 
de ning property of B and so b € B. This is also nonsense. It follows that no 
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Henri Poincaré 


(1854-1912) 


(From A Concise History of Mathematics by Dirk 
Struik, 1967, Dover Publications, Inc., N.Y.) 


function f as described here can exist, and the proof of the theorem is thereby 
concluded. 


With this accomplished, we are ready to give a proof of Cantor’s theorem: 


For any set A, o(A) < o(P(A)). 


Proof. We must show rst that o(A) < o(P(A)) and then that o(A) 4 o(P(A)). The 
meaning of the statement 0(A) < o(P(A)) is that A is equivalent to a subset of 
P(A); that is, there exists a one-to-one correspondence between A and a subset of 
P(A). Quite obviously the function that takes each a € A into the single-element set 
{a} in P(A) de nes such a correspondence. 

For the task of proving that o(A) 4 o( P(A)), it is enough to show that there is 
no one-to-one correspondence between A and P(A). Because such a correspondence 
would certainly be a mapping of A onto P(A), it cannot exist, by the previous 
theorem. This gives the desired conclusion. 


| 


Cantor’s theorem not only answers the question whether to every cardinal number 


there exists a still larger cardinal (in particular, whether there are cardinals larger than c) 
but also furnishes a way of constructing a strictly increasing sequence of trans nite cardi- 
nal numbers. For in the special case of the cardinal number c, the theorem indicates that 
c = 0(R) < o(P(R)). By iterating the operation of forming the power set, an unending 
hierarchy of in nite cardinals can be obtained: 


c < o[ P(R)] < of P[P(R)]} < o( P{P[P(R)]}) <.... 
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Here we see in nity on in nity, each incomparably larger than the last, in a process that 
never ends. The imagination is beggared, but the cardinal numbers are not. 

A paradox arises when we consider the most comprehensive of all sets, the set U 
that contains all sets. By Cantor’s theorem, o[ P(U)] > o(U). Because U is the set of 
all sets and P(U) is a set (the elements of which happen to be subsets of U), then 
P(U) is contained in U; hence, o[ P(U)] < o(U), and we have a contradiction. Although 
this paradox was published only posthumously with Cantor’s correspondence in 1932, 
rumor of it reached Russell in 1901, and he then devised a paradox of his own. The 
Cantor paradox requires a good deal of mathematical machinery, enough so as to make 
it suspect on various grounds; but not so the Russell paradox, which uses only simple 
and well-established principles. 

It is natural that after the shock of the paradoxes, the foundations upon which 
mathematics—and in particular, set theory—had been built were scrutinized as never 
before. For ages, the reasoning used in mathematics had been regarded as a model of 
logical perfection. Mathematicians prided themselves that theirs was the one science so 
irrefutably established that in its long history it had never had to take a backward step. 
But now many mathematicians turned away from Cantor’s ideas and ceased to work 
on aspects of their discipline that depended on an unqualied acceptance of set theory. 
Doubts voiced in France reached such proportions that at the International Congress of 
Mathematicians held in Rome in 1908, the eminent Henri Poincaré, who was regarded 
as something of an oracle on mathematics, went so far as to say, “Later mathematicians 
will regard set theory as a disease from which one has recovered.” Other mathematicians 
believed that Cantor’s basic tenets were essentially correct, but set theory in its existing 
form was too naive. They felt that set theory must be built on logical and consistent 
foundations if Cantor’s innovations were to be secured for posterity. 

The fatal aw in early set theory proved to be Cantor’s broad approach, which 
permitted any conceivable property to give rise to a set, namely, the set of all elements that 
possessed the property. Russell’s paradox showed, without any considerations involving 
cardinal numbers, that one cannot allow arbitrary conditions to determine sets and then 
indiscriminately permit the sets so formed to be members of other sets. Because the 
dif culty appeared to originate in the liberality with which Cantor’s theory allowed the 
formation of sets, it seemed that the very concept of “set” as it then stood was inherently 
faulty. The immediate aim was to restrict the de nition of set in such a way as to forestall 
the emergence of those sets that entered into the paradoxes, and yet to allow mathematics 
the greatest possible latitude for development. Each of the various solutions proposed that 
proved to be successful lay the blame on the introduction of sets that were “excessively 
large.” Thus, an essential ingredient in any formal theory of sets was to be an axiom 
guaranteeing a “limitation of size.” 


Zermelo and the Axiom of Choice 


In his original development of set theory, Cantor relied on intuition, rather than any 
set of axioms, in deciding which objects were to be sets. But common sense turned out 
not to be a good enough lighthouse to keep set theory from being wrecked on the shoals 
of the paradoxes. The rst successful axiomatic treatment of set theory was published by 
the German mathematician Ernst Zermelo (1871-1953) in 1908. Zermelo, who received 
a doctorate from the University of Berlin in 1894 with a dissertation on the calculus of 
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variations, began his career not in set theory, but in mathematical physics. He became 
curious about Cantor’s ideas, and as an assistant professor at Géttingen, he lectured on the 
subject during the winter semester 1900-1901. The following year (1901), he published 
his rst relevant paper consisting of several results on the arithmetic of cardinal numbers. 
In a famous memoir, Foundations of a General Manifold Theory (1883), Cantor stated 
that “every well-de ned set can be brought into the form of a well-ordered set.” (An 
ordered set is well-ordered if every nonempty subset has a “ rst” element.) And Cantor 
promised to return in a future publication to this “law of thought which seems to be both 
fundamental, rich in consequences, and particularly remarkable for its generality.” This 
promise was never fully carried out. When Hilbert presented his famous 23 problems at 
the 1900 International Congress of Mathematicians, he indicated that the discovery of 
such a proof was one of the tasks challenging mathematicians the world over. Zermelo 
supplied the critical proof of the well-ordering theorem in 1904, a proof that unleashed 
much spirited controversy. Zermelo based his argument on a powerful new, and suspect, 
device, the axiom of choice. The axiom of choice (a name given by Zermelo) asserted 
that from any given collection of disjoint nonempty sets, it would be possible to choose 
exactly one element from each set and thereby form a new set. Intuitively, this axiom 
allowed for a simultaneous but independent selection from each of an in nite number 
of sets. The idea of making in nitely many choices was not entirely new, having been 
an important part of many mathematical arguments around 1900. As early as 1883, 
Cantor himself had unconsciously applied the choice axiom, and in an 1890 article on 
differential equations Peano had incidentally alluded to and rejected it (“as one cannot 
apply innitely many times an arbitrary law by which one assigns to a class A an 
individual of that class”). The rst explicit mention of the statement of the axiom was 
by Beppo Levi in 1902, in considering a proof of a theorem on cardinal numbers. 

The controversy touched off by the axiom of choice reminds one of another famous 
axiom, Euclid’s parallel postulate. This time the dispute centered on the question of what 
are admissible methods in mathematics; for the essence of the axiom of choice is that it is 
an existential statement giving no constructive de nition of the representative elements 
involved in its use. One of those who resolutely opposed nonconstructive methods in 
set theory, Emile Borel (1871-1956), insisted: “Any argument where one supposes an 
arbitrary choice to be made an uncountably in nite number of times... [is] outside the 
domain of mathematics.” Another such mathematician, Jacques Hadamard (1865-1963), 
crystallized the whole controversy into the question. “Can the existence of a mathemati- 
cal entity be proved without de ning it?” In a paper written in 1908, Zermelo furnished a 
second proof of the well-ordering theorem, in which the objections to the rst were dis- 
cussed at length. He concluded his spirited defense with the statement “No mathematical 
error can be demonstrated in my [earlier] proof.” Once again, the axiom of choice was 
used in the same way, the only difference from the 1904 article being in the remaining 
set-theoretic axioms. Although Zermelo’s two papers raised as many questions as they 
professed to settle, their great merit lay in the formal recognition of the principle of 
arbitrary choice as an independent method of proof. 

Zermelo took a decisive step in the attempt to rehabilitate the heretofore haphazard 
formulation of Cantor’s set theory. In the same issue of the Mathematische Annalen that 
contained his second proof of the well-ordering theorem, he also published /nvestigations 
into the Foundations of Set Theory. He hoped that this paper, which presented a strictly 
axiomatic theory of sets, would in turn serve as a basis for all mathematics. Zermelo 
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AXIOM OF 
SPECIFICATION 


did not say what sets are but simply postulated them together with their basic properties. 
The conciseness of his system of axioms is surprising. Only seven axioms, involving 
just two unde ned _ technical terms (set and membership, the latter relation denoted by €) 
suf ced to build up the set theory required for practically all mathematics. The original 
axioms of Zermelo, as amended by Fraenkel, von Neumann, and others, are now called 
the Zermelo-Fraenkel axioms for set theory. 

Zermelo, to avoid paradoxes that would render his system useless, refused to admit 
into the club of decent sets those collections that were “too big.” He observed that 
mathematicians would not normally think of using such sets as “the set of all sets” or 
“the set of all sets which are not elements of themselves.” He held that the sets needed 
in practice were always built up by means of given operations from certain simple sets 
(like the sets of natural numbers or real numbers) that are known about to begin with. 
Thus, Zermelo formulated a principle by which he guaranteed the existence of certain 
subsets. 


To every set A and every defiite property P (x) there corresponds a set whose elements 
are exactly those elements x in A for which the property P(x) holds. 


The essential difference in Zermelo’s system was that both a property P(x) and a preex- 
isting set A were needed to form a new set, whereas in Cantor’s original scheme only the 
property P(x) was required. Rather than proclaim the existence of sets, the speci cation 
axiom posits the existence of certain subsets of a given set; a set is admitted into the the- 
ory only by being related to a known set. (The vague notion of “de nite property” gave 
rise to misgivings almost from the outset; the idea was made more precise by Fraenkel 
in 1922 and, somewhat differently, by Skolem in 1922-1923.) Zermelo also recognized 
that because the axiom of choice did not follow from any previously known principle of 
mathematics or logic, he must make it one of his seven axioms. 

How do the restrictions implied by Zermelo’s axiomatization avert the disaster of the 
paradoxes? The critical set that appears in Russell’s paradox, “the set of all sets which 
are not elements of themselves,” cannot be formed in Zermelo’s formal set theory; the 
best one can do is to produce the set 


S={AeA|A isaset and A ¢ A}, 


where A is a set (of sets) known to exist. Notice that S € S is impossible; for then 
S €¢ Aand S ¢ S, a contradiction. Thus, S ¢ S. The implication is that § ¢ A. Indeed, if 
S € A, then S would (because S ¢ S) satisfy the condition determining the members of 
S and we should have S € S, which would be a contradiction. The outcome of Russell’s 
argument has changed completely. All it shows is that if A is any set that exists, then 
the set S cannot be an element of A. This does not lead to a contradiction, so Russell’s 
paradox cannot be reproduced in axiomatic set theory. 

Let us now consider the paradox of Cantor, which derives its origin from the pos- 
sibility of constructing the set of all sets. Suppose for the moment that there is a set 
U that contains every set. Then because U/ contains every set, S € U; this violates the 
conclusions of the last paragraph. The set of all sets, no matter how curiously natural 
it seems, does not exist within Zermelo’s system, and Cantor’s paradox falls away. As 
Zermelo hoped to show, axiomatization was a successful antidote to the paradoxes. 
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Declaring that “I have not yet even been able to prove my axioms are consistent, 
though this is certainly essential,” Zermelo left the difcult questions of consistency 
and independence to his successors. Absolute consistency turned out to be a blind al- 
ley. According to Gédel’s famous incompleteness theorem, it is impossible in certain 
logical systems—Zermelo-Fraenkel set theory, for example—to demonstrate the inter- 
nal consistency of the system by methods formalizable within the system itself. In a 
series of lectures at Princeton in 1938, published afterward as The Consistency of the 
Axiom of Choice and of the Generalized Continuum Hypothesis with the Axioms of Set 
Theory, Gédel proved that if the other axioms present in set theory are consistent with 
one another, then the system obtained by adjoining the axiom of choice will not give 
rise to any contradictions. This was, in other words, a proof of the relative consistency 
of the axiom of choice. Cohen (1963) succeeded in showing that the negation of both 
the axiom of choice and the continuum hypothesis is also consistent with the rest of the 
Zermelo-Fraenkel axioms (provided they are consistent themselves). 

The combined results of Gédel and Cohen imply that the choice axiom is an in- 
dependent axiom of set theory; its use or rejection in an axiom system is a matter of 
personal inclination. Because so many profound theorems have been obtained using the 
axiom of choice, without visible alternative, the average mathematician would probably 
speak in favor of its retention. On the other hand, because it has troubled the consciences 
of so many, it is important to know which theorems have been proved with the aid of the 
axiom of choice and the extent to which the axiom is needed in the proof. An alternative 
proof without the axiom of choice would then be desirable. 


The Logistic School: Frege, Peano, and Russell 


The paradoxes of the in nite laid the stage for a modern “crisis in the foundations,” 
not unlike the profoundly disturbing situation that arose when the Pythagoreans unexpect- 
edly discovered incommensurable quantities. It appeared to many that the entire structure 
of mathematics was weak or at least built on weak foundations. In the early 1900s, widely 
different diagnoses of the ills of mathematics divided mathematicians into various enemy 
camps. The three main schools of thought concerning the origin and nature of mathe- 
matics are usually distinguished as the logistic (also called logicistic), formalistic, and 
intuitionistic schools, and their best-known proponents identied as Bertrand Russell, 
David Hilbert, and L. E. J. Brouwer, respectively. Although the adherents of these fac- 
tions had the common purpose of coming to grips with the destructive paradoxes, the 
radically different ways they chose to accomplish this led to some sharp con icts. It was 
almost as though mathematics had become a kind of religious fanaticism instead of a 
labor of love. 

The chief characteristic of the logistic school was its uncompromising insistence 
that logic and mathematics were related as earlier and later parts of the same subject, 
that mathematics was ultimately derivable from logic alone. Logic was not simply an 
instrument in the construction of mathematical theories; mathematics now became the 
offspring of pure logic. According to the logistic philosophy all mathematical concepts 
must be given de nitions only from ideas that are a part of logic, and all mathemati- 
cal statements must be deducible from universally recognized assumptions of logic. Of 
course, to say mathematics is logic is merely to replace one unde ned term by another; 
logic is as much in need of de nition as mathematics is. 
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The rst determined effort to rewrite the established body of mathematics in logical 
symbolism was made by the German logician and philosopher Gottlob Frege (1848— 
1925). Frege received his doctorate in mathematics from G6ttingen in 1873. The fol- 
lowing year he began his teaching career at the University of Jena, where he remained 
for 45 years. Frege’s mathematical work was almost wholly con ned to mathematical 
logic and foundations. His small but weighty treatise, entitled Begriffsschrift, einer der 
arithmetischen nachgebildete Formelsprache des reinen Denkens (Conceptual Notation, 
a Symbolic Language of Pure Thought Modeled on the Language of Arithmetic), pub- 
lished in 1879, was a natural milestone in the history of modern logic. More than 20 years 
passed, however, before Bertrand Russell sensed the greatness of the achievement. In the 
Begriffsschrift, the whole calculus of propositions and the device known as quanti ca- 
tion theory was presented for the rst time. Frege framed a formal language of symbols, 
which was intended to be adequate for the exposition of any mathematical statement, 
together with certain rules of inference for expressing any train of proof. 

During the succeeding years, Frege turned to the actual formalization of a particu- 
lar branch of mathematics. In two principal works, he chose arithmetic as his subject: 
Die Grundlagen der Arithmetik (The Foundations of Arithmetic), which appeared in 
1884, and Grundgesetze der Arithmetik (Fundamental Laws of Arithmetic), of which 
volume 1 was published in 1893 and volume 2 in 1903. Owing to the complexity of 
the symbolism—‘“a monstrous waste of space,” said one critic—and the novelty of the 
approach, these writings passed almost wholly without recognition until Russell devoted 
an appendix to them in Principles of Mathematics (1903). Frege’s system of axioms, 
as we observed earlier, introduced sets in a way that led directly to one of the newly 
discovered paradoxes. When the Grundgesetze was at the printshop, Russell found the 
fatal aw and communicated it to Frege in a letter, asking, “Is the set of all sets which are 
not members of themselves a member of itself?” After acknowledging the contradiction 
contained in his system, it seems that Frege was never able to regain his former faith in 
the possibility of a purely formal presentation of arithmetic. He effectively abandoned 
his creative research and died a bitter man, convinced that his life’s work had been for 
the most part a failure. Frege’s death went virtually unmarked by the scholarly world, a 
tragic fate for a man who singlehandedly created a revolution in logic. 

Another great pioneer in the study of the foundations of mathematics, and a master 
of the art of reasoning by formal logic, was the Italian mathematician Giuseppe Peano 
(1858-1932). Peano’s rst attempt at deducing the truths of mathematics from pure logic 
was his Arithmetices principia, nova methedo exposita of 1889, a small tract of 29 pages 
written almost entirely in the symbols of what he called mathematical logic. Among the 
body of signs Peano invented for this were € (belongs to), U (logical sum or union), 9 
(logical product or intersection), and D (logical implies or contains). The “horseshoe” 
symbol D became the standard notation for material implication after Whitehead and 
Russell used it in their Principia Mathematica. The Arithmetices principia is noteworthy 
for containing the rst statement of the famous postulates for the natural numbers, perhaps 
the best known of Peano’s achievements. 

Peano’s goal was to take mathematics as he found it and translate it completely 
into his designed language of signs, thereby making the principles of logic the vehicle of 
demonstration. This was carried out in the Formulaire de mathématiques, which expressed 
in symbols all the known de nitions, proofs, and theorems of extensive tracts of math- 
ematics. Essentially a series of reports by Peano and his collaborators, the Formulaire 
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was published in ve successive editions or volumes. The rst edition appeared in 1895; 
the last was completed in 1908 and contained some 4200 theorems. The nal version 
came out under the title Formulario mathematico and was written in a new international 
language of Peano’s invention, which he called Jatino sine fxione (Latin without gram- 
mar), afterward named Interlingua. Peano spent almost the whole of his life in Turin. He 
entered the University of Turin as a student in 1876 and held a position there from 1880 
to 1932. He also held a position at the military academy, which was next door, from 1886 
to 1901. When Peano adopted the Formulaire as a textbook, the pre-engineering students 
at the academy rebelled, complaining that the lectures contained an excess of queer new 
symbolism. His attempt to regain popularity by passing everyone who registered for his 
course proved ineffectual, and Peano was forced to resign his professorship. Another 
example of Peano’s infatuation with formalism is his famous paper (1890) in the Math- 
ematische Annalen concerning the existence of solutions of real differential equations. 
The article was so clothed in the language of symbolic logic that it could be read only 
by a few of the initiated and did not generally become available to the working math- 
ematician. It became something of the fashion to refer scornfully to Peano’s symbolic 
language as Peanese. 

At the rst International Congress of Philosophy at Paris in 1900, the Italian phalanx 
of Peano, Burali-Forte, Padoa, and Pieri dominated the discussion. Bertrand Russell’s 
meeting with Peano brought about, in the Englishman’s own words, “a turning point 
in my intellectual life.” Russell asked Peano for copies of his published works, quickly 
mastered the techniques of mathematical logic, and returned home to write Principles 
of Mathematics. In the Principles of Mathematics, which contains the rst explanation 
of the paradox that bears his name, Russell put forward the opinion that logic is the 
progenitor of mathematics: 

The present work has to ful ll two objects, r st, to show that all mathematics follows from 


symbolic logic, and secondly, to discover, as far as possible, what are the principles of 
symbolic logic itself. 
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The book was to appear in two volumes, the rst of which was to conne itself to a 
popularly understandable explanation, avoiding symbolism. The second volume was to 
provide the logical proofs that would support the view that mathematics and logic are 
identical. The proposed second volume never came out, because the revision of material 
necessitated writing a wholly new work, the Principia Mathematica. 

The apex of the logistic conception of mathematics is Principia Mathematica, written 
by Russell in stages in collaboration with his friend and older colleague Alfred North 
Whitehead (1861-1947). A massive work of bewildering complexity, its three volumes 
comprise more than 2000 pages. The Principia must thus be regarded as one of the 
classics of mathematical literature, more often quoted than read. Frege and Peano were, 
so to speak, the godfathers of the Principia. Russell was unacquainted with Frege’s 
writings during most of the preparation of the Principles of Mathematics (“Professor 
Frege’s work, which largely anticipates my own, was for the most part unknown to 
me when the printing of the present work began”). In the preface of the Principia, 
however, the authors acknowledged their obligation: “In all questions of logical analysis, 
our chief debt is to Frege.” On the formal side, the new symbolism that Whitehead and 
Russell adopted had its roots in Peano’s work. They explained that they were obliged to 
renounce ordinary language and write almost entirely in symbols, because no words had 
the exact value of the symbols; they held that without the symbolic form of the work, 
they would have been unable to perform the requisite reasoning. Taken as a whole, the 
Principia constituted a formidable effort to prove the logistic thesis that mathematics 
was indistinguishable from logic. Russell later wrote: 


If there are still those who do not admit the identity of logic and mathematics, we may 
challenge them to indicate at what point, in the successive de nitions and deductions of 
Principia Mathematica, they consider that logic ends and mathematics begins. 


After analyzing the paradoxes, Russell arrived at the view that they all resulted from 
the same circular kind of reasoning, a misuse of self-referential expressions. To quote 
Russell’s words in the Principia: 


The principle which enables us to avoid illegitimate totalities may be stated as follows: 
“Whatever involves all of a collection must not be one of the collection,”...We shall call 
this the “vicious-circle principle,” because it enables us to avoid the vicious circles involved 
in the assumption of illegitimate totalities. 


To ensure strict observance of the vicious-circle principle, he introduced his theory of 
logical types. The theory set up a hierarchy of levels of elements. On the lowest level 
there are “individuals”; on the next level, sets of individuals; on the next level, sets of sets 
of individuals, and so on—a different type of object at each level. In applying the theory, 
one follows the rule that sets have as members only those things from the next lower 
level. This idea made set theory secure against the “illegitimate totalities,” but it led to 
other dif culties in the body of mathematics itself—many important theorems not only 
could not be proved but could not even be expressed. (Among them was Cantor’s theorem 
that there are more real numbers than positive integers.) Adherence to Russell’s rule of 
hierarchies introduced complications in Dedekind’s theory of the real line. For instance, 
the least upper bound of a bounded set, because it is de ned in terms of a set of real 
numbers, must be of a higher type than the real numbers, and so itself is not a real number. 

Russell overcame this weakness by positing a new axiom, which he called the 
axiom of reducibility. (The axiom is less a part of the deductive system of logic than a 
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rule for the manipulation of symbols.) The sole justi cation for the axiom of reducibility 
was that there seemed to be no other way out of the particular dif culty engendered 
by the theory of types. It was the arti cial, ad hoc character of this device that brought 
forth an overwhelmingly negative response from mathematicians. It became the main 
bone of contention for the critics of the system of the Principia. They argued that the 
axiom was incompatible with the integral program of logicism, that the claim could no 
longer be made that the axioms were purely logical or that the resulting system was 
founded exclusively on logic. One critic stated that the introduction of the axiom of 
reducibility into the Principia, without any supporting reason, was an act of harakiri. 
Russell felt that this was not the sort of axiom with which one could rest content, 
and he conjectured that some less objectionable axiom might give the desired results. 
But he could nd no satisfactory alternative in his attempt to drop it from the second 
edition (1925) of the Principia. As the authors admitted, “This axiom has a purely 
pragmatic justi cation; It leads to the desired results, and to no others.” The impetus of 
the logistic movement seemed to falter with the publication of the third great tome of 
the Principia, for beyond the distrust inspired by the axiom of reducibility was another 
dif culty. Russell had shown merely that all the known paradoxes were circumvented 
in his system; he had not shown that his system would remain free of contradictions. 

Whitehead, when he began his collaboration with Russell, was already the author of 
two tracts on geometry and also a book on the symbolic structure of algebra, A Treatise 
on Universal Algebra (1898). The preparation of a fourth volume of the Principia, on 
the logical foundations of geometry, which was to be written largely by Whitehead, 
was interrupted by World War I. At a time when pacism aroused bitter emotions, 
Russell’s opposition to the war caused him to be denounced as unpatriotic. In 1916, he 
was prosecuted and ned £100 for writing a pamphlet containing “statements likely to 
prejudice the recruiting and discipline of His Majesty’s Forces.” The conviction, and 
Russell’s general unpopularity, led to his dismissal from Trinity College, Cambridge. 
Then, in 1918, he was sentenced to six months’ imprisonment for another article libeling 
the American army. In prison, Russell found the leisure to write his Introduction to 
the Philosophy of Mathematics, which was published in 1919. His subsequent works on 
moral and social issues were more popular. 

Whitehead was also inclined to philosophy, going to the United States in 1924 as 
professor of philosophy at Harvard. As the authors of the Principia moved into this new 
territory, the leadership in research into the foundations of mathematics passed to the 
great German authority, David Hilbert. The younger generation was probably thankful 
to be delivered from such a dry and desolate undertaking as the Russell-Whitehead 
enterprise; besides, Hilbert’s axiomatic approach appeared more in keeping with the 
traditional character of mathematics. 


Hilbert’s Formalistic Approach 


Hilbert’s interest in foundations dated to his investigations of geometry in the 1890s. 
His masterful Grundlagen der Geometrie (Foundations of Geometry) was an attempt to 
rewrite Euclid in accordance with the principles of Peano, although without Peano’s unfa- 
miliar logical symbolism. That is, the Grundlagen laid out a rigorous axiomatic treatment 
of elementary geometry that avoided any illegal appeal to intuition. The question that 
Hilbert explicitly formulated for the rst time was the consistency of his set of axioms: 
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there should be no paradox or contradiction consisting of two theorems, one of which 
would be the negation of the other. As a natural evolution of this work, at the Second In- 
ternational Congress of Mathematicians (1900), he offered as one of his list of the most 
challenging current problems that of the consistency of the axioms of arithmetic. By 
the time of the Third International Congress, which was held at Heidelberg in 1904, the 
emergence of the paradoxes had brought mathematicians into uncertainty. Hilbert, speak- 
ing before those gathered at Heidelberg, volunteered his service in the reconstruction of 
mathematics. Having achieved initial success with the axiomatization of geometry, he 
saw no reason why the same approach could not be applied to other areas—indeed, to 
all mathematics. After considering the problems that beset a rigorous development of the 
number system, he offered the outline of a concrete plan: 


I believe that all the dif culties that I have touched upon may be overcome, and an entirely 
satisfactory foundation of the number concept can be reached, by a method which I call the 
axiomatic method, and whose leading idea I wish now to develop. 


Hilbert did not act on his Heidelberg proposal for many years; instead, he became ab- 
sorbed rst in integral equations and then later in mathematical physics. He next returned 
to the old problem, publicly at least, when he delivered an address in 1917, Axiomatisches 
Denken, before the Swiss Mathematical Society. In the interim, the nagging questions 
concerning the logical foundations of mathematics had reached a critical stage. To make 
matters worse, L. E. J. Brouwer of the University of Amsterdam was winning converts 
to his distinctly personal philosophy of mathematics known as “intuitionism.” One of the 
fundamental canons of this thinking was that the analysis of Weierstrass and the concept 
of the innite as it appeared in the work of Cantor were “built on sand.” Fearing that 
his cherished theorems, his paradise, would be among the sacrices required, Hilbert 
returned in earnest to the task of providing mathematics with a secure foundation. In a 
lecture delivered at Munster in 1925 to honor the memory of Weierstrass, he declared 
stoutly, “No one will expel us from this paradise Cantor has created for us.” 

It was a fundamental thesis of Hilbert that once he consistency of any axiom system 
had been established, then its use was “legitimate.” Indeed he held that mathematical 
existence was nothing less than consistency: 


If the arbitrarily given axioms do not contradict each other through their consequences 
... then the objects de ned through the axioms exist. That, for me, is the criterion of truth 
and existence. 


Thus, to salvage traditional mathematics in the face of Brouwer’s attacks, Hilbert pro- 
posed a bold new program. It required rst that the whole of existing mathematics, 
including logic, should be axiomatized, and second that this axiomatic theory should 
then be proved consistent by simple nitary arguments. This approach to foundations 
became known as formalism. 

Before Hilbert’s proposal, the method used in consistency proofs for an axiomatic 
theory was merely to shift the burden to another area of mathematics. In the Grundlagen 
der Geometrie, for instance, consistency of geometry had been established only in relative 
terms, assuming the consistency of arithmetic; doubt about the consistency of arithmetic 
cast a shadow over the whole enterprise. Hilbert proposed a more thoroughgoing cure 
for those ills that the set paradoxes had generated in mathematics. His aim was to furnish 
an absolute consistency proof for some axiomatic system within which all mathematics 
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could be deduced. For if a system strong enough to embrace the notion of in nite 

set and the operations on such sets could be shown to be mathematically incapable of 
producing an inconsistency, then no contradiction would ever be forthcoming as a result 
in set theory. The establishment of the consistency of classical mathematics was Hilbert’s 
golden dream. 

Asa_ rst step toward carrying out such a program, Hilbert introduced the notion of 
a formal theory, a completely symbolic axiomatic theory in which there was explicitly 
incorporated a system of logic. (The formal theory is usually a formalization of some more 
intuitively conceived theory of the ordinary mathematical kind.) Hilbert’s idea was the 
axiomatic development pushed to its extreme. Even the logical methods used uncritically 
in carrying out a mathematical proof had to be themselves subjected to formalization. 
Hilbert denied, however, that logic was more than mathematics and should precede 
it. He was concerned with extracting from the whole of logic only so much as was 
needed to reason in his formalism. Particularly helpful in symbolizing the statements 
of mathematics was the Principia of Russell and Whitehead, which Hilbert called “the 
crowning achievement of the work of axiomatization”; for by actually exhibiting the 
details, the Principia showed that all mathematical statements could be translated into 
a small body of symbols and that the laws of reasoning reduced to several simple rules 
for combining these symbols. Hence, an appropriate formal theory could be considered 
equivalent to the whole of mathematics. 

A formal theory, as set up by Hilbert, starts with a stock of symbols and the rules 
governing them. The rules consist of a certain set of initial formulas involving the sym- 
bols, the axioms, and certain explicitly stated rules of inference determining how further 
assertible formulas are to be constructed. A proof in such a theory is a nite sequence 
of formulas, each of which either is an axiom or is obtainable from one of the earlier 
formulas in the sequence by applying the rules of inference. The last formula in the 
sequence is, by de nition, a theorem. The aim is to make certain that no one of these 
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formalized reasonings will ever lead to a contradiction. Hilbert insisted that proofs must 
have a_ nite character, arguing that man is capable of only a nite number of logical 
deductions, so that any contradiction that might arise must do so after a nite number of 
operations. In Hilbert’s formalism, proofs themselves become the objects of a mathemat- 
ical study, which he called proof theory, or metamathematics. In this way, investigation 
into the nature of mathematical proofs becomes another branch of mathematics. 

Mathematics, for Hilbert, became a purely formal calculus, an almost mechanical ma- 
nipulation of symbols devoid of concrete content. John von Neumann, a collaborator with 
Hilbert on axiomatic foundations, said at the time, “We must regard classical mathemat- 
ics as a combinatorial game played with symbols.” The comment is perhaps unfortunate 
in that it suggests that formalistic mathematics is a trivial game played with meaningless 
marks on paper. Pure axiomatics presupposes an already existing intuitive theory, which 
it represents in idealized form: “The axioms and demonstrable theorems which arise in 
our formalistic game are the images of the ideas which form the subject matter of ordi- 
nary mathematics.” The formalist position is that the inherent structure of intuitive proofs 
is fully re ected in the combinatorial relations between formal expressions—the outward 
and visible signs of mathematical reasoning. Hilbert argued, “My theory of proof actually 
is nothing more than the description of the innermost processes of our understanding and 
it is a protocol of the rules according to which our thought actually proceeds.” 

During the decade 1920-1930, Hilbert and his two young assistants, Wilhelm Ack- 
ermann and Paul Bernays, worked diligently at carrying through the objectives of the 
formalist program. After some partial success, there was every reason to believe that a 
few years’ sustained effort would succeed in establishing the hoped-for consistency of 
the formal equivalent of classical mathematics, that it was now merely a matter of nding 
the correct technique. But such optimistic expectations were dealt a staggering setback 
when Kurt Gédel, a 25-year-old mathematician at the University of Vienna, announced 
an important and dismaying discovery. Gédel proved his result in 1930, an abstract was 
published at the end of the year, and the full details appeared early in 1931. Hilbert 
originally intended to prove consistency of a formal system by such means as could be 
formalized in the system itself. Gddel showed this program to be incapable of ful Ilment. 
According to Gédel, the consistency of a formal system strong enough to be considered 
a foundation of mathematics could not be established from within the system by strictly 
nitary methods. Thus, either the system was inconsistent to begin with, or if consistent, 
the limitations prescribed by Hilbert were inadequate to formalize a proof of its consis- 
tency. Each formal system needed a wider and more inclusive system to demonstrate its 
consistency. This melancholy revelation effectively brought to an end the initial phase 
of the formalistic movement. 


Brouwer’s Intuitionism 


While Hilbert proposed to save and safeguard the customary formulation of mathe- 
matics by a consistency proof, the Dutch mathematician L. E. J. Brouwer was ready to 
sacrice those parts of the subject in which he felt that language had outrun clear mean- 
ing. Better to be rid of these offensive embellishments, beautiful in form but hollow in 
content. It was Brouwer’s intention to develop an intuitionistic mathematics, using only 
those constructions that had a clear intuitive justi cation. Intuitionism, though in some 
way anticipated by Kant, Kronecker, and Poincaré, was shaped as a de nite philosophy of 
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mathematics by Brouwer and his Amsterdam school. Brouwer had a rare insight into the 
defects of classical mathematics (the familiar mathematics using the logic of Aristotle), 
and he made a valiant attempt to set things right, beginning in 1907 with his doctoral dis- 
sertation. The dissertation, entitled On the Foundations of Mathematics, was a penetrating 
criticism directed against Cantor’s trans nite numbers, against the logicism of Russell, 
and against Hilbert’s axiomatic method. Some of the ideas he put forth in his dissertation 
were sketchy and had to be revised and expanded in later papers. A modest six-page 
criticism written in 1908, On the Unreliability of Logical Principles, questioned the logic 
used in traditional reasoning. This was followed in 1912 by Intuitionism and Formalism, 
a memoir that consolidated Brouwer’s program. Two articles of faith emerged: the re- 
duction of mathematics to the ultimate intuition of the natural numbers and the rejection 
of the unrestricted application of the law of the excluded middle. 

Another no less important feature of the intuitionistic position was its challenge to 
the logicist presumption of the priority of logic over mathematics. Where Frege, Peano, 
and Russell thought a logical symbolism was a prerequisite for mathematical knowledge, 
the intuitionists repudiated any such requirement. Brouwer, in his dissertation, argued 
that mathematics was a mental construction, a free creation of the mind, completely 
independent of language and logic. Although mathematical language, whether ordinary 
or symbolic, may be unavoidable from a practical standpoint, it is nothing more than 
an imperfect tool used by mathematicians to assist them in memorizing their results 
and communicating them to one another. Symbolic logic does not represent an essential 
feature of mathematical reasoning and by itself can never create new mathematics. The 
symbolic language must not be confused with the mathematics it conveys; it re ects, 
but does not contain, mathematical reality. “These paradoxes arise,” in the words of 
Brouwer, “when the language which accompanies mathematics is extended to a language 
of mathematical words which is not connected with mathematics.” He upbraided the 
formalists for building self-subsistent verbal edices that could be studied apart from 
any intuitive interpretation; to that extent, they reduced mathematics to a meaningless 
“game of formulas”: 


What it [the formalist school] seems to have overlooked is that between the perfection of 
mathematical language and the perfection of mathematics proper, no clear connection can 
be seen. 


Brouwer’s conception of mathematics had some af nity to the conception of Im- 
manuel Kant, both men nding the source of mathematical truth in intuition. Kant, in 
his Critique of Pure Reason, argued the case for believing that a substantial part of the- 
oretical knowledge has an inescapably a priori nature. It was his rm conviction that 
the axioms of arithmetic and geometry had this a priori character; that is, they were 
judgments independent of experience and not capable of analytic demonstration. For 
Kant, the possibility of disproving arithmetical and geometrical laws was entirely un- 
thinkable. Although Brouwer rejected Kant’s discredited notion that our geometry was 
based on an a priori intuition of space, he retained the complementary thesis that a 
temporal intuition would allow us to conceive one object, then one more, then another, 
and so on inde nitely. Thus, the natural numbers were accepted by Brouwer, not on 
the basis of any axiom system such as Peano’s, but as arising from some primordial 
instinct of the passage of time; and all further mathematical objects would have to be 
constructed out of these numbers by intuitively clear nite methods. This constructive 
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tendency in mathematics had been espoused apart from and before intuitionism in the 
work of Kronecker and Poincaré. Kronecker, you will remember, objected to introducing 
into mathematics objects that could not be produced by any kind of nite construction. 
This was the view that led to his notorious feud with Cantor. As Kronecker himself 
said in a striking phrase, “God made the natural numbers, and all the rest is the work of 
man.” Another early forerunner of the spirit of intuitionism was Poincaré, who contended 
that mathematical induction was a pure intuition of mathematical reasoning, not just an 
axiom that was useful in some systems. Brouwer agreed with Poincaré on this point, 
but rejected his opinion that mathematical existence coincides with any demonstration 
of noncontradiction. 

In his famous paper, On the Unreliability of Logical Principles, Brouwer challenged 
the view that the classical logic founded on the authority of Aristotle has a universal 
validity independent of the subject matter to which it is applied. Brouwer maintained 
that the common belief in the applicability of classical logic to mathematics could only 
be considered a phenomenon of the history of civilization, of the same order as the 
oldtime belief in the rationality of 7. According to his interpretation, traditional logic 
arose from the mathematics of nite sets and their subsets; then, forgetful of this limited 
origin, people mistook that logic for something that had an a priori existence independent 
of mathematics; and nally, they unjustly applied it, by virtue of its a priori character, 
to the mathematics of in nite sets. Hermann Wey] said, “This is the Fall and original sin 
of set theory, for which it is justly punished by the antinomies.” That paradoxes showed 
up was not surprising to Brouwer, only that they showed up so late in the day. 

A specic example of a logical principle, valid in reasoning with nite sets, that 
Brouwer rejected for in nite sets was the law of the excluded middle. The supposition 
underlying this principle is that each mathematical statement is determinably true or false, 
independent of the means available to us of recognizing its truth value. For Brouwer, 
purely hypothetical truth values were an illusion. A given mathematical statement was 
true only when a certain self-evident construction had been effected in a_ nite number 
of steps. Because it could not be guaranteed beforehand that such a construction could 
be found, we should have no right to assume of each statement that it is either true 
or false. For instance, Brouwer asked whether it is true or false “that in the decimal 
expansion of z there occur ten successive digits forming a sequence of 0123456789.” 
This would evidently require that we either indicate a sequence of 0123456789 in z or 
demonstrate that no such sequence could appear, and no method existed for making the 
decision, so one could not apply the law of the excluded middle here to conclude that 
the statement would have to be either true or false. On the other hand, it is legitimate 
from the intuitionists’ standpoint to assert that the number 1010" + 1 is either prime or 
composite, without being able to say which alternative actually holds. There is a method, 
if one were to take the trouble to apply it, that is in principle effective for deciding which 
alternative is correct. 

Brouwer’s criticism of the logic used in classical reasoning arose from his refusal 
to accept nonconstructive existence proofs; that is, proofs that establish the existence of 
something without providing an effective means for nding it. Giving a nonconstructive 
existence proof, observed Weyl, is like informing the world that somewhere there is a 
buried treasure, but not saying where it is. These proofs tend to take the form of an 
indirect demonstration and as a rule rely on the law of the excluded middle. (Euclid’s 
proof of the innitude of primes is a perfect illustration.) Brouwer would have none 
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of this; for him, “mathematical existence” was synonymous with actual constructibility. 
It is not enough, in showing that an in nite set has members, to demonstrate that the 
assumption that the set is empty leads to a contradiction; one must exhibit a process that 
at least in principle will enable an element of the set to be found or constructed. The 
intuitionists regarded the idea that there might exist a set that couldn’t be constructed as 
a piece of hopelessly confused metaphysics. By refusing to de ne a set by means of an 
attribute characteristic of its elements, and limiting themselves to constructive methods, 
Brouwer’s followers did away with the problems raised by the paradoxes of set theory. 

The abandonment of the law of the excluded middle, and with it the nonconstructive 
existence proofs, was too radical a step for Hilbert to accept. “Taking the law of the 
excluded middle from mathematicians,” he exclaimed, “is the same as prohibiting the 
astronomer his telescope or the boxer the use of his sts.” For his part, Brouwer would not 
go along with the proposition that proving classical mathematics to be consistent would 
restore its meaning. Thus, he wrote, “Nothing of mathematical value will be attained in 
this manner; a false theory which is not halted by a contradiction is none the less false, 
just as a criminal act not forbidden by a reprimanding court is none the less criminal.” 

At the height of this noisy battle between Géttingen and Amsterdam, the most gifted 
of Hilbert’s pupils, Hermann Weyl (1885-1955), accepted Brouwer’s main conclusions 
and began to ght actively on his side. Weyl had entered Gottingen in 1903 and remained 
there, rst as a student and then as a privatdozent, until his call to the Polytechnicum 
in Ztirich in 1913. Weyl concluded that despite the introduction of Weierstrassian rigor, 
analysis was still not well founded, and that part of classical mathematics should be 
swept away. In his monograph Das Kontinuum (1918), he wrote: 


In this little book, I am not concerned to disguise the “solid rock” on which the house 
of analysis is built with a wooden platform of formalism, in order to talk the reader into 
believing at the end that this platform is the true foundation. What will be proposed is rather 
the view that this house is largely built on sand. 


Strong words were answered in kind. Hilbert refused to accept the mutilation of 
mathematics that Brouwer’s standpoint demanded, comparing these attacks with the ear- 
lier negativism of Kronecker. Speaking in Hamburg in 1922: 


What Weyl and Brouwer are doing is mainly following in the path of Kronecker; they are 
trying to establish mathematics by throwing everything overboard that does not suit them 
and dictatorially promulgating an embargo. The effect of this is to dismember and cripple 
our science and to run the risk of losing a large part of our most valuable possessions 
... Brouwer’s program is not, as Weyl believes it to be, the Revolution, but only the repetition 
of a vain Putsch, which then was undertaken with greater dash, yet failed utterly. 


The anger and determination with which Hilbert made the above declaration is most 
readily understandable when one remembers that Hilbert made his early reputation by 
using nonconstructive methods. His existential proof of Gordan’s theorem in invariant 
theory had perplexed his contemporaries and provoked Gordan’s outraged cry of “the- 
ology.” For Hilbert’s intransigent adversary, Brouwer, it was proofs like this that had to 
be jettisoned. Because Hilbert would not follow in the treasonous rejection of the greater 
part of mathematics and set theory, the battle was joined. Those determined not to be 
driven from a “paradise” must continually struggle to protect it. 

We have now examined three rival philosophies concerning mathematical activity; 
it is pure logic for the logicist, the manipulation of abstract symbols for the formalist, 
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12.3. Problems 


constructions in the medium of temporal intuition for the intuitionist. Each of these 
schools of thought contained serious defects, and none achieved its objective of providing 
a universally acceptable approach to mathematics. In the last analysis, it seems that we 
are no closer to understanding the ultimate meaning of mathematics than the founders of 
these movements some 85 years ago. Today, foundations of mathematics is not a_ eld in 
which the same intense activity takes place as occurred in the early part of the century. 
Modern investigators are less inclined to dogmatism than Brouwer or Hilbert was, and 
no particular doctrine any longer pretends to represent the true mathematics. Despite the 
indecisive outcome of the original undertaking, the work done by the three schools had 
one great value: it revealed the extreme subtlety and complexity of the interplay between 
logic and mathematics. 


6. Let a and b be cardinal numbers and A and B be 
sets such that a = o(A) and b = o(B). Then the sum 


. The argument involved in establishing Russell’s 


paradox can be used to show that P(N) is an 
uncountable set. That is, suppose to the contrary that 
N ~ P(N) via the function f : N > P(N). Put 
B={neéeN\|n ¢ f(n)}. Because B € P(N), it 
follows that B = f(b) for some b € N. Complete the 
argument by reasoning until a contradiction is 
obtained. 


. Verify that P(R*) is uncountable, where R” is the set 
of real numbers. [Hint: Apply Cantor’s theorem. ] 


. For any sets A and B, either prove or give a 
counterexample for each of the following assertions: 
(a) If ACB, then P(A) C P(B). 

(b) P(AUB)= P(A)U P(B). 

(c) P(AN B)= P(A)N P(B). 

(d) P(A x B)= P(A) x P(B). 


. Prove that if A and B are sets for which A ~ B, then 


P(A) ~ P(B). [Hint: A function f : A > B induces 
a function f* : P(A) > P(B), dened by taking 
f*(S) = { f(s) | 8 € SH.) 


. If N denotes the set of natural numbers, establish that 


(a) The set of all in nite subsets of N is 


uncountable. 
(b) The set of all nite subsets of N is countable. 
[Hint: With each ni te subset {m),12,..., nx} 


of N, with ny <n, <... < nj, associate the 
Nk 


number 2”'3”"2--- p,“, where p, is the kth 
prime.] 


and product of a and b can be de ned as follows: 


a+b=o(AUB), provided AN B= 9; 
a-b=0o0(A x B). 


Using these de nitions, prove that 


(a) x+0=x,x-0=0,x-1=-.x. [Hint: Recall 
that 0 = o(%).] 

(b) Ro +80 = No. (Hint: Xo +80 = O(N. U No), 
where N, and No are the even and odd natural 
numbers, respectively. ] 

(c) Xo ‘ Xo => Xo. 

(d) c+&o =c. [Hint: c+ &o = o(/ U Q), where J 
and @Q are the irrational and rational numbers, 
respectively. ] 

(e) c+c=c. [Hint: 
c+c=o([0, co) U[—on, 0)).] 

(f) c-o =c. [Hint: [0, 1) x N ~ (0, co) using 
f@un)=r+n.] 

(g) c:c=c. 


(a) Could there exist a town in which the barber 
shaves all men who do not shave themselves? 
(Hint: If such a town exists, who shaves the 
barber? In this traditional conundrum, it is 
presumed that barbers are male.] 

(b) Could there exist a book that lists in its 
bibliography exactly those books that do not list 
themselves in their bibliographies? 


. Suppose that a lexicon is drawn up containing all the 


words that occur in the text of this book; names and 
punctuation marks and also mathematical symbols 
are counted as words. Let S be the set of natural 
numbers that are dened by sentences containing at 
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most 50 words (a word being counted each time it 
occurs), all of them chosen from our lexicon; then 
the set S is nite. Consider the natural number 

de ned as follows: 

Let n be the smallest natural number, in 
accordance with the usual ordering of the natural 
numbers, that cannot be de ned by means of a 
sentence containing at most 50 words, all taken from 
our lexicon. 

Show that 7 is de ne d in no more than 50 
words, but is not a member of S (this is Berry’s 
paradox). 


Let S be the set of all decimals that are de ned by 
sentences containing a nite number of words, all 
taken from our lexicon; then S is a countable set of 
real numbers r;, 72,73, .... Consider the real number 
r dened as follows: 

If the digit in the nth decimal place of r, is 
denoted by r,,,, then construct the real number 
r = 0.a,a,a3... so that in nth digit a, = 1 if 
Tnn #1, and a, = 2 1f Ty, = 1. 

Show that r is dened ina _ nite number of 
words but is not a member of S (this is Richard’s 
paradox). 


Show, as an example of a nonconstructive existence 
proof, that there exists a solution of the equation 
x” = z with x, y irrational and z rational. [Hint: 


Consider Vr] 


(a) Consider the number n = (—1)*, where k is the 
number of the rst decimal place in the decimal 
expansion of z where the sequence of 
consecutive digits 01234567890 begins; or if no 
such number k exists, then n = 0. Would the 
intuitionist accept the statement that n is either 
positive, negative, or zero? 

(b) The intuitionist views the following as a 
situation in which the statement p is not the 
same as “not not p.” Write r = 0.3333..., 
breaking this off as soon as a sequence of 
consecutive digits 01234567890 has appeared 
in the decimal expansion of z. Thus, if the 9 of 
the rst sequence 01234567890 in z is the kth 
digit after the decimal point, then 


lI 


0.33333...3(k decimals) 
10 —1 
3+ 10k 


Let p be the statement “r is a rational number.” 
Show that “not p” leads to a contradiction, so 
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that “not not p” must hold. On the other hand, 
the intuitionist would not conclude that r is 
rational, because there is no effective way of 
calculating a and b for which r = a/b. 
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CHAPTER l 3 


Extensions and Generalizations: 


Hardy, Hausdorff, and Noether 


A mathematician, like a painter or poet, is a maker of patterns. If his patterns are more permanent 
than theirs, it is because they are made of ideas. 
G. H. HARDY 


In the late decades of the nineteenth century, the study 


Hardy and Ramanujan of higher mathematics in England was centered at 


Cambridge University. And at Cambridge, any thought 
The Tripos Examination of a serious mathematical education was badly distorted 

by the local obsession with the Mathematical Tripos, the 
most notorious, most challenging test that any university has known. 

For a student taking a degree at this time, the Tripos was a series of examinations, 
spread over more than a week, and given in the middle of the fourth year of residence. 
Preparation for the Tripos became the student’s chief mathematical activity. Few under- 
graduates paid any attention to the professors: Cayley, for instance, lectured to classes of 
only two or three. Instead, most students entrusted their mathematical education almost 
entirely to private tutors or “coaches.” These coaches had all the past Tripos papers at 
their ngertips and knew all the tricks of the examiners. For handsome fees, they would 
prepare their pupils for the venerable contest. The most famous coach, Edward Routh, 
achieved remarkable results. Between 1858 and 1888 he produced 27 Senior Wranglers 
(those scoring highest on the examination) and 41 winners of the Smith’s Prize. 

Because of the exaggerated importance attached to the Tripos, candidates were more 
concerned with their place in the examination’s Order of Merit list than in learning math- 
ematics. To attain high ranking, the Cambridge coaches drove their pupils ferociously. 
They were drilled for up to seven hours a day in solving dif cult problems against time; 
a good memory and facile manipulative skills were critical, whereas rigor in argument 
was generally derided. Curiosity about mathematical topics that were not apt to appear 
on the examination was rmly discouraged. Coaches forbade the reading of great math- 
ematical works such as Laplace’s Mécanique céleste. Many insisted that their pupils 
not attend professorial lectures. The process was a terrible ordeal for students. Bertrand 
Russell, who stood as Seventh Wrangler in 1893, later remarked, “When I nished my 
Tripos, I sold all my mathematical books and made a vow that I would never look at a 
mathematical book again.” 
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The Mathematical Tripos generated the sort of interest as would a horse race: serious 
contenders for Senior Wrangler were the subject of betting by all and sundry. Sometimes 
there were surprises. William Thompson, later to be made Lord Kelvin, was easily the 
best mathematician of his year, an odds-on favorite to be rst. On the day the Tripos 
results were read, Kelvin stayed in bed late and sent his college servant out to hear the 
news. When the servant returned and Kelvin demanded to know who was second, he 
was greeted with the reply, “You, sir.” Someone in the examination hall had evidently 
memorized better, or written faster, than Kelvin. 

During the nineteenth century, classical applied mathematics had become an En- 
glish specialty, typied by the work of Green, Kelvin, Stokes, Rayleigh, and Maxwell. 
Cambridge paid homage to their achievements by making the Tripos primarily a test of 
mathematical physics. Only rarely were its problems related to physical questions not 
contained in Newton’s Principia. When it came to the Principia, students were “expected 
to know any lemma in that great work by its number alone, as if it were one of the Com- 
mandments or the 100th Psalm.” There was little on the examination that could be called 
contemporary analysis, which is to say the advanced developments dependent on in nite 
processes—limits, differentiation, integration, summation of series, and the like—and in 
particular the theories of functions of a real or a complex variable. 

The effect of the Tripos was to train some outstanding applied mathematicians but to 
stie pure mathematics. Although Cayley and Sylvester were notable exceptions, the area 
was largely ignored by students aspiring chie y to a high ranking in the Order of Merit. 
In the Cambridge of the 1880s, when the Tripos stood at the zenith of its reputation, 
English mathematics was somewhere near its lowest ebb: self-supporting, self-satis ed, 
and indifferent to the developments that were taking place on the Continent. 


The Rejuvenation of English Mathematics 


England’s isolation from the current thinking at Paris, Berlin, and Géttingen was 
broken by Andrew Forsyth (1858-1942), Cayley’s successor as Sadlerian Professor at 
Cambridge. Forsyth’s Theory of Functions of a Complex Variable, which was published 
in 1893, “burst with the splendour of a revelation.” Some hailed it as having a greater 
in uence on English mathematics than any work since Newton’s Principia. The faults of 
the Theory of Functions are much more obvious today than its merits. Because Forsyth 
had little faculty for displaying the crucial points of a delicate argument, his exposition 
is often obscure and illogical. Nevertheless, for all its shortcomings Theory of Functions 
was the work that brought the methods of modern analysis to Cambridge. 

The principal architect of an English school of mathematical analysis was Godfrey 
Harold Hardy (1877-1947). For more than a quarter of a century Hardy dominated 
English mathematics, through both the signicance of his work and the force of his 
personality. 

A product of England’s nest private schools, the young Hardy went to Cambridge 
in 1896 on an Entrance Scholarship. Starting something of a trend among the better 
students, he took Part I of the Mathematical Tripos in the third instead of the normal 
fourth year, and he nished Fourth Wrangler. (By Hardy’s day, the Tripos had been 
divided into two parts, with the advanced Part II being taken at a later stage by those 
seeking fellowships.) Relieved of two years of tedium, he was ready to begin learning 
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genuine mathematics. Hardy was later to write of the astonishment with which he read 
Camille Jordan’s Cours d’analyse de L’Ecole Polytechnique and to realize for the rst 
time “what mathematics really meant.” In 1900, he placed highest in Part II of the Tripos 
and succeeded automatically to a fellowship. Three years later Hardy received an M.A. 
degree, which was the highest academic degree awarded by Cambridge. (Cambridge did 
not offer the doctorate, a German innovation, until after the Great War of 1914-1918 and 
in hopes of attracting American students who would otherwise have gone to Germany.) 
At the expiration of this fellowship in 1906, Hardy joined the Cambridge faculty as a 
lecturer in mathematics. 

Hardy’s research specialty was “analytic number theory,” an area of mathematics 
that uses analysis to answer questions about number theory. Developments stemming 
from Bernhard Riemann’s epoch-making eight-page paper “On the Number of Primes 
Less than a Given Magnitude” (1859) had made the eld suf ciently promising. Riemann 
took as his starting point a remarkable formula discovered by Euler over a century earlier, 

1 
1/n* = | | ———_., 
where the in nite product is taken over all the prime numbers p, and the sum is over 
all positive integers n. This formula results from expanding the factor involving p as a 
geometric series 


ET a ee ee ee 
1 — 1/p* 


On multiplying these series for all primes p, we get a sum of terms of the form 
1/(pi" py? «+p, 


where pj, p2,..-, Pr are distinct primes and ky, ko,..., k, are positive integers. By the 
Fundamental Theorem of Arithmetic, the products pi , pe, ..., p* so obtained yield 
precisely the positive integers, allowing us to conclude that the sum in question is simply 
>, 1/n’. This sum, which is a function of a real variable s, is called the (Riemann) 


zeta function and denoted by ¢(s): 


(oe) 


£(s)= Do 1 /n’. 


n=1 
Because the series for ¢(1) diverges, Euler’s formula implies the existence of an 
in nitude of prime numbers; for if there were only nitely many primes, then the product 
on the right-hand side of the formula 
= 1 
Yim=T 
n=1 Pp 1— 1/ P 
would be a_ nite product and hence would have a_ nite value. 
Using the zeta function, Euler proved that the sum of the reciprocals of the prime 
numbers diverges. It is known that 
oo 


£(2)= 0 1/n? = 17/6, 


n=1 
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Godfrey Harold Hardy 
(1877-1947) 


(Courtesy of the Master and Fellows of Trinity College, 
Cambridge.) 


a result that Euler also obtained, and that ¢(4) = 14/90. One immediate consequence of 
this is that ¢(2) and ¢(4) are both transcendental numbers. A more recent (1979) gain 
along these lines was to establish that ¢(3) is irrational; but whether it is transcendental 
remains unknown. 

Riemann’s key idea was to extend the zeta function ¢(s) so that, instead of being 
restricted to a real variable, s is allowed to be a complex number s = a + bi. Among 
the many questions that can be asked about the complex function ¢(s), a paramount one 
concerns the location of its zeros. Riemann stated that ¢(s) vanishes when s = —2n, 
the “trivial zeros,” and that all other complex zeros must lie in the so-called critical 
strip 0 <a < 1. He went on to conjecture that these zeros are on the vertical axis of 
symmetry, the critical line a = 1/2; that is, if f(s) = 0 for a complex number s = a + Di, 
with 0 <a < 1, then s is of the form 1/2 + bi. This is the famous Riemann hypothesis, 
still open to proof or disproof—by universal agreement the outstanding unsolved problem 
in mathematics. Hilbert listed it as the eighth problem in his address before the Paris 
Congress of 1900. He obviously thought it incredibly dif cult, once remarking that if he 
awakened after having slept for 1000 years his rst question would be, “Has the Riemann 
Hypothesis been proved?” 

Hardy was the rst to give any sort of answer to the Riemann hypothesis when, in 
1914, he established that in nitely many zeros of ¢(s) are located on the critical line 
(this does not preclude the existence of in nitely many that are not). Current opinion is 
predominantly in favor of Riemann’s celebrated conjecture, because there is considerable 
numerical evidence in support of it. Recent computer calculations have veri ed that the 
rst 1,500,000,000 nontrivial zeros of the zeta function, ordered by the size of their 
imaginary part, lie on the critical line. 

There is a pleasant anecdote in this connection. Hardy was returning from Denmark 
to England on a day when the weather conditions in the North Sea channel were unusually 
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rough. With Fermat’s famous marginal note in mind, he sent a message to a friend: “Have 
proved the Riemann Hypothesis.” He was con dent that God would not let him die with 
such undeserved glory, so that his safe return was assured. 

Perhaps Hardy’s greatest service to mathematics in this period was his well-known 
book A Course in Pure Mathematics, published in 1908. It was designed to give the 
undergraduate student a rigorous exposition of the basic ideas of analysis—limits, con- 
tinuity, convergence of series, and the like. Writing in the preface, Hardy decried the 
neglect of analysis in England: “I have indeed in examination asked a dozen candidates, 
including several future Senior Wranglers, to sum the series 1 + x + x*+--- and not 
received a single answer that was not practically worthless.” 

Running through numerous editions and translated into several languages, A Course 
in Pure Mathematics transformed the state of university teaching. The remarkably modern 
second edition, which was brought out in 1914, acknowledges the changing curriculum. 
Hardy now introduces the Bolzano-Weierstrass theorem, Eduard Heine’s result asserting 
that a continuous function on a closed interval is uniformly continuous, and the Borel 
Covering theorem (which concerns the question: if a set S is covered by any collection 
of open intervals, under what conditions is it possible to choose a_ nite number of sets 
from the collection and still cover S?). Subsequent editions of the text were but minor 
variations of the 1914 edition. 

Hardy despised the old Tripos system and was a leading advocate of its reform. In 
his view, the excessive concentration on examination topics drew the students’ attention 
away from modern mathematics, contributing to England’s backwardness. He would 
have preferred to do away with the Tripos altogether, but settled for the abolition of the 
strict Order of Merit: degree candidates still took Part I of the examination but were 
ranked only by broad classes—Wrangler, Senior Optime, or Junior Optime—in which 
their names appeared in alphabetical order. With the adoption of the new regulations in 
1910, the practice of “coaching” disappeared almost at once. 


A Unique Collaboration: Hardy and Littlewood 


Hardy’s name is inevitably linked with that of John Edensor Littlewood (1885-1977). 
A Cambridge graduate, Littlewood had been Senior Wrangler in 1905, but the fellowship 
that normally would have been his went to someone else. After spending three years as 
a lecturer at Manchester University, he joined the Cambridge faculty to succeed Alfred 
North Whitehead. Hardy found in Littlewood a partner to help strengthen and build on 
the foundations of analysis. Together they carried on the most prolonged (35 years), 
extensive, and fruitful collaboration in the history of mathematics. They wrote nearly 
100 papers together, the last one published a year after Hardy’s death. It was often joked 
that there were only three great English mathematicians in those days: Hardy, Littlewood, 
and Hardy-Littlewood. One mathematician, upon meeting Littlewood for the rst time, 
exclaimed, “I thought that you were merely a name used by Hardy for those papers 
which he did not think were quite good enough to publish under his own name.” 

Hardy’s sympathy with Bertrand Russell’s paci sm put him in an unpleasant position 
during and immediately after the war; some of his Cambridge colleagues scarcely spoke 
to him. In 1919, he was only too ready to accept the Savilian Chair in geometry at 
Oxford. Although Littlewood—who served as an artillery of cer from 1914 until 1918— 
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remained at Cambridge, their mathematical partnership suffered no interruption. Because 
they seldom, if ever, met together to discuss or write mathematics, the only noticeable 
change was a collaboration by mail instead or by college messenger. Contemplating this, 
they drew up a set of “axioms” expressing the personal freedom of their cooperation. 
One of their principles was: if one wrote a letter to the other, the recipient was under no 
obligation to reply to it, or even to read it. Another, designed to prevent quarrels, was: it 
made no difference if one of them had not contributed the least bit to the contents of a 
paper appearing under both their names. With these guidelines, the two mathematicians 
entered the most productive decade of their far-reaching and intensive joint work. Hardy 
established a school of analysis at Oxford, gathering about him a team of colleagues and 
research students; but Cambridge, far more than Oxford, was still the center of English 
mathematics. After a lapse of 11 years, Hardy returned to Cambridge to assume its senior 
mathematical chair, the Sadlerian Professorship. He held the position until his retirement 
in 1942. 

There are few major problems in analysis or the theory of numbers to which Hardy 
and Littlewood did not make signi cant contributions. A primary interest of theirs was 
Waring’s problem. Much like Fermat’s last theorem, it is a simply stated assertion about 
the positive integers, which gives no suggestion of the dif culty or mathematical depth 
of a correct solution. The problem began in 1770 when Edward Waring conjectured, on 
the basis of empirical evidence, that every number is the sum of at most 4 squares, 9 
cubes, 19 fourth powers, and so on. It has become traditional to let g(‘) denote the least 
integer such that every positive integer can be expressed as the sum of at most g(k) 
positive kth powers. 

At about the same time that Waring recorded his conjecture, the value g(2) = 4 was 
conrmed by Lagrange in his classic “four-square theorem.” The general result that g(h) 
is nite for all & is attributable to Hilbert (1909), who gave an existence proof that shed 
no light on how many kth powers are needed. Shortly thereafter, it was shown that g(3) 
does indeed equal 9. Using the powerful techniques of analysis, Hardy and Littlewood 
established (1921) that all suf ciently large integers can be written as the sum of 19 or 
fewer fourth powers. Because 79 = 4- 24 + 15- 14 requires 19 fourth powers, g(4) > 19. 
This observation, together with the Hardy-Littlewood result, suggested that g(4) = 19 as 
Waring had guessed and raised the possibility that its actual value could be settled by 
direct, exhaustive computation. In 1986, it was nally veried that g(4) = 19. 

Another topic that drew the attention of the two collaborators was a variation of the 
Goldbach Conjecture called the “three-primes problem”: can every odd integer n > 7 be 
written as the sum of three prime numbers? In 1922, Hardy and his younger colleague 
showed that, assuming the Riemann Hypothesis holds, there exists a positive integer N 
such that every odd integer n > N is the sum of three primes. They also conjectured 
that every large integer is a sum of a prime and two squares, an assertion that was 
subsequently veri ed. 


India’s Prodigy, Ramanujan 


In 1913, Hardy “discovered” Srinivasa Ramanujan (1887—1920)—by which he meant 
that he was the rst really competent mathematician to see and judge Ramanujan’s work. 
The discovery led to an association that Hardy was to call “the most romantic incident 
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Srinivasa Ramanujan 


(1887-1920) 


(Courtesy of the Master and Fellows of Trinity College, 
Cambridge.) 


in my life.” India has from time to time produced mathematicians of remarkable power, 
but Ramanujan is universally considered to have been its greatest genius. 

Ramanujan was born in the southern Indian town of Erode, near Madras, the son 
of a bookkeeper in the shop of a cloth merchant. He began his single-minded pursuit 
of mathematics when, at the age of 15 or 16, he borrowed a copy of Carr’s Synopsis 
of Pure Mathematics. This unusual book contained the statements of more than 6000 
theorems, very few with proofs. Ramanujan undertook the task of establishing, without 
help, all the formulas in the book. In 1903 he won a scholarship to the University of 
Madras, only to lose it a year later for neglecting other subjects in favor of mathematics. 
He dropped out of college in disappointment and wandered the countryside for the next 
several years, impoverished and unemployed. Compelled to seek a regular livelihood 
after marrying, Ramanujan secured (1912) a clerical position with the Madras Port Trust 
Ofce, a job that left him enough time to continue his work in mathematics. After 
publishing his rst paper in 1911, and two more the next year, he gradually gained 
recognition. 

At the urging of in uential friends, Ramanujan began a correspondence with Hardy, 
who was by then recognized as the leading British pure mathematician. Appended to 
his letters to Hardy were lists of new theorems, 120 in all, some de nitely proved and 
others only conjectured. Hardy took the time and, with Littlewood’s help, made the 
considerable effort to analyze Ramanujan’s ndings. Their conclusion was that “they 
could only have been written down by a mathematician of the highest class; they must 
be true because if they were not true, no one would have the imagination to invent them.” 
Hardy immediately invited Ramanujan to come to Cambridge University to develop his 
already great, but untrained, mathematical talent; for up to that time Ramanujan had 
worked in almost total isolation from modern European mathematics. 
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Supported by a special scholarship, Ramanujan arrived in England in April of 1914. 
There, under the guidance of Hardy and Littlewood, he had three years of uninterrupted 
activity. Some 32 of Ramanujan’s 37 published papers took shape during the period 
1914-1917, seven written in collaboration with Hardy. Hardy wrote to Madras University 
saying, “He will return to India with a scientic standing and reputation such as no Indian 
has enjoyed before.” England gave Ramanujan such honors as were possible. The Royal 
Society—England’s preeminent scientic body—made Ramanujan a Fellow in 1918, and 
Trinity College, Cambridge, elected him a Fellow later in the same year. He was the rst 
Indian to gain either of these high distinctions. 

Even as Ramanujan’s prominence grew, his health deteriorated disastrously. In 1917 
he became incurably ill with a disease that was then believed to be tuberculosis. The 
exact nature of his illness is not known, but the decline in his health was doubtless 
accelerated by the dif culty Ramanujan had in maintaining an adequate vegetarian diet 
in war-rationed England. Early in 1919 when the seas were nally considered safe for 
travel, he returned to India. In extreme pain, Ramanujan continued to do mathematics 
while lying in bed. He died the following April, at the age of 32. 

The theory of partitions is one of the outstanding examples of the success of the 
Hardy-Ramanujan collaboration. A “partition” of a positive integer n is a way of writing 
n as a sum of positive integers, the order of the summands being irrelevant. The integer 5, 
for example, may be partitioned in seven ways: 5,4+ 1,3+2,3+14+1,2+2+1,2+ 
1+14+1,1+1+1+1+1. If p(m) denotes the total number of partitions of n, then the 
values of p(n) for the rst six positive integers are p(1) = 1, p(2) = 2, p(3) = 3, pA = 
5, p(5) = 7 and p(6) = 11. Actual computation shows that the partition function p(n) 
increases very rapidly with n; for instance, p(200) has the enormous value 


P(200) = 3,972,999, 029,388. 


Although no simple formula for p(7) exists, one can look for an approximate formula 
giving its general order of magnitude. In 1918, Hardy and Ramanujan proved what is 
considered one of the masterpieces in number theory: namely, that for large n the partition 
function satis es the relation 


ecvn 
AnJ/3 ; 


where the constant c = m(2/3)!/?. For n = 200, the right-hand side of the relation is 
approximately 4 - 10'*, which is remarkably close to the actual value of p(200). 

According to Hardy, Ramanujan could remember the idiosyncracies of numbers 
in almost uncanny ways: Littlewood is said to have remarked, “Every positive integer 
was one of his personal friends.” There is a well-known story that Hardy once visited 
Ramanujan as he lay ill in a hospital and observed incidentally that he had arrived in a 
taxi with a rather dull number, 1729. “No,” Ramanujan replied without hesitation, “it is 
a very interesting number; it is the smallest number expressible as the sum of two cubes 
in two different ways.” Ramanujan had immediately recognized that 1729 = 1° + 12? = 
93 + 10°. Hardy then asked for the smallest number that is a sum of two fourth powers in 
two different ways. After a moment’s re ec tion, Ramanujan responded that there was no 
obvious example and that the rst such number must be very large. (In fact, the number 
is 635,318,657 = 594 + 1584 = 1334 + 1347) 


p(n) © 
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As a further example of Ramanujan’s creativity, we mention his unparalleled ability 
to come up with in nite series representations for 7. Computer scientists have exploited 
his series 

1 V8 QS (4n)! (1103 + 26,3907) 


x 9801 & (nl 396%" 


to calculate the value of z to millions of decimal digits; each successive term in the series 
adds roughly eight more correct digits. Ramanujan discovered 14 other series for 1/z, 
but he gave almost no explanation as to their origin. The most remarkable of these is 


ay ei) 42n +3 
= “9 12n+4— 
This series has the property that it can be used to compute the second block of k (binary) 
digits in the decimal expansion of z without calculating the rst & digits. 

Despite the brevity of Ramanujan’s life, his in uence is still evident in many parts 
of mathematics and its allied elds. He left behind three notebooks—composed between 
1903 and 1914—recording the statements of approximately 3000 results, with scarcely 
any indication of proof. The task of editing and deciphering these notebooks is only now 
nearing completion; most of the incorporated material has been substantiated, but there 
remain many asserted theorems and identities so startling that no one knows how to derive 
them. In 1976, a “lost notebook” of Ramanujan, with more than 100 pages listing 600 
formulas one after another, was found tucked away in the Cambridge University Library. 
Apparently written after his return to India, the notebook’s discovery caused roughly as 
much stir in the mathematical world as a previously unknown symphony of Beethoven 
might generate in the musical world. Ramanujan has bequeathed an unexpected legacy 
that will keep mathematicians busy for many more decades. 


While English mathematics was waking from 


13.2 The Beginnings of Point-Set Topology its slumber, pure mathematicians on the Con- 
tinent had not been entirely idle. Convergence 
Frechet’s Metric Spaces problems connected with trigonometric series 


led Georg Cantor, during the years 1872 to 
1890, to investigate properties of certain in nite subsets of the real line. For the sake 
of such investigations, he introduced the basic concept of limit point of a set and the 
associated ideas of closed set, derived set, and dense set: 


A real number x is a limit point of a set X of real numbers provided that for every positive 
number € there is an element y of the set XY such that 0 < |x — y| <e. 


Cantor described a set as being closed if it contained its limit points, while the collection 
of limit points of a given set was called its derived set. Through these notions, he 
inaugurated the study of what is known today as point-set or general topology, that 
part of topology most relevant to analysis. Loosely speaking, point-set topology may be 
considered as an abstract investigation of the limit point concept in more generalized 
spaces of unspeci ed elements. 

The term “topology” appears to have been coined by Joseph Listing—pupil of 
Gauss and later professor of physics at Géttingen—for the title of his 1847 textbook 
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Vorstudien Zur Topologie: it is derived from two Greek words—topos meaning “place” 
or “surface,” and Jogos meaning “study.” A rival name, now obsolete, was “analysis 
situs” (situs, for “site”’). 

The development of point-set topology did not go beyond the real line, plane, and 
higher-dimensional Euclidean spaces for several decades, although the de nitions in- 
troduced there have a more general validity. This was recognized by Maurice Fréchet 
(1878-1973), who may be regarded as the creator of the rst systematic point-set theory 
in “abstract spaces.” An active researcher, Fréchet had more than 20 papers in print 
before his thesis for the doctorate at Paris was published in 1906. It is likely that this 
thesis had more impact on the mathematical world than anything else he ever wrote. 

The outlines of Fréchet’s thesis took shape in a series of ve notes that appeared in 
the Comptes Rendus of the Academie des Sciences in 1904—1905. In these articles, he 
was intent on building up an abstract theory that closely resembles the point-set theory 
of classical analysis. Because the most important and best known results of analysis were 
those based on limits, Fréchet considered abstract spaces in which limit points of sets or 
sequences are de nable. 

To ask whether x is a limit point of a set X, it is essential to be able to say when x 
is close enough to X. Fréchet suggested and explored several ways of generalizing the 
intuitive notion of “closeness,” but his most in uential proposal was the concept of what 
is now called a “metric space.” (The term was introduced by Felix Hausdorff, using the 
German name metriche Raum in his book of 1914.) Such spaces are equipped with a 
notion of distance between two points. 

Fréchet recognized that some suitable assumptions about distance are needed; and, 
for the concept to be useful, these assumptions should be broad enough to include the 
most familiar spaces of nineteenth-century mathematics. Abstracting from the real line, 
where the standard method of measuring the distance between points x and y is by 
the nonnegative number |x — y|, he dened his generalized distance d(x, y) to be a 
nonnegative real number satisfying the three conditions 


1. d(x, y) = 0 if and only if x =y. 
2. d(x, y)=d(y, x). 
3. d(x, y)< dx, z)+d(, y). 


Condition 3, whose antecedents go back at least to Euclid’s Elements (Proposition 20 
of Book I: In any triangle, two sides taken together in any manner are greater than the 
remaining one) is called the “triangle inequality.” 

By a metric space, we mean nothing more than a set XY having a distance function 
governed by the conditions above. The elements of a metric space are usually referred to 
by the generic name “points,” with the number d(x, y) being the distance from the point 
x to the point y; Fréchet used the French word ecart, meaning “difference,” for d(x, y). 

An obvious example of a metric space is Euclidean n-space, whose points are ordered 
n-tuples of real numbers. In the case n = 2, this gives the coordinate plane, and in the 
case n = 3 we get the usual three-dimensional space. For arbitrary x = (x), X2,...,Xn) 
and y = (1, ¥2,---,¥n), dene d(x, y) by 


: 1/2 
d(x, y= (33. = 0) 
i=l 
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the standard Euclidean distance between points. The set of all real-valued continuous 
functions on the closed interval [0, 1] makes for a more interesting example of a metric 
space. For two such functions f and g, we let 


d(f, g) = max{| f(x) — g(x)| |x € [0, 1]}. 


That is, d(f, g) is the largest of the values of | f(x) — g(x)| as x varies within the interval 
[0, 1], the “largest separation” of the functions. 

The existence of an idea of distance lets us formalize what we mean by saying that 
x is a limit point of a subset A of a metric space X. The essential idea is that the points 
of A different from x can be “arbitrarily close to” x: x is a limit point of 4A provided 
that for any ¢ > 0 there is some point y in A, different from x, such that d(x, y) < . 
Closed sets can be de ned exactly as in the set theory of the real line; that is, if a subset 
A CX contains each of its limit points, then we declare that it is closed. 

Using this generalized theory of limits, Fréchet carried over to metric spaces the 
familiar concepts of classical analysis that depended mainly on distance. For example, 
convergence of sequences can be de ned in this way: the sequence x), x2, x3,... of points 
on X converges to x if the sequence of real numbers d(x,, x), d(x2, x), d(x3, X),... 
converges to 0; that is, for each positive number ¢ there is a positive integer mo such 
that d(x,,x) < € for all m > no. Another central theme of analysis, that of a continuous 
function, makes sense in a metric space setting. Speci cally, if X and X’ are both met- 
ric spaces with respective distance functions d and d’, then a function f: X¥ > X’ is 
continuous at a point xeX provided that d’(f(x,), f(x)) > 0 whenever d(x,, x) > 0. 
Though the de nition may sound a bit intricate, it is a straightforward adaptation 
of the familiar de nition of sequential continuity that is found in standard calculus 
texts. 


The Neighborhood Spaces of Hausdorff 


Fréchet’s thesis laid the groundwork, but it was Hausdorff’s Grundziige der Mengen- 
lehre (Foundations of Set Theory) that marks the emergence of set-theoretical topology 
as a cohesive mathematical discipline. Until the late 1920s, the Grundztige was the most 
convenient single source from which the succeeding generation of young mathematicians 
could learn the elements of set theory and point-set topology. Eminently readable, the 
text exerted a greater in uence on the development of these subjects during their forma- 
tive years than any other work. What is perhaps remarkable is that Hausdorff was not 
primarily a topologist, nor had he published anything at all on the theory of topology or 
metric spaces prior to the appearance of the Grundziige. 

The son of a wealthy merchant, Felix Hausdorff (1868-1942) earned his doctorate 
in astronomy from Leipzig University in 1891. From 1902, when he was appointed an 
associate professor at Leipzig, his attention seems to have focused on Cantor’s theory 
of sets. Hausdorff’s lectures on the subject in the summer of 1902 were most likely 
the rst course in set theory anywhere in Germany; oddly enough, in his 44 years at 
Halle, Cantor himself never lectured on set theory. Hausdorff opted to leave Leipzig in 
1910 for an associate professorship at Bonn, where he wrote his classic textbook; the 
principal features of his theory of topological spaces based on neighborhoods were pre- 
sented in the summer semester of 1912. He subsequently (1913) accepted a professorship 
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at Greifswald, but later (1921) returned to Bonn until forcibly retired by the Nazis 
in 1935. 

Hausdorff’s approach to point-set topology was to let the notion of “neighborhood of 
a point” play the fundamental role. Neighborhoods as sets of some kind already appeared 
in Fréchet’s work, where distances were used to dene them: In a metric space X, a 
spherical neighborhood S-(x) of the point x is the set of those points y in X satisfying 
d(x, y) < €, the number e > 0 being the radius of the neighborhood. Hausdorff wanted 
to retain the concept of Fréchet’s neighborhood but rid himself of any dependence on 
distance. In the seventh chapter of the Grundziige, “Point Sets in General Spaces,” he 
de nes what he calls a topological space. It is a set X of points x, to each of which 
there corresponds a family of subsets U,, called the neighborhoods of x, which satisfy 
the conditions: 


1. To each point x there corresponds at least one neighborhood U,, and U, contains x. 


2. If U, and V, are neighborhoods of the same point x, then there exists a 
neighborhood W,, of x such that WC U.N Vy, 


3. If y is a point in U,, then there exists a neighborhood U,, of y such that U, C U,. 
4. For distinct points x and y, there exist two disjoint neighborhoods U, and Uy. 


Because spherical neighborhoods in a metric space satisfy these “neighborhood axioms,” 
Hausdorff’s topological space (or a Hausdorff space, as it is generally called today) is 
the more general concept. 

Hausdorff developed the fundamental topological concepts from his theory of neigh- 
borhoods. The idea of a limit point of a set carries over to the setting of a Hausdorff 
space X as follows: x is a limit point of a subset A C X provided that every neigh- 
borhood U, of x contains a point of A different from x. Convergent sequences can be 
similarly de ned in terms of neighborhoods, by saying that the sequence x), x2, X3,... 
converges to x if for each neighborhood U,. of x there is an integer no such that x, € U, 
for all n > no. This is just a direct extension of what occurred with metric spaces, a 
generalization of a generalization. 

The Grundztige was the source of the vigorous growth in point-set topology in the 
1920s and 1930s. Many mathematicians added new ideas and results to the eld. In the 
period right after the war a pair of young Russians, Paul Alexandroff (1896-1983) and 
Paul Urysohn (1898-1924), introduced the de nition of compactness in the presently 
accepted sense using open coverings. They also proved that a metric space is compact if 
and only if each in nite subset possesses a limit point, the so-called Bolzano-Weierstrass 
property. (Urysohn’s premature death at the age of 26 occurred when, swimming with 
Alexandroff on the coast of Brittany, he was dashed against the rocks.) The two also 
produced the rst major work on what is known as the “metrization problem.” Although 
every metric space must be a topological space, there do exist topological spaces whose 
neighborhoods cannot be specied by any distance function. The metrization problem 
was to nd topological conditions under which a topological space can be considered 
a metric space; in other words, distance de nitions under which the limit points for all 
subsets in the resulting metric space will be the same as those in the topology already 
associated with the space. The search for necessary and sufcient conditions for the 
metrizability of a given topological space was not satisfactorily concluded until the early 
1950s. 
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Banach and Normed Linear Spaces 


The theory of abstract spaces is a typical example of the internationalism of math- 
ematics. The originators of the subject were mainly French and German, but its most 
active workers in the 1920s and 1930s were Russian and Polish. After independence 
was restored to Poland in 1918, many distinguished Polish mathematicians returned to 
the country from emigration or from exile. They decided to rejuvenate the Polish math- 
ematical tradition by concentrating research on one or two branches of the subject at 
rst. One specialization chosen for this daring and novel approach was the budding the- 
ory of topological spaces. This led to what would soon be called the “Polish School” 
of mathematics: a small group of scholars with common interests, working on similar 
problems in close contact with each other. In the rst rank of the Polish School were 
such luminaries as Casimir Kuratowski (1896-1980), Waclaw Sierpinski (1882-1969), 
and Stefan Banach (1892-1945). Their rst success came in 1920 with the publication 
of Fundamenta Mathematicae, a journal devoted not to mathematics as a whole but to 
set theory and its related questions. The initial issue, consisting of 24 articles, was de- 
signed “to introduce all the Polish mathematicians who are interested in the theory of 
sets.” Fundamenta Mathematicae was immediately developed into a unique periodical 
that attracted international recognition and coworkers from abroad. The appearance of 
Banach’s doctoral thesis in the journal in 1922 is sometimes said to have marked the 
birth of functional analysis. Fundamenta Mathematicae was joined by an equally famous 
periodical, Studia Mathematica (commencing in 1929), which was primarily concerned 
with problems in functional analysis. 

Stefan Banach was the most celebrated gure in Polish mathematics during the period 
between the two world wars. He was born in Cracow, Poland, the son of a railway civil 
servant. His unmarried mother gave the baby up to be raised by a laundress as soon as 
he could be baptized. She never saw him again. Although mostly self-educated in his 
early years, by the age of 15 Banach was earning a living by tutoring in mathematics. 
He entered Lvov Polytechnical Institute in 1910 to study engineering, but he returned to 
his native Cracow four years later without having graduated. 

The Polish mathematician Hugo Steinhaus (1887-1972) is said to have recognized 
Banach’s unusual talent accidentally by overhearing Banach and another student dis- 
cussing mathematics on a park bench in 1916. Steinhaus subsequently became a close 
friend, collaborator, and mentor to Banach. He claimed that Banach was the “greatest 
discovery” of his mathematical career. 

Banach’s rst publication was a paper concerning the convergence of Fourier series. 
It was coauthored with Steinhaus and appeared in the Bulletin of the Cracow Academy in 
1918. Three years later, he started his teaching career as a lecturer at Lvov Polytechnic, 
and in the same year, the school awarded him a doctorate, even though he had never 
graduated from college. The French version of Banach’s thesis, entitled Sur les opérations 
dans les ensembles abstraits et leur application aux équations intégrales, appeared in print 
in 1922. It introduced the concept of normed linear spaces, an idea with roots in the earlier 
investigations of Fréchet and Hausdorff. Roughly speaking, a normed linear space is a 
vector space V on which there is a non-negative real-valued function known as the norm 
(the norm of x is denoted by ||x||) with the properties 
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where c is a constant. A normed linear space becomes a metric space by setting 
the distance d(x, vy) = ||x — y||. When the space is complete in this metric—that is, 
for any Cauchy sequence x), x2,x3,...in V, there is an element x of V for which 
limy oo ||X, — x|| = O—it is called a Banach space. 

The theory of Banach spaces developed into an extensive branch of functional anal- 
ysis, with numerous applications to various other areas of mathematics. The American 
mathematician Norbert Wiener (1894—1964) laid claim to having arrived at the same no- 
tion almost simultaneously and quite independently of Banach (hence, the term Banach- 
Wiener space was used for a short time), but left the eld after publishing a paper or two 
on the topic. 

In 1922 Banach accepted a teaching position at Lvov University, where Steinhaus 
was on the faculty. He rose to a full professorship ve years later. In Lvov, the journal 
Studio Mathematica was jointly founded by Banach and Steinhaus. Poland was now to 
have two main centers of mathematics—the other one in Warsaw, headed by Waclaw 
Sierpinski. The way Banach’s circle conducted research was to spend long hours each day 
at its favorite café, discussing old problems and formulating new ones. One memorable 
session lasted 17 hours interrupted only for meals. Because little was ever written down 
other than a few pencil jottings on a napkin (or the marble tabletop itself), it is said that 
the waiter wiped away more mathematics than was ever published. 

Banach’s classic Théorie des Opérations Linéaires came out in 1932, after appearing 
the previous year in a somewhat shorter Polish edition. Enormously fruitful, it opened 
up entirely new areas of research and stood for years as the standard source from which 
one learned functional analysis. Banach made equally signi cant contributions to sev- 
eral other areas. For example, in 1924 he and the mathematical logician Alfred Tarski 
together established a counterintuitive result whose proof hinges on the axiom of choice: 
brie y, that a (solid) sphere can be decomposed into a nite number of pieces that 
can then be reassembled to produce two disjoint spheres, each having the same size 
as the original one. This decomposition of the sphere is now called the Banach-Tarski 
Paradox. 

A second world war intervened in 1939, virtually halting mathematical activity 
in Europe. The war was a particular tragedy for mathematical progress in Poland: 
many of the nest Polish mathematicians were murdered or died in the concentration 
camps. When Fundamenta Mathematicae resumed publication (1945), the editors ded- 
icated the volume to their colleagues, contributors to the journal, who had perished in 
the war. 

Hausdorff was also a casualty of those perilous years. With the rise of Nazism 
in Germany, the liberal professions such as law, medicine, and teaching were rigidly 
controlled and regimented. Increasingly repressive legislation led to the dismissal of uni- 
versity professors deemed to be political or racial enemies of the state. Ernst Zermelo, 
for instance, had his (honorary) professorship at the University of Freiburg rescinded for 
refusing to give the Hitler salute. The climax came with the sweeping Nuremburg Laws 
of 1935, which deprived all Jews of citizenship. As a consequence of this “puri cation 
of education,” Hausdorff was forced to leave his position at Bonn. Although he remained 
active mathematically for several more years, his research could only be published out- 
side of Germany—most notably in Fundamenta Mathematicae. In January 1942, when 
internment in a concentration camp became imminent, Hausdorff, his wife, and his sister 
committed suicide together. 
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Show that d(x, vy) > 0 and d(x, y) = d(y, x) for all 


13.2 Problems x,y € X. Hence, a function satisfying the two given 
properties is a distance function for X. 


1. Indicate why each of the following functions fails to 6. For the metric spaces in Problems 2 and 3, sketch the 
be a distance function for R: spherical neighborhoods S;(x). 
(a) d(x,y)=|x+yI. 7. In a metric space (X, d), prove that the limit of a 
(b) d(x,y)=x?4+y?. convergent sequence x), X2,X3,... is unique. [Hint: 
(c) d(x, y)=|x?—y’|. Suppose that x, y € X are both limits of the sequence 
(d) d(x, y) = ||x|- |yIl. and show that d(x, y) = 0.] 
2. Suppose that XY is any nonempty set and for x, y e X 8. For R, with the standard Euclidean distance function 
dene d(x, y) by d(x, y) = |x — y|, determine which of the following 
0 ifx=y subsets are closed sets: 
d(x, y)= : 
1 ifx fy. (a) (0, 1]U {2}. 
Prove that (Y, d) is a metric space, called the (b) [0, co) 
discrete space. (c) {1, 1/2, 1/3, 1/4...) 
3. Let X¥ = R x R, the usual Euclidean plane. For (d) Q, the set of rational numbers 
points x = (x), x2), and y = (1, 2), verify that both 
functions below are distance functions for X: 9. Consider the metric space (R x R, d), where the 


distancefunction is given by 
(a) d(x, y) = |x) — vil + [x2 — yal. 
(b) d(x, y) = max{[x1 — yil, [x2 — yal}, that is, the i= (a= Ge 
larger of |x; — y,| and |x2 — yo]. 
for x = (x1, x2), y = (1, ¥2). Which of the subsets of 


4. Let X be the set of all conti functi 
et X be the set of all continuous functions oe Hace cloacal 


f : [a,b] ~ R. For two functions f, g € X, de ne 
d(f, g) by (a) {Q, y)lx = I. 


b b ; 0}. 
a(f.g)= f |G) gtx)lae. Rene 


(c) {@. yk? +y <1}. 
Show that (XY, d) is a metric space. [Hint: Recall that 


a : 10. Let i i ,d), Wi 
if h(x) = 0 on [a,b], then f? A(x)dx > 0; and if 0. Let x and y be points of a metric space (X, d), with 


x #y. Ife = d(x, y)/2, establish that the spherical 


b 

Ja h(x)dx = 0 for h(x) = 0, then h(x) = 0 for all neighborhoods S.(x) and S,(y) are disjoint; that is, 

x € [a, b].] Condition 4 of Hausdorff’s de nition of a topological 
5. Letd: Xx X — R be a function that satis es the space is satis e d. 


pollowames FOE allege andere: 11. For a Hausdorff topological space, express the notion 


d(x, y) = 0 if and only if x = y, of a closed set in terms of neighborhoods. 
d(x, y) < d(z,x)+d(z, y). 


No discussion of twentieth-century math- 


13.3 Some Twentieth-Century Developments ematics would be complete without men- 
; tioning Emmy Noether, considered the 
Emmy Noether’s Theory of Rings greatest female mathematician up to her 


time. Amalie Emmy Noether (1882-1935) 
was born in the small South German town of Erlangen, where her father, Max Noether, 
was a professor at the university. Much like the Bernoullis in Switzerland, the Noethers 
provide a striking example of a mathematically talented family. Max Noether (1844— 
1921) was a distinguished mathematician who played a considerable part in the devel- 
opment of the theory of algebraic functions, and Emmy Noether’s younger brother Fritz 
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Amalie Emmy Noether 
(1882-1935) 


(Stock Montage.) 


later became a professor of applied mathematics at Breslau. However, nothing in the 
woman’s early years seemed to foreshadow the unmistakable mathematical genius that 
she would later show. Somewhat reminiscent of Gauss, Emmy Noether seemed to favor 
the study of languages at rst; indeed, after graduating from secondary school, she passed 
the tests that would qualify her to teach French and English. From 1900 to 1902, she 
studied mathematics and languages at the University of Erlangen, one of two women 
among nearly a thousand students enrolled. Conditions had changed little during the 30 
years since Sonya Kovalevsky went to Heidelberg; female students, unable to enroll in 
the usual sense, could merely audit lectures on an unof cial basis and then only with the 
permission of the professor giving the course—a permission frequently denied. The one 
noteworthy difference was that a woman, having passed through the required courses 
or not, was allowed to take a nal university examination leading to a degree. Emmy 
Noether did so in the summer of 1903. 

Having decided to specialize in mathematics, Emmy Noether attended classes at 
G6ttingen during the winter of 1903. Mathematics at the University of Berlin had reached 
its peak during the “heroic period” 1855-1891, when the immense talents of Kummer, 
Weierstrass, and Kronecker provided leadership. Following Kummer’s retirement and 
Kronecker’s sudden death in 1891, with less distinguished men __ Iling principal positions, 
Berlin relinquished its primacy in mathematics. Gottingen quickly regained preeminence 
in Germany. The great tradition of the university was currently being carried on by a 
quartet of full professors: Felix Klein, David Hilbert, Hermann Minkowski, and Carl 
Runge. The legendary Klein “ruled Gottingen like a god,” but as he began to devote 
more time to administrative matters, Hilbert took over the role of leading mathematician. 
Had Emmy Noether remained at Gottingen, she no doubt would have been attracted—as 
she later was—to Hilbert’s axiomatic approach to mathematics. After only one term, 
however, she returned to Erlangen, where educational opportunities had improved to the 
point that women could now be registered and tested in the manner formerly reserved 
for men. 
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By 1907, Emmy Noether had completed her doctoral thesis, On Complete Systems of 
Invariants for Ternary Biquadratic Forms, under the tutelage of one of the most prominent 
mathematicians of the day and an old family friend, Paul Gordan. The dissertation itself, 
which was not an epoch-making enterprise, ended with a list of 331 forms written in 
symbolic notation; she was later to dismiss it as “a jungle of formulas.” As an extreme 
example of formal computation, the dissertation was entirely in line with the spirit of the 
earlier work of Gordan, whom his admiring colleagues called the King of the Invariants. 
The theory of algebraic invariants was one of the branches of mathematics much in vogue 
in the early 1900s, and when Gordan was once asked about its value, he said, “Oh, it is 
very useful indeed; one can write many theses about it.” 

Emmy Noether spent her next few years in Erlangen, publishing half a dozen papers 
and occasionally substituting for her father at the university when he was ill. During this 
time, Hilbert was working on the mathematical aspects of a general theory of relativity. 
Because he ran into problems that required a knowledge of algebraic invariants, he invited 
Emmy Noether to come to Géttingen in 1916 and assist him. Although Gottingen had 
been the rst university in Germany to grant a doctoral degree to a woman—to Sonya 
Kovalevsky in absentia and to Grace Chisholm Young through the regular examination 
process—it was still reluctant to offer a teaching position to a woman, no matter how 
great her ability and learning. Resistance was particularly high among the classicists and 
historians of the philosophical faculty, who had to vote on Noether’s “habilitation,” which 
carried with it the license to deliver lectures as a privatdozent. In a well-known rejoinder, 
Hilbert supported her application by declaring during a university senate meeting: “I do 
not see that the sex of the candidate is an argument against her admission as a privat- 
dozent; after all, we are a university and not a bathhouse.” When the appointment failed 
to win approval, Hilbert bridged the matter by letting her deliver lectures in courses that 
were announced under his name. She continued in that insecure status until 1919, when 
she at last obtained the desired position of privatdozent. Three years later she was ap- 
pointed nichtbeamteter ausserordentlicher Professor (unof cial professor-extraordinary), 
a merely honorary title that carried neither obligation nor remuneration. Subsequently, 
Emmy Noether was entrusted with a lectureship in algebra, which carried with it a very 
modest salary, the rst and only salary she was ever to be paid at Géttingen. 

Not long after Germany’s defeat in the Great War, foreign students once again 
thronged to G6éttingen because of the reputation of its great scholars. Although Emmy 
Noether never reached the academic standing due her in her own country, she nonetheless 
became the center of the most fertile group of young algebraists in Europe. According to 
Norbert Wiener (1894-1964), for many years a professor of mathematics at Massachusetts 
Institute of Technology, “Her many students ocked around her like a clutch of ducklings 
about a kind of motherly hen.” She was particularly popular with the Russian visitors; 
when they began going around the university in their shirtsleeves, some of the more 
reserved Géttingen professors dubbed the informal style the Noether-guard uniform. 

The mathematics that grew out of her papers following 1920 and the lectures she 
gave at Gottingen to the “Noether boys” made Emmy Noether one of the pioneers of 
modern algebra. Whereas classical algebra was concerned chie y with the theory of 
algebraic equations, modern algebra tends to concentrate on the study of the formal 
properties of sets on which certain abstract operations are de ned. Under the in uence 
of Hilbert’s axiomatic thinking, Emmy Noether sought a system of axioms for “rings” 
(we are indebted to Dedekind for the term itself) that would allow her to subsume a 
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number of earlier results under a general theory. These axioms appeared in 1921 in her 
now famous paper The Theory of Ideals in Rings. 

Noether was not the rst to give an abstract de nition of a (commutative) ring, 
having been preceded by Abraham Fraenkel in 1914 and Masazo Sono in 1917. She 
was apparently unaware of Sono’s work, as its publication was in an obscure Japanese 
journal during the Great War, but does cite Fraenkel as a reference in her famous paper. 
Removing several extraneous axioms from Fraenkel’s treatment, Noether gave a relatively 
modern de nition. For her, a ring R is a “system” closed under two abstract operations 
+ and -, to which she gives the names addition and multiplication; these operations are 
required to satisfy six conditions: 


The associative law for addition: (a+ b)+c=a+(b+c). 
The commutative law for addition: a+b=b-+a. 

The associative law for multiplication: (a -b)-c=a-(b-c). 
The commutative law for multiplication: a-b=b-a. 


The distributive law, for multiplication over addition: a-(b+c)=a-b+a-c. 


St oe eS 


For any a and b in R, there exists a unique element x satisfying the equation 
a+x=b, 


With this de nition the study of rings was transformed into a powerful abstract 
theory, one of the pillars of modern mathematics. (Today, commutativity of multiplication 
is not part of the de nition of a ring.) Noether also made the notion of “ideal” a central 
concept in her exposition, framing it in a general setting: an ideal of a ring R is a 
nonempty subset J such that if a and b belong to /, then so does a — b, as well asr-a 
and a-r for anyr in R. 

It is worth pointing out that Noether’s de nition of a ring is not the one in common 
use today: the current one usually species that R is a commutative group under addition. 
But this is ensured by her sixth condition, as demonstrated in the following: Let a be 
an arbitrary, but otherwise xed, element of R. Then the equation a+ x =a has a 
solution in R; denote it by 0. For any other element b in R, let 6 be the solution of the 
equation a +x =b. Thus9+a=a+06=b), and 


b+0=(+a)+0=0+(a+0)=0+a=b, 


making 0 an identity element for the operation of addition. Furthermore, the solution of 
a+x =0 will furnish the additive inverse of a, designated by —a. 

Historically, several of the fundamental notions in Emmy Noether’s abstract theory 
of ideals can be traced to the work of Dedekind, Kronecker, and Lasker. The use of 
ideal numbers in algebraic number theory was initiated by Kummer (1844), who found 
that he needed unique factorization to help to prove certain cases of Fermat’s last theo- 
rem. Kummer’s ideal numbers, though ordinary numbers, belonged to a more extensive 
eld than the one in which the factorization was attempted. Dedekind took a different 
approach; rather than suitably expand the eld at hand, he sought to restore the desired 
unique factorization by introducing certain subsets (in the same _ eld), which he called 
“ideals” in honor of Kummer’s vision of ideal numbers. Dedekind, by substituting re- 
lations among sets for relations among numbers, was able to state and prove theorems 
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analogous to the theorems on the factorization of the integers into primes. The principal 
result was that each nonzero ideal in the ring of algebraic integers in a_ xed algebraic 
number eld could be uniquely represented as a product of a nite number of prime 
ideals. This was published in a supplement to a second edition of Dirichlet’s Lectures on 
Number Theory (1871), which Dedekind edited. The idea of a decomposition theory for 
ideals in rings where a unique decomposition into prime factors does not exist seems to 
have originated with Kronecker. 

It is often said that Hilbert’s 1890 proof of Gordan’s theorem slew invariant theory; 
it is a fact that publication in the subject diminished rapidly. But if Hilbert ended the 
old ways of doing invariant theory, he opened a new chapter in the development of 
ideal theory. For framed in the language of modern algebra, his proof showed that 
any ideal in a polynomial ring is nitely generated. A complete theory of ideals for 
polynomial rings was obtained by Emanuel Lasker (1868-1941), better known to non- 
mathematicians as world chess champion for many years. Lasker, who took his doctoral 
degree under Hilbert’s guidance in 1905, established that every polynomial ideal is a nite 
intersection of primary ideals. Emmy Noether, in her 1921 paper, The Theory of Ideals 
in Rings, generalized Lasker’s primary decomposition theorem to arbitrary commutative 
rings satisfying an ascending chain condition for ideals—that is, to rings in which any 
strictly ascending chain of ideals in nite. The ascending-chain condition is a weak 
restriction; it holds in all polynomial domains over any eld and in many other cases. 
In recognition of Noether’s inauguration of the use of chain conditions in algebra, rings 
in which the ascending-chain condition for ideals hold are today called Noetherian. 
In a second important article, Abstract Construction of Ideal Theory in the Domain of 
Algebraic Number Fields (1927), Noether did for abstract rings what Dedekind had 
done for rings of algebraic numbers; namely, she formulated ve axioms that ensure the 
possibility of factoring every ideal into a nite product of prime ideals (rings satisfying 
these axioms are known as Dedekind rings). This pioneer work of Emmy Noether is a 
cornerstone of the modern algebra course now presented to every mathematics graduate 
student. 

Being relatively unknown both inside and outside of Germany, Emmy Noether 
required someone capable of popularizing the abstract theory of ideals that she had 
developed. B. L. van der Waerden, who came to Gottingen in the fall of 1924, eventually 
served in this way. Van der Waerden spent a year studying with Noether, before returning 
to the University of Amsterdam to complete his doctorate. He was, at 22 years of age, 
already regarded as one of the most gifted mathematical talents in Europe. Quickly mas- 
tering Noether’s ideas, he later gave them brilliant exposition in his two-volume Moderne 
Algebra (1930); reprinted in numerous editions and translated into many different lan- 
guages, it became the standard work in the eld. A large part of what is contained in the 
second volume of Moderne Algebra must be regarded as Noether’s property. 

While Emmy Noether and her school were making the abstract side of algebra the 
fashion, G6ttingen was clouded over by the threat of coming political events. Then, 
during the spring of 1933, the storm of the Nazi revolution, that modern Black Plague, 
swept over Germany. On January 30, the aged and confused President von Hindenburg 
resolved a parliamentary impasse by appointing Adolf Hitler to the post of chancellor. 
Although the decision was applauded by many, Hindenburg’s former comrade-in-arms, 
Ludendorff, saw the future more realistically. Two days later he wrote to the president, 
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“Because of what you have done, coming generations will curse you in your grave.” 
In the March elections, Hitler won 44 percent of the popular vote, the most he ever 
received under free conditions. Although the election did not bring him a majority of 
seats in the Reichstag, he achieved a majority by banning the Communist deputies and 
arresting a number of Socialists. Hitler asked the Reichstag for sweeping powers that 
would allow his government to dispense with constitutional procedures and limitations 
as it carried through “the political and moral disinfection of public life.” With the pas- 
sage of the Enabling Act on March 23, 1933, Germany’s fate was sealed for the next 
12 years. 

The total elimination of Jewish in uence in Germany had been a Nazi obsession from 
the outset. Premonitions of the horrors to come were found in the law for the restoration 
of the professional civil service (April 7) and its supplementary decrees, which deprived 
Jews of their positions in the state bureaucracy, the judiciary, the professions, and the 
universities. The law against the overcrowding of German schools and institutions of 
higher learning (April 25) deprived their children of the right of higher education. 

Among the many victims of these invidious measures was Emmy Noether. Sum- 
marily placing her on leave until further notice, the new rulers of Germany deprived her 
of even the modest position she had in Gottingen. Despite the efforts of Hilbert to have 
her reinstated, Emmy Noether, as well as other Jewish professors, was in a hopeless 
situation. Forced to emigrate from her native land, she accepted a visiting professorship 
at Bryn Mawr College, close to Philadelphia, beginning in the fall of 1933. This con- 
venient location, close to the Institute for Advanced Study, allowed her to give weekly 
lectures at the newly founded institute. These activities were cut short however, by her 
sudden death in 1935 from complications following an operation that seemed to have 
been completely successful. 

Like Sonya Kovalevsky, the greatest woman mathematician before the twentieth 
century, Emmy Noether died at the height of her career. Beyond that, the two female 
scholars had little in common. Whereas Kovalevsky was able to enthrall the middle- 
aged Weierstrass as much with her beauty as with the depth of her mind, “no one 
would contend,” wrote Hermann Weyl of Noether, “that the Graces stood by her cradle.” 
Heavy of appearance and loud of voice, she “looked like an energetic and nearsighted 
washerwoman.” Kovalevsky was as fully gifted in her literary talents as in mathematics. 
She wrote poetry, a novel, popular articles on literary and scientic themes, as well as 
an autobiography. She even shared in writing a play. But Noether’s only true and lasting 
love was mathematics: a simpler, less tormented personality, she may have been the 
happier of the two women. Hermann Wey] described her as “warm like a loaf of bread,” 
adding that “there radiated from her a broad, comforting, vital warmth.” In delivering 
her eulogy her old friend Weyl best summed it up: 


Two traits determined above all her nature: First, the native productive power of her math- 
ematical genius. She was not clay, pressed by the artistic hands of God into a harmonious 
form, but rather a chunk of human primary rock into which he had blown his creative breath 
of life. Second, her heart knew no malice: she did not believe in evil—indeed it never en- 
tered her mind that it could play a role among men. This was never more forcefully apparent 
to me than in the last stormy summer, that of 1933, which we spent together in Gottingen. 
The memory of her work in science and of her personality among her fellows will not soon 
pass away. She was a great mathematician, the greatest, I rml y believe, that her sex has 
ever produced, and a great woman. 
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Von Neumann and the Computer 


Developments in the German universities following the spring of 1933 are well 
known. Mass dismissal of “racially undesirable” professors took place, accompanied 
by political appointments and promotions for those who conformed to the ideas of the 
Nazi regime. Books were burned and boycott lists drawn up, and manuscripts had to 
be submitted for censorship. Academic self-government was lost. The universities never 
recovered from the expulsion of the Jewish professors and the voluntary resignations of 
those scholars who realized that serious study would be impossible under the totalitarian 
government. Within the rst year of Hitler’s regime, faculties showed an average numer- 
ical decline of 16 percent, with 45 percent of the established positions changing hands 
over the next ve years. The most promising of the displaced scholars were to give other 
lands the bene t of their intellectual energy and imagination, while the chairs that they 
might have Iled with distinction in their own country fell to lesser talents. Robbed of all 
independence, respected academic institutions were changed into “brown universities.” 

Hilbert was left practically alone in an “empty” G6éttingen—the honorable mathe- 
matics tradition rst kindled by Gauss, Dirichlet, and Riemann now broken. Among the 
scholars of whom the university had once been proud, Emmy Noether, Richard Courant, 
Hermann Weyl, Otto Neugebauer, Felix Bernstein, Hans Lewy, and Paul Bernays had all 
taken refuge outside of Germany. In the phrase of Weyl, “Gdttingen scattered into the 
four winds.” When Hilbert was asked by the Nazi minister of education how mathematics 
was progressing at Gottingen now that it was freed of Jewish in uence, he could only re- 
ply, “Mathematics at Gottingen? There is really none any more.” The era of mathematics 
on which Hilbert had impressed the seal of his spirit had drawn to a close. 

Many an American university reaped the benet of Hitler’s insane racial policies 
by adding one or more German mathematicians to its faculty. During the rst wave of 
emigration that began in 1933, Princeton chose Salomon Bochner (Munich); Yale, Max 
Zorn (Halle); Pennsylvania, Hans Rademacher (Breslau); New York University, Richard 
Courant (Géttingen): University of Kentucky, Richard Brauer (K6nigsberg); and the list 
could be extended. In Princeton, the presence of Hermann Weyl and Albert Einstein 
made the Institute for Advanced Study something of a reception center for refugee math- 
ematicians and physicists. (When the Institute for Advanced Study was founded in 1933, 
the original six professors in its School of Mathematics were J. W. Alexander, A. Ein- 
stein, M. Morse, O. Veblen, J. von Neumann, and H. Weyl; Kurt Gédel accepted an 
offer of permanent membership in 1938 after the German annexation of Austria.) As 
Nazism continued to spread over Europe, more and more of the mathematicians who had 
been driven from their homelands made their way to the United States. Emil Artin, Paul 
Erdés, Kurt Friedrichs, Richard von Mises, Georg Polya, Stanislaw Ulam, Andre Weil, 
and Antoni Zygmund—names that are familiar to American mathematicians—all joined 
the exodus to more friendly surroundings. The enrichment of American mathematics by 
this massive injection of European talent helped raise it to new heights. 

By the early 1930s, American mathematics had gradually evolved into a collection 
of formal, abstract specialties, detached from physical substance. Applied mathematics 
had always been a strong component of European universities, but in the United States 
its development was for the most part left in the hands of physicists and engineers. The 
wave of refugee mathematicians dramatically and permanently altered this situation. The 
newcomers were instrumental in creating major research programs of an applied nature 
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at several universities, most prominently at New York University under the guidance of 
Richard Courant (1888-1972). 

Richard Courant had been the director of Géttingen’s Mathematics Institute since 
it was established in 1929. In April of 1933, he was one of 18 men who, because they 
were Jewish or had Jewish wives, were summarily dismissed or placed on leave from 
the institute. After a year spent at Cambridge University, Courant accepted an offer from 
New York University to head its very modest graduate mathematics department. He was 
soon joined by two other emigrés from Géttingen, Hans Lewy and Kurt Friedrichs. The 
small group of mathematicians gradually expanded, aided by generous grants from the 
Rockefeller Foundation and later by government funding for numerous defense-related 
projects. In the immediate postwar years, Courant established the school’s Institute for 
Mathematics and Mechanics, with its accompanying journal Communications for Pure 
and Applied Mathematics. 

Courant had long envisioned creating a center for advanced training in mathematical- 
scientic research, an American counterpart of Gdéttingen’s once-mighty Mathematics 
Institute. The opportunity came in 1952, when the Atomic Energy Commission selected 
New York University as the site of its only high-speed electronic computer, a UNIVAC. 
As the new machine had to run continuously, the university was allowed use of it half of 
the time. Within a year, New York University’s Institute of Mathematical Sciences was 
formed, with Courant as its director. On his retirement from the position in 1958, this 
“capital of applied mathematical analysis” was formally renamed the Courant Institute 
of Mathematical Sciences. 

The most brilliant mathematician among the displaced Europeans was John von 
Neumann (1903-1957). He was born in Budapest, Hungary, the eldest of three sons of 
an afuent Jewish banker. Once his unusual mathematical promise was recognized— 
about the age of 10—he was regularly tutored at home by university professors. Von 
Neumann enrolled in the University of Budapest in 1921 to study mathematics, with the 
understanding that he would attend only at the end of each term when course examinations 
were given. Because his father wanted the boy to obtain a practical education, he also 
entered (1923) the Federal Institute of Technology in Zurich. The result of this unorthodox 
arrangement was the award of two degrees at about the same time: an undergraduate 
degree from Zurich in chemical engineering (1925) and a Ph.D. in mathematics from 
Budapest (1926). After graduation, von Neumann was a privatdozent at the University of 
Berlin from 1926 to 1929, and one in Hamburg during the academic year 1929-1930. His 
reputation in mathematics was established during these few years. Working at a prodigious 
rate, he developed the mathematical foundations of quantum mechanics, created game 
theory as a full- edged discipline, proposed a set of axioms for set theory quite different 
from those of Zermelo and Fraenkel, and extended Emmy Noether’s algebra to the study 
of “rings of operators” (now called von Neumann algebras). 

Aware of the deteriorating political situation in Germany, von Neumann decided to 
emigrate to the United States. He rst spent a term at Princeton University in 1930 as 
a visiting professor and a year later obtained a permanent position there. Then, in 1933, 
he was invited to become one of the original members—the youngest—of the newly 
established Institute for Advanced Study. 

In his 1928 paper Zur Theorie der Gesellschaftsspiele (Theory of Parlor Games), 
von Neumann initiated the mathematical study of games. His interest in the topic was 
reawakened a decade later when the Austrian economist Oskar Morganstern arrived at 
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Princeton. They believed that certain competitive economic situations could be effec- 
tively modeled by suitable games of strategy. The result of their collaboration was the 
now-classic 600-page The Theory of Games and Economic Behavior, which appeared 
in 1944. The von Neumann—Morganstern treatise covered cooperative games (that is, 
games in which the participants are free to cooperate to their mutual advantage) in de- 
tail, both in elaborating on von Neumann’s earlier work and providing applications to 
actual games; the theory for the more general case was complicated and unconvincing. A 
major step toward a comprehensive mathematical economics was a short Princeton Ph.D. 
thesis entitled “Non-cooperative Games,” written by John Nash at the age of 21. Nash’s 
dissertation was only 27 pages long and did not seem to be particularly meritorious at 
the time—vol Neumann labeled it “trivial” on a rst reading—but it would later earn 
him the 1994 Nobel Prize in Economics. 

Von Neumann was equally at home in applied mathematics and theoretical physics 
as in pure mathematics. With the outbreak of World War II, he was called on for advice in 
a wide range of scientic activities related to the defense effort. Most notably he served 
as a consultant at the Los Alamos Laboratory on the method of implosion for detonating 
the atomic bomb. He was appointed a member of the Atomic Energy Commission in 
1954, retaining the position until 1957, the year of his untimely death from cancer. 
Von Neumann was deeply involved in the latter years of his career with the logical 
design of electronic computing equipment. He proposed (1946) that the Institute for 
Advanced Study build a computer whose unprecedented speed and power would leap- 
frog all devices then existing or being planned. The machine—the AIS, as it would be 
called—was in working order in late 1951 and used effectively for scientic computation 
throughout the 1950s. To satisfy its governmental patrons, the initial test-run of AIS 
was a long series of calculations connected with the design of the hydrogen bomb. 
A duplicate of von Neumann’s computer later employed at Los Alamos was known 
as MANIAC (Mathematical Analyzer, Numerical Integrator, Automatic Calculator), but 
more affectionately called JONIAC in a borrowing from von Neumann’s name. 

Despite the great breadth of his knowledge, von Neumann admitted to begin occa- 
sionally daunted by the accelerating progress and complexity of modern mathematics. 
He once remarked that, although 30 years earlier a mathematician could have more than 
a passing familiarity with all of the subject, this was no longer possible. When asked 
as to what percentage of all mathematics one could aspire to understand in his day, von 
Neumann replied with a smile, “about 28 percent.” 

Among the many refugees who ed the political situation in Europe were a number 
of talented young people who would enrich the next generation of American research 
mathematicians. Peter Lax (b. 1926) and Walter Feit (1930-2004) were two of them. 
Lax was born in Budapest, Hungary. He and his parents were ship-bound for New York 
when the United States entered the Second World War. After spending a year in high 
school to improve his English, he enrolled at New York University and began taking 
graduate courses. Lax was drafted in 1944 and assigned to Los Alamos, New Mexico, 
where the rst atomic bomb was being developed. Returning to New York University 
at the end of the war, he received his doctorate in 1949 as a student of Kurt Friedrichs. 
In 1951, Lax joined the university’s faculty and rose quickly to become a professor 
seven years later. He served as director of the Courant Institute from 1972 until 1980. 
The Norwegian Academy of Science awarded its Abel Prize to Lax in 2005 for his 
impressive contributions to the theory and applications of partial differential equations. 
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Regarded by many mathematicians as Norway’s equivalent of a Nobel Prize, the award 
carried an amount of six million Norwegian crowns, or about $980,000. 

Walter Feit was born in Vienna, Austria. Two days before the Second World War 
began in 1939, his parents put him on “Children’s Transport,” the last train that was 
permitted to take Jewish children out of the country. He never saw his parents again, 
nor did he know what happened to them. Upon graduation from Oxford Technical High 
School in England, Feit relocated with relatives in the United States. He received a 
master’s degree from the University of Chicago in 1951 and was awarded a doctorate by 
the University of Michigan four years later. Following a decade on the faculty at Cornell 
University, Feit settled permanently at Yale University in 1964, where he remained 
until his retirement in 2003. In 1963, Feit and his contemporary John Thompson won 
international acclaim for their resolution of a 50-year-old problem in group theory called 
the Burnside Odd Order Conjecture. 

William Burnside (1852-1927) had given the rst comprehensive treatment in En- 
glish of group theory in his The Theory of Groups of Finite Order, “order” signifying the 
number of elements in a group. In the rst edition of this classic work, which appeared 
in 1897, Burnside remarked that “No noncommutative simple group of odd order is at 
the present known to exist.” (A simple group is one which has no “normal” subgroups 
other than the whole group itself and the trivial subgroup consisting only of the identity 
element.) By the time the second edition of Burnside’s textbook was published in 1911, 
he had examined the nite groups of order less than 40,000. This effort convinced him 
to suggest that “noncommutative simple groups of odd order do not exist.” The prophecy 
was at last conrmed in the Feit-Thompson paper in 1963. Their proof required a full 
255-page issue of the Pacific Journal of Mathematics; many view it as the most in uential 
article ever written on the theory of nite groups. 


Women in Modern Mathematics 


Among the more signicant developments of the post-World War II period was 
the increasing role women played in the mathematics community. Where earlier in the 
century they tended to spend their professional lives teaching at women’s colleges or 
undergraduate colleges, many now were trained to pursue careers in both teaching and 
research. But there were often barriers to be overcome, as experienced by the American 
logician Julia Bowman Robinson (1918-1985). 

Julia Robinson obtained her doctoral degree from the University of California at 
Berkeley under the supervision of the Polish emigré Alfred Tarski. Over the next two 
decades, Robinson did pioneering work leading to the solution of the Tenth Problem 
on Hilbert’s famous list of 23 problems. This deceptively simple-sounding problem lies 
in the border of number theory and logic. It asks for a computational algorithm that 
will determine whether a given polynomial equation with integer coef cients will have 
a solution in the integers. Robinson’s results on the behavior of solutions of certain 
diophantine equations enabled the 22-year-old Russian mathematician Yuri Matijase- 
vich to answer the Tenth Problem, negatively, in 1970: There is no such computational 
algorithm. 
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In the late 1960s, there were still vestiges of the antinepotism rules restricting the 
employment of husband and wife in the same academic department. Because Robinson 
was married to a Berkeley mathematician, she was not offered a position that her re- 
search accomplishments merited. Only after being elected to the mathematics section 
of the National Academy of Sciences in 1976—the rst woman so honored—was she 
appointed to a professorship; a metamorphosis from part-time lecturer to full professor 
almost overnight. In 1982, Robinson became the rst woman president of the American 
Mathematical Society. In the following year she was awarded a MacArthur Prize, the 
prestigious “genius award,” which carried an annual stipend of $60,000 for ve years. 

An equally accomplished midcentury mathematician was England’s Mary L. 
Cartwright (1900-1998). She earned her Ph.D. from Oxford University in 1930 where 
her thesis advisors were G. H. Hardy and E. C. Titchmarsh. Then Cartwright accepted a 
three-year research fellowship at Girton College, Cambridge, marking the beginning of 
a lifelong association with the college. She joined Girton’s faculty as a lecturer in 1934 
and assumed the duties of director of mathematical studies two years thereafter. Still 
later, in 1949, Cartwright answered the call to become mistress of Girton, remaining in 
the administrative position until her retirement in 1968. 

Cartwright published over 90 papers, the most important of which involved joint 
work with John Littlewood. Their collaboration began in 1938, shortly before Britain 
entered World War II, and lasted some 10 years. At the government’s request, they 
analyzed the behavior of the solutions of certain dif cult nonlinear differential equations 
occurring in connection with radar. These investigations played a signi cant role in the 
development of the modern theory of dynamical systems. Cartwright visited the United 
States in the spring of 1949, lecturing on her research interests at Stanford, UCLA, 
and, more extensively, at Princeton. Because Princeton had no women as professors, the 
university of cially listed the noted mathematician simply as a consultant to the Of ce 
of Naval Research. 

Cartwright, like Julia Robinson, was in many respects a pioneer. In 1947 she was 
elected a Fellow of the Royal Society of London, the rst woman mathematician to 
be named a Fellow. (Women had been members of the society since 1945.) In 1961, 
Cartwright also became the rst woman to hold the ofce of president of the London 
Mathematical Society. Her contributions were recognized with two of England’s pres- 
tigious awards: the Sylvester Medal of the Royal Society (1964) and the DeMorgan 
Medal of the London Mathematical Society (1968). Then, in 1969, she was honored by 
the queen, becoming Dame Mary Cartwright, Commander of the British Empire. 

Mathematicians in the twentieth century saw a great incursion of new ideas. The 
branches of mathematical investigation were enormously widened, and its methods pro- 
foundly deepened. Perhaps the most far-reaching development has been the effect of 
the “computer revolution,” which can be regarded as a continuation of the Scienti c- 
Industrial Revolution, on the discipline. The realization of Babbage’s dream of a fully 
automatic calculating device was the Automatic Sequence Controlled Calculator (ASCC), 
later known as the Mark I. The rst general-purpose electromechanical computer, it was 
completed in 1944 as a joint enterprise between Harvard University and the International 
Business Machine Corporation. The Mark I weighed ve tons, contained 500 miles of 
wiring, and accepted operations instructions prepunched into a roll of paper tape. Its 
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multiplication operation was done by the addition of partial products, just as it would 
have been done by assembling a set of Napier’s bones. The machine was kept running 
24 hours a day and could handle in a day calculations that would have taken six months. 

Scarcely was the Mark I built before it was antiquated by the need for ever-faster 
computations during the war years. The rst computer to use vacuum tubes, rather than 
electrical relays and mechanical parts, was the Electronic Numerical Integrator and Cal- 
culator (ENIAC). Appearing in 1946, it was designed and constructed by the University 
of Pennsylvania to produce ballistic ring tables for the U.S. Army. Perhaps the bulkiest 
computer ever made, the ENIAC weighed about 30 tons and occupied 30 by 50 feet of 
oor space. The machine carried out calculations 1000 times faster than its nearest rival, 
the Mark I; 7 was computed to 2035 decimal places in 70 hours. But its 18,000 vacuum 
tubes were a serious drawback, requiring too much power and producing too much heat. 
(An enduring—if questionable—legend is that every time the ENIAC was turned on, 
lights dimmed in all the houses in West Philadelphia.) 

The experience developed during wartime led to the Universal Automatic Computer 
(UNIVAC), the rst computer designed for commercial purposes. A direct descendant of 
the ENIAC, it needed only 5000 vacuum tubes and relied on magnetic tape storage of 
data instead of punched cards. The initial machine was installed at the Bureau of Census 
in 1951, where it was used continuously over the next 12 years before its retirement 
to the Smithsonian Institution. The UNIVAC received considerable publicity when the 
CBS network used it to provide early election-night predictions of the 1952 presidential 
race. Although the network’s political pundits were skeptical of the machine’s forecast, 
Eisenhower was elected by almost exactly the landslide voting margin rst predicted by 
UNIVAC. 

One of the systems engineers contributing to the development of the UNIVAC was 
Grace Murray Hopper (1906-1992). After earning a doctorate in mathematics from Yale, 
she taught at Vassar College before joining the U.S. Naval Reserve. Assigned by the 
Navy to the Harvard Computer Laboratory in 1944, Hopper wrote the original manual 
of operation for the Mark I. (Once when the machine had stopped executing, she used 
tweezers to pull a dead moth out of a relay. The remains of the now-famous moth are 
preserved with plastic tape in the logbook along with the note, “First actual case of 
bug being found.”) Over the years, Hopper continued to work on computer software 
design both for the Navy and for industry. Her pioneering work with compilers, and her 
ideas about what programming languages should be, led to her introduction in 1957 of 
“FLOW-MATIC,” the rst English-language data processing compiler. The existence of 
FLOW-MATIC greatly facilitated the development in the 1960s of the business-oriented 
language COBOL. When she retired in the mid-1980s with the rank of Rear Admiral, 
she had become the oldest of cer on active duty in the Navy. The retirement ceremony 
was held aboard “Old Ironsides,” the USS Constitution, the oldest American warship 
still in commission. The best-known woman in computer science, Hopper received 
honorary degrees from more than 40 colleges and universities. 

Grace Hopper’s lifetime saw extensive re nements in machine computation. By the 
late 1950s, vacuum tubes were superseded by transistors, which generated little heat and 
provided long service. The next decade saw the miniaturization of electronic circuitry; 
it wasn’t too many years before a million transistors could be replaced by a single chip 
of silicon. Computers accordingly became smaller, more powerful, and low-cost: instead 
of Iling rooms, machines now sat on desks. Just as mechanical machines created the 
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Industrial Revolution, computing machines became the engines powering the modern 
revolution of information. 


A Few Recent Advances 


Paul Erdés (1913-1996) was one of the towering gures in twentieth-century mathe- 
matics, certainly the most prolic writer on the subjet since Leonhard Euler. He published 
over 1500 mathematical articles with the aid of some 500 coauthors. In recognition of 
these contributions, Erdés was elected to membership in eight scientic academies and 
received honorary degrees from 15 universities. 

Erdés was born in Budapest, Hungary, to parents who were both high school teachers 
of mathematics. Except for three years at school, he was educated at home. In 1930, Erdés 
entered Péter Pagmanty University in Budapest and graduated four years later with a 
doctorate in mathematics. His thesis concerned prime numbers in arithmetic progressions. 
Shortly thereafter, Erdés was awarded a research position at Manchester University in 
England—a position that did not require him to teach. But as the political situation in 
Europe worsened, he decided to accept instead a fellowship at Princeton’s Institute for 
Advanced Study for the academic year 1938-1939. Surprisingly, the fellowship was not 
renewed at the end of the year. This led to a period in which Erdés was either unemployed 
or held short-term research grants at various American universities. In 1952, he was able 
to secure a permanent post at Notre Dame University. 

Two years later, Erdés attended a mathematics conference in Amsterdam. Unfortu- 
nately, he neglected to rst secure a reentry visa. When he tried to return to the United 
States, the American immigration authorities denied him reentry. It is possible that his 
connection to Hungary (at that time regarded as a Communist country) was a factor, but 
no reason was given. Nine years passed before Erdés was nally able to return to the 
United States. 

Erdés had no permanent home. With a mathematical career centered on proposing 
and solving problems, he traveled around the world to almost every center of mathe- 
matical activity. He would often appear unexpectedly at the home of one of his many 
friends, his legendary suitcase in hand. Upon his arrival, he would propose mathematical 
problems at various levels of dif culty, often collaborating on several resulting papers. 
He seldom stayed longer than two months in any one place. Erdés’s style of life required 
little money: much of what he did earn from lecturing at mathematics conferences he 
gave to help students, or donated as prizes for solving problems that he had posed. 

While Erdés contributed to many areas of mathematics, most of his papers dealt with 
number theory. He was particularly pleased with the “elementary” proof of the prime 
number theorem, which both he and Atle Selberg provided in 1948. In 1975, he joined 
with John Selfridge in conrming the 150-year-old conjecture that the product of two or 
more consecutive integers is never a square or other integral power: explicitly, that the 
equation 


(n+ 1)(n+2)---(‘24+k)=x" 


has no solution for integers k, r > 2 andn > 0. 
Another in uence on the direction of twentieth-century mathematics is the work of 
Nicholas Bourbaki. The name is not that of a single person but rather a pseudonym 
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initially adopted by a group of 10 or so young French mathematicians. They named 
their group after Charles Bourbaki, a French general during the Franco-Prussian War. 
Several members, most notably André Weil, Henri Cartan, Claude Chevalley, and Jean 
Dieudonné would become world renowned. 

The Bourbaki group gathered in a Paris café in December 1934 to discuss writing 
a new textbook on calculus for French university students, to replace Edouard Goursat’s 
sadly outdated but widely used Traité d’Analyse. Their project grew into something much 
more ambitious, that is, developing with utmost rigor the essentials of modern French 
mathematics. 

Bourbaki called the proposed work Eléments de Matheématique. Its content would 
be organized into six Books (Livres), each consisting of several subsections. 

The scope of the undertaking required a change of location. Beginning in mid-1935, 
Bourbaki would meet three times a year in a countryside estate—one week each in the 
spring and the fall with a two-week session during the summer. The members assumed 
that it would be dif cult for an older person to absorb new ideas, so they made it a rule 
that each person must leave the group at the age of 50. The group’s manner of writing 
was to be collaborative: everyone was expected to participate. After a rst draft of a 
topic was nished, it would be read aloud and revised as many times as necessary to 
obtain unanimous approval. 

It is not surprising that the mathematics being written became broader in scope than 
originally planned. The rst book of the E/éments appeared in 1939. It was entitled Set 
Theory and consisted of four subsections, along with historical notes and challenging 
exercises for the reader. Emphasis was placed on strict formalism and adherence to an 
axiomatic development. By 1959, all six books had been completed. In their order of 
publication, they are I. Set Theory, II. Algebra, III. Topology, IV. Functions of a Real 
Variable, V. Topological Vector Spaces, and VI. Integration. After the rst six books 
were published, the size of Bourbaki increased to 15 or 20 members (and some members 
were approaching the age of 50 years). The newer mathematics was considered to be 
more sophisticated, and two more books were published: VI. Commutative Algebra and 
VII. Lie Groups. On several occasions, Bourbaki considered that its work was _ nished, 
but new mathematics invariably led to additional subsections or books. 

Rapid advances in computer technology have led to an intriguing interplay between 
mathematician and machine. The computer has been an invaluable research tool in fur- 
nishing counterexamples to outstanding conjectures or in verifying conjectures up to 
specic numerical bounds. For example, Goldbach’s conjecture has been con rmed for 
the rst 2-10!* even integers: a factor, which has roughly 7000 decimal digits, has 
been found for the immense number 22) + 1; the initial 10 billion zeros of the zeta 
function have been calculated; and the expansion of z has been carried out to just 
more than 260 billion decimal places. In such instances of large-scale calculation, the 
computer serves to generate new data. As for the theorems, its use in assisting nu- 
merically to prove new results is as yet rather rare. There have been some remarkable 
successes with previously intractable problems; but perhaps the computer’s most impres- 
sive contribution to mathematics was the veri cation in 1976 of the famous Four-Color 
Conjecture. 

For more than a hundred years, the Four-Color Conjecture was one of the most 
popular and challenging problems in mathematics. In nontechnical terms, the conjecture 
is usually stated as follows: any conceivable map drawn on a plane or on the surface of 
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a sphere can be colored, using only four colors, in such a way that adjacent countries have 
different colors. Adjacent countries are those that border one another along a line, rather 
than having just a nite set of points as a common boundary. Moreover, the territory 
of each country must consist of a single connected region. In the case of a map of the 
United States, for example, Arizona and Colorado may be colored the same because 
they meet only at a point, but the two physically separated pieces of Michigan must be 
colored differently. 

If practical mapmakers were aware of the Four-Color Conjecture, they certainly 
kept the secret well. The rst known document indicating the problem is a letter dated 
October 23, 1852, from Augustus De Morgan to his friend William Rowan Hamilton, the 
inventor of quaternions. Earlier in the month, Francis Guthrie (1831-1899) had noticed 
that four colors suf ce to distinguish the various counties on a map of England. Francis 
asked his younger brother Frederick, still a student of De Morgan at University College 
London, if it could be shown mathematically that coloring any map would require only 
four colors. Unable to answer his brother’s question, Frederick brought the problem to 
the attention of De Morgan, who could not nd any method for determining its truth or 
falsity. For his part, Hamilton failed to recognize the conjecture’s importance, replying 
merely that he did not wish to work on this “quaternion of colors” soon. In fact, he never 
tried it at all. 

The coloring problem was entirely neglected for a quarter of a century. Other English 
mathematicians learned of it in 1878, when Arthur Cayley presented the conjecture at a 
meeting of the London Mathematical Society. The rst printed reference to the problem 
is a four-line report of Cayley’s remarks, “On the Colouring of Maps,” which appeared 
in the Society’s Proceedings. Interest was immediate. Arthur Kemp (1849-1922), a bar- 
rister and member of the society (and the author of a short, celebrated book with the 
provocative title How to Draw a Straight Line) published a paper in 1879 in the newly 
founded American Journal of Mathematics. In it he claimed to prove that four colors suf- 
ce for coloring any map on a sphere. For more than a decade, Kemp’s extremely clever 
argument was accepted; but in 1890, Percy Heawood pointed out a fatal aw in the rea- 
soning. Heawood’s modest paper of six pages, “Map-Colour Theorems,” was not entirely 
destructive: it included a simpli cation of Kemp’s proof showing that each map drawn 
on the plane or sphere can be colored by at most ve colors, the Five-Color Theorem. 

Heawood’s analysis of Kemp’s purported proof showed that the problem is more 
subtle than had rst been believed. During the subsequent years, it attracted the attention 
of dedicated amateurs and distinguished mathematicians, inspiring progress in the 
development of mathematical methods yet always denying the nal step of proof. One 
signi cant advance occurred in 1922 when it was shown that an arbitrarily drawn map 
of 25 or fewer countries is four-colorable; thus, any counterexample to the conjecture 
would have to be a map of at least 26 countries. This lower bound was gradually raised, 
nally reaching 96 countries before all such results were rendered super uous. This is 
because in the summer of 1976, the Four-Color Conjecture was nally conrmed by 
Kenneth Appel and Wolfgang Haken of the University of Illinois. The two colleagues 
presented their proof at a meeting of mathematicians in Toronto—to be rewarded with 
only polite applause for the solution to such a longstanding problem. Shortly later, a full 
account was published. The question seemed to be whether they had actually provided a 
“rigorous demonstration” of the Four-Color Conjecture. Their argument contains several 
hundred pages of complex detail, requiring more than 1200 hours of time on a large 
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computer. The coloring of certain complex con gurations is deduced from the coloring 
of others involving fewer regions, thereby “reducing” the type of map that needs to be 
considered. Appel and Haken found 1936 reducible cases, each of which involved a 
computer search of up to 500,000 logical options to conrm its reducibility. 

Even in its solution, the Four-Color Conjecture remains an enigma, for the funda- 
mental novelty in the Appel-Haken proof is the unprecedented use of a computer to 
establish a mathematical theorem. In the years since the conjecture was proven, other 
important results have been obtained with computer aid. But as the rst instance of such 
a proof, the coloring-proof provoked considerable controversy within the mathematical 
community. Because it is currently impossible to verify the correctness of the argument 
without a computer-facilitated analysis, there is a tendency on the part of many mathe- 
maticians to mistrust the whole thing. It cannot be ruled out that a short and convincing 
proof of the conjecture may yet be found, but it is just as conceivable that the only valid 
proofs will involve massive computations requiring computer assistance. If this is the 
case, we must acknowledge that a new and interesting type of theorem has emerged, one 
having no veri cation by traditionally accepted methods. Admitting these theorems will 
mean that the apparently secure notion of a mathematical proof is open to revision. 

Aside from the philosophical question about the method of proof used, a deeper 
quandary is facing modern mathematics: as the eld becomes increasingly complex and 
specialized, the evaluation of lengthy and highly technical proofs becomes more dif cult. 
(The existing proof of the classi cation of nite simple groups runs to 10,000 journal 
pages spread across some 500 separate articles.) A case in point is the recent, somewhat 
controversial, proof of the Kepler Conjecture. 

In a Latin booklet of 1611, entitled A New Year’s Gift—On the Six-Cornered 
Snowflake, Johannes Kepler posed a problem in solid geometry that has remained open 
for nearly 400 years. It concerns how densely a number of identical spheres (that is, 
spheres of equal radii) can be tted together in a given container, say, a large cubical 
box. No matter how cleverly they are arranged, there will always be some wasted space 
between the spheres. The most familiar arrangement is that seen in piles of oranges in 
fruit stores, or in stacked cannonballs on war memorials. This pattern is usually called 
shot-pile packing. In shot-pile packing a box, where successive layers of spheres are 
added in the gaps in the layer below, the spheres will occupy 2/18 ~ 0.7405 of the 
total volume of the box. Kepler asserted, without proof, that this type of packing is “the 
tightest possible so that in no other arrangement could more spheres be stuffed into the 
same container.” That is, the space Iled by any other packing of equal-sized spheres can- 
not exceed z7/+/18 of the volume. Despite the strong intuitive appeal of shot-piling as the 
most ef cient method of packing, the Kepler Conjecture de ed all attempts at resolution 
until a proof was announced in 1991. Much heralded, it was taken to be “without doubt 
the mathematical event of 1991.” But there is increasing dissatisfaction with the lengthy 
argument owing to gaps in the reasoning and missing details. If we regard mathematical 
proof as a clear, indisputable process, then the centuries-old problem was yet to be settled. 

Finally, in 1998, Thomas Hales of the University of Michigan provided a rigorous 
solution to the vexing geometric question. Hales had spent 10 years developing a 
computer-assisted proof, which totaled more than 250 pages. His approach was to 
reduce the problem to the analysis of an equation consisting of 150 variables, each of 
which describes a position of a sphere. The challenge was to show that no combination 
of variables (and hence, arrangement of same-sized spheres) would produce a tighter 
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packing than achieved by shot-piling. Extensive computer calculations were required to 
rule out the vast number of possibilities. 

Another memorable accomplishment involved a modern version of the ancient Greek 
problem known as “squaring the circle.” In 1925, the mathematical logician Alfred Tarski 
asked whether there is a way to cut up a circular region into a nite number of pieces 
which can be reassembled to form a square with the same area; no point of the circle can 
be lost and none of the pieces can overlap. The problem is less restrictive than the antique 
classical constructions which required that only straightedge and compass be used. In 
1989, the Hungarian mathematician Miklos Laczkovich proved that a solution to Tarski’s 
circle-squaring problem is theoretically possible, provided that the pieces are suitably cho- 
sen. More surprisingly, his 40-page proof showed that in assembling the square, the pieces 
do not even have to be rotated into place, just slid together. The pieces themselves—about 
10°° are required—are extremely complicated, almost unimaginable in shape. 

In many respects, the twentieth century was a golden age for mathematics. With more 
mathematicians than ever before at work, the subject was transformed by its unparalleled 
growth. Entirely new lines of investigation sprang up in almost bewildering profusion, 
touching upon such topics as distributions, chaos, categories, fractals, wavelets, Markov 
chains, Penrose tiles, super-strings, and so on. Some of the allegedly “purest” areas 
of mathematics found unexpected application in other disciplines. Group theory became 
central to crystallography and particle physics; results on prime numbers formed the basis 
for public-key cryptosystems; knot-theoretic topology was employed in molecular biol- 
ogy; and nite elds were indispensable to the design of error-correcting codes. (Coding 
theory has nothing to do with secret codes, rather with the transmission of information 
over “noisy” channels.) The twentieth century inherited from its predecessors a number 
of elusive, longstanding problems whose solutions had been vainly sought. Among its 
crowning achievements was the solution of three celebrated challenges: Fermat’s Last 
Theorem, the Four-Color Problem, and the Kepler Conjecture. Despite resolute efforts, 
the continuum hypothesis, the Goldbach Conjecture, and the Riemann Hypothesis all 
remain to engage the attention of the current century. 

Along the road to discovery, there will always be tantalizing statements whose proofs 
are later seen to be awed: however gemlike mathematical truths may be, research is but 
a human endeavor. Such episodes must be expected and do not detract from present-day 
scholarship, which is as vigorous and innovative as in any other period. Mathematics is 
not a completed structure but an evolving one, in which famous old problems are being 
solved and unexpected discoveries are opening new possibilities. Each generation adds 
another chapter to the unending story of mathematics. 
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13.3. Problems “ ua={[ c 


matrix addition and multiplication. 


, ]lestvevde 2], with 


1. Verify that the following sets form rings under the (d) 24 = {0, 1, 2, 3}, with addition and 
indicated operations: multiplication modulo 4. 
(a) 2, (the even integers), with ordinary addition 2. In which of the rings of Problem | is multiplication 


and multiplication. commutative? Which of them have an identity 


(b) Off] = {a+ bila, b € O;i? = —1}, with element for multiplication (that is, an element e 
complex addition and multiplication. satisfying a-e = e-a =a for all a)? 
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3. For elements a, b, and c of an arbitrary ring R, 
establish the following equalities: 


(a) a-0=0-a=0. [Hint: Note that 
0+a-0=a-(0+0).] 

(b) a-(—b)=(-a)-b=—(a-b). 

(c) (-a)-(—b)=a-b. 

(d) a-(b-—c)=a-b—-a-c. 


4. Prove that the set Z, is an ideal in the ring Z of 
integers. 


5. In 1903, Leonard Dickson gave the following 
de nition of a e ld, abstracting from the earlier 
efforts of Dedekind (1871) and Weber (1893): 


A eld F is a commutative ring with identity 
in which each nonzero element has an inverse 
under multiplication. 


Conrm that the following sets are elds under the 
indicated operations: 


(a) Q with ordinary addition and multiplication. 
(b) O[V2] = {a+ bv2Ia, b € Q}, with ordinary 
addition and multiplication. [Hint: For 
multiplicative inverses, solve the equation 

(a + bV2)(x + yV2) = 1 for x and y.] 


©) r={/ a b 


ae a,b, € Of, with matrix 


addition and multiplication. 
(d) Zs = {0, 1,2, 3,4}, with addition and 
multiplication modulo 5. 


6. Prove that the only ideals ina eld F are {0} and F 
itself. [Hint: If the ideal 7 4 {0}, pick an element 
04#a€e/ and use the fact that a has a multiplicative 
inverse. } 
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TheGreek Alphabet 


The Greek Alphabet 


Letters Names Names 
A a alpha iota 

B B beta kappa 

r y gamma lambda 
A 5 delta mu 

E € epsilon nu 

vd c zeta xi 

H n eta omicron 
(3) 0 theta pi 


Letters 

Pp 
x o 
ir T 
2 @ v 
od’ 6 
x x 
vow 
Q (23) 


Names 


tho 
sigma 
tau 
upsilon 
phi 

chi 

psi 


omega 
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Solutions 
to Selected 
Problems 


SECTION 1.2, p. 18 


jpnenan 9999%. 


NANAN 


199 9OORR(. 


1. (a) 


= 


c) 


= 


{| 


2. (a) 648.  (b) 140,060. 


3. (a) 99 

1 999 SBR 
© |JaO 

[11 
5. (a) ~P 5: o X49 e 
WP AYUND 


6. (a) 1234. (c) 55,555. 
(c) Bawéy. 


= HHH fe A444 i | 


7. (a) Xd. 


= 


= 


9. (a) 2756. (c) 2977. 


762 


9 119 9999828776, 


9 XP HHHH F 4444 ji 


°F MMxXxxX > [PHHH II 


"0 x HHH 
© XHHHa Pill 


11. (a) MCDXCII.  (c) MCMXCIX. 
(e) CXXMMMCDLVI. 


12. (a) 124. (c) 1748. (e) 19,000. 
13. (a) CMXIX. (c) LXX. (e) XC. 
SECTION 1.3, p. 28 


1. (d) 1234 = 20,34 = < < <  & | aa 


(e) 12,345 = 3,25,45 = 
TTT <q TTT 4dqirt 
TY q $y 


(f) 123,456 = 34,17,36 = 
4. TTT Qt TTT. 
T TYY A<drit 


2. (a) Among other possibilities, 


< TTT <<< TIT ™ 
T <= TTT 


3.2=0;10. $=0;640. += 0;12. 

1 . 1 _o. SD). as fhe 

4 = 0;230 4 =0;1,30. = 0;25. 
4. (a) 5025. (b) 122. (c) &. (d) 833. 


5. 12,3,45:6. 


6.(c) 1066=—O LT. 


Solutions to Selected Problems 763 


@ s792= [|b TT = Il 17. a $4, Oye (c) 7. (d) 13 or 19. (©) 6. 
@ 123.456=— I] = |] = T 
7. (a) 666,666. (b) 7725. (c) 123,321. (d) 9,623,088. gEcTION 2.3, p. 51 
i, 4 
2.@2. e+o4—, 
0 LTT ee 
) 4) ]| TT: (@S+e+5. @ R845. 
(c) —= 1} =]: 14545 
10. (@) 236= A (@) 1606 = — 04. 3 et, 


tt 1.1. 1 1 


2. A a te ae 
a a eae ee ee 
Te DE 65 : 


+ 15 75°65 39 -:195 
\ 2 1 1 


85 51° 255° 
11. (a) 83. (b) 470. (c) 29,005. (d) 5634. 


2 1 1 
7. (b) — = 
2% 0) a? 
2 1 ‘ 1 2 1 : 1 
75. 50 150°99 66 198° 
10. 2/n =1/4-1/n +7/4-1/n. 
a 5a Ba 1 a% yi) abated 
F “15 15 5/2 5 6 
=< ee lee (co ee ee ee 
= 49 49 49 q° 38° 196° 
e@ 
pe) —+49(2)-- edt 
— is) 35.35 (=) sta tpt ats 
; eo 
_— = 3 1 1 1 
12. (a) Possible answers are: = = —+ —~+ —; 
ee 7 4 7 28 
(e) 123,456= 22 4 1 1 7 1.1 1 
- 1s 47 60°29 5 29 145 
e 7 
— @ ote es Aly 
e 7 3 1 231° 15 4 60’ 
— 7 41 A 1 
14. (a) 93,707. (b) 1,086,220. (c) 5,832,244. 99351957 5° 
15. (a) ts! (©) == (c)2se 13. If m+ 1 =nk, then n/m = 1/k + 1/km; 
os ss 2 ft. 1 


5 
Een | es pedal ee eam 
1307 “2° 7 26 91° 
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meets ee = 

5S 6 30) — 
1d ” tid \ Ld 
6 30 7 42 31 930 
2 1 12 ~«21 1 

16, = S43 SS Spe, 
ll 6 66°17. 9 153 


17. The sums 2+6=3+4+5=1+4+3+44=8 yield 
2 1 1 1 1 1 1 1. 
Ts 10.30 12720 15 20° 60° 
whereas 2+4-+6+ 12 = 24 gives 
2 1 1 1 1 
43 43 86 129 * 258° 

18. The sum 15+ (6+ 3) = 24=2.- 12 yields 
a ee ee 
iG 2 a ee 
43+ (18+9+4 2) = 72 = 2-36 gives 
2 1 1 1 1 


43° 36° 86 172 774° 


1 11. 1 1 


1 1 

2° igo’ s' a 5 lo 3 IS 

20. i047 104+ 4 4692: 74 
: 3 anes oa a. 


19. 


22,.1+-+—. 


23. 12+ = + : + : 
. 3° 42 «(126° 
24. (a) Solve the equations 


x+(x+d)+ (x + 2d) + (x + 3d)+ 


(x + 4d) = 100 
1 
x+(x+d)= al@ + 2d) + & + 34) 
+ (x + 4d)] 
; ; 0 d ae) 
x=—,d=—. 
re 6: 6 


1 1 
(b) Because 1 + (6 + ;) + 12+ (17 + ;) 
2 : 
+23 = 60, and 60 (: + =) = 100, multiply 
1 1 
each of 1, 6+ > 12, 17+ > and 


2 


SECTION 2.4, p. 61 


1. (a) 640 cubic cubits. (b) 20 square khets. 5 + zr 


Solutions to Selected Problems 


1 
2. (b) 35. 


3. ar? /(2ryP = = implies 7 = as 
= ie ae 
4. Aryabhata’s rule gives the correct area. 


6. The Babylonian formula gives V = 180, as compared 
with the correct value of V = 56m © 176. 


Sop 
V=—. 


SECTION 2.5, p. 71 
1. (a) oe 19(0;4) = 1;16 
- (a i= 34) = 1516. 
; = 5(0;20) = 1;40. 


10 
J = 10(0:6, 40) = 1;6,40. 
10 10-60 10-20 10-20-20 


(b) 9 9.60 3-60 607 ™ 
35 33 
3. x = — = 4:22,30, y = — = 4;7,30. 
8 8 
4.x =8, y=2. 
5.x = 18, y=6. 
6 


~(a)x=8 y=2. (bd) x=7,y=3. (c)x=5, 
P=. 


T= 15° S 12; 

8. x = 0;30, y = 0;20, z = 6. 
9. x = 0;30. 

10. x = 14. 


11. The sides are 30 and 25. 
2 

12. x = = = 0;40. 
3 


13. All parts have x = 30, y = 20 as a solution. 


SECTION 2.6, p. 80 

1.12. 2. 768. 

3. x =20, y= 12. 

4.x = 18, y= 60, z= 40. 

5. b; = 10, by = 5, 8; = 82 = 20. 
6. d? © 1782.7, d? © 1701.6. 


17 9 33 
7 oe cle ee ier 
51,841 32,257 
8. V720 & + 26.83, 63 & ~ 7.93. 
1932 064 


Solutions to Selected Problems 


SECTION 3.2, p. 103 
1. 8t, + 1 = Sonat. 
3. ot, + l= Bn+i- 
4. (a) 56=55+1. (c) 185=91+ 66 4 28. 
6. 1225 = ty = 535, 41,616 = hes = 8294. 
7. (a) oO, =n(nt+1)=21+2+---+n). 
(c) o,t+r=n(nt+1ltn?=2n? +n 
_ 2nQan+1) _ 
= 5 = 
(ec) n? +20, +(n+1) 


=n? 4+2n(n+1)+(n+1/P 
= 4n? +4n+1 = (2n+ 1). 


ton. 


8 9=0°4+346=27427?+1, 
81=0? 43478 = 574+ 17 +55. 
10. [nv — 1) +1] 4+ [nm — 1)4-3]4+--- + [n(m — 1)+ 
(2n — 1)| 
=n[n(n —1)]+[14+34+---+(2n—-1)] 
=nr(n—1)4+n? 
3 


=n. 


Ul. (a) [14+24+34+---4+@—-—Dt+n]+[m—-1)4+--: 
+34241] 
n(n+1) (n—1)n 
eg 2 
=n’, 
(c) 1-242-343-4+4---+n(n+1) 
= (1? +1)4+ (2? +2)4+ 3? +3)+---+(007 +n) 
= (1? $2? 3? pees n?) + (124 34 
---+n) 
_ n(n t+12n+1) 


n(n + 1) 
6 2 
_ n(n +11 +2) 
An ce 
(ec) 122433459 +---+(2n-—13 = 


2. 
joo] — [24446 +---+ Qn) 


=n?(2n+1P —8(17 +23 439 4+---+n3) 
n(n+1)/° 
2 


=n(2n+ 1) -3| 
= n2(2n? — 1). 


12. (a) (a+d)+ (a+ 2d)+(a+3d)+---+(a+nd) 
=na+(14+24+3+4+---+n)d 
n(n + l)d 
ane 
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=(Ceoren| 
=a 


14. If n is odd, say n = 2m + 1, then 
(Ath)t(b+t)+-:-+Grotth-Dtth 
2. 1)(2 2 
= PEP 4.4 mp 4 SUF Dems? 
=4(1° +2? +.---4+m?)+ Qm4+ 1\(m +1) 


m(m + 1)(2m + 1) 


=4 + (2m + 1)(m + 1) 


6 
_ na+ D+ 2) 
=. 
16. = ee _ nein +2)] _ 


+1 


Ina + I) +7] 
= 7F (t, +n). 


4497 a = 


nat), E —1)n(n+ | 
a 6 


1 n—-1 
=nn +n] 5+ 5 


n(n + 1)(2n + 1) 
6 


= 1742?43?4.---4n7. 


SECTION 3.3, p. 117 
2. (12,5,13), (8,6,10). 
4. x7 +(x +1) = (x +2) implies that 
(x — 3) +1)=0. 
5. (b) (3,4,5), (20,21,29), (119,120,169), (696,697,985), 
(4059,4060,5741). 


6. (a) 1<7/5 < 41/29, 3/2 > 17/12 > 99/70. 
(b+) 2-6/2" =-1/4, 2 =(7/5Y = 1/25, 
2 — (17/12)? = —1/144, 2 = (41/29)? = 1/841. 
1G) 4=t = 10 = 76 =O ee = I, 
yy. = 3, ¥2 = 17, ¥3 = 99, v4 = 577, ys = 3363. 
(c) y? — 2x? = 1 implies that 


Vn [Xp = 2+ (/xn). 


8. (a) 2, 1.5, 1.41666..., 1.41422.... 


10. (a) x; = 3, x» = 8, x3 = 22, x4 = 60, x5 = 164, 


x6 = 448, 
haN= Ds 2 = 14, wB= 38, MF 104, M= 284, 
vo = 776. 
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11. 


12. 


13. 


15. 


16. 


17. 


19. 


22. 


23. 


(b) 5¢°-3-32=-2,7-3-P=1. 
5 7 19 26 71 97 
3°4° 11° 15°41 56° 
2651 1 1 1 \? 
(bt) —~=—(26-—)=—,/(26-—) < 
153.15 51 15 5] 


£V20-—1= V3. 


(d) 


2, q 235! 
a) —. = 
15 780 
106 99 119 127 
b) — <v50 < —, — < V63 < —, 
) 5 * “4 6° 6 
147 156 
— <v75 < — 
17 18 
(ee ” aul. a eed ea aad 
— = — and — = - glVv a =¢¥ ani 
iD ao BD 
b?>=c-BD 


1 : 1 1 
(b) = (5) + -ab= ou (=) implies that 
+ 


1 51 1 D5. hos 
34 + by = had + had + 3° implies that 


+h =’, 
AE AE AH OH-a 


a 

b BC BD HB b—OH’ 
2 2 

(a) way = () -(*S). 

(b) (2Vay =(n +12 —(n- 1. 


If A is the hypotenuse of the nth triangle, then 
h? =(/ny +17. 


SECTION 3.4, p. 127 


1. 


4. 


area | _ AB? AB? 1 


areall AC2 2AB2. 2° 
it follows that 


Because 


area lune = area semicircle on AC— area I 
= area semicircle on AC — 2 area I 
= area AABC. 


The equation x* = (ay)? = a*(2ax) gives 
x3 = 2¢3, 


Solutions to Selected Problems 


SECTION 3.5, p. 137 


1. If AK < AG, then with A as center and AK as 
radius draw a quarter circle KPL. Let FK 
perpendicular to 4D intersect the quadratrix at F’; 
join AF and extend it to meet the circumference BED 
at E and the circumference KPL at P. Reasoning as 
before, it follows that FK = arc PK; hence 
4AK -FK= 4AK -arc PK, or area triangle AKF = 
area sector AKP, a contradiction. 


2. (a) In polar coordinates, the de ning property 


LBAD AB 
—— = — of the quadratrix becomes 
LEAD FH 
a/2 a 
6 rsing’ 
. . 2a 1 2a 
Oa ae | ae 


0 


3. The similarity of triangles FBA and FBE implies that 
x oa 


box 

4. First obtain a right triangle whose area is equal to that 
of the circle; then construct a rectangle equal in area 
to the triangle. Now, use Problem 3 to construct a 
square whose area is that of the rectangle. 


6. In polar coordinates, the de ning property of the 
limacgon is r —2cos@ = 1. 


: x .. 
Since r = \/x? + y? and cos@ = —, this becomes 
ie 


r?—2x =r or (r? —2x¥ =r’. 


SECTION 4.2, p. 168 


1. Triangles DAB and CBA are congruent by the 
side-angle-side theorem; hence, 
LDBA = LCAB = LCBA, which contradicts Common 
Notion 5. 


2. a+ B = 180° =6+y implies that w = y. 


3. Because ZABC < LACD by the exterior angle 
theorem, it follows that 
LABC + LACB < LACD + ZACB= 180°. 


4. Triangle ABD is isosceles, hence ABD = ZADB. 
Applying the exterior angle theorem, 
LABC > LABD = LADB > LACB. 


5. Triangles GBC and DEF are congruent by the 
side-angle-side theorem; hence, C = ZF = /BCG, 
which contradicts Common Notion 5. 


Solutions to Selected Problems 


8. 


11. 


Because, in area, ABE = DCF, it follows that 

ABCD = ABGD + BCG = (ABE — DEG) + BCG 
= (DCF — DEG) + BCG = EGCF + BCG = 
EBCF. 

By Problem 8, it follows that, in area, ABDE = ABKH = 


BLSR, and ACFG = ACJH = RSMC. Hence, 
ABDE + ACFG = BLSR + RSMC = BLMC. 


SECTION 4.3, p. 182 


1. 


(a) Ifal|b, then b = ar for some r; hence, 
bc = a(rc) or albc. 

(c) Ifac|bc, then bc = arc for some r; hence, 
b =ar oral|b. 

(e) Ifal|b and cld, then b = ar and d =cs for some 
r, s; hence, bd = (ac)(rs) or ac\|bd. 


.66=5+4+61=7459= 13453 = 194+ 47= 


23 + 43 = 29 4 37. 
96 = 7+ 89 = 134+ 83 = 174+ 79 = 23473 = 
29 + 67 = 37+ 59 = 43 + 53. 


~ 51 =4742-2,53 =4742-3,55=41+42-7, 


57 = 353. +.22,:59'= 53 2233 ces 


. 85. 
. If n? — 1 = (n — 1)(n? +n +1) is prime, then 


n-l=l1. 


7. 11, 13, 15, 17; 101, 103, 105, 107, 109. 
$6 S17= 11, 12 = Wa 11; 1 = 29 11, 


10. 


11. 


12. 


13. 


15. 


16. 


24 = 31-7, 30 = 37-7, 36 = 43 —7,.... 


~ 29 = 234 194+ 17-13-11-7-—54+342+4+1, 


37 = 31 — 294+ 23 —- 19+ 17—-134+114+74+5+4+ 
34+2+1. 


(a) Ifm = 2k, then m? = 4k’; while if m = 2k — 1, 
then m? = 4(k? +k) +1. 

(c) Ifm=6k +1, then m? = 123k? +4) +1; 
while if m = 6k + 5, then 
m? = 123k? +5k4+2)+1. 

If a = 3k, then 3a; if a = 3k + 1, then 3|(a + 2); if 

a = 3k +2, then 3|(a + 1). 

In any case, 3ja(a + 1)(a + 2). 

If a = 3k +1, then a* — 1 = 3(3K? + 2h); while if 

a = 3k+2, then a? — 1 = 3(3k* +4k +1). 

If 2\(a + 1)° — a’, then 2|(2a + 1); hence, 

2|(2a + 1) — 2a, or 2|1, a contradiction. 

gcd(143, 227) = 1, gced(136, 232) = 8, ged(272, 

1479) = 17. 

(a) gced(56, 72) = 8 = 4- 56+ (—3)72. 

(b) ged(119, 272) = 17 = 7- 119 + (—3)272. 


17. 


18. 


19. 


20. 


21. 
22. 


23. 
25. 
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If dja and d|(a + 1), then d\(a+ 1)—a, so d|1; 
hence, d = +1. 


Because both 3 and 8 divide a(a + 1)(a+ 1)(a + 2), 
with gcd (3, 8) = 1, it follows that 3 - 8 divides this 
product. 

If pla", with p prime, then pla; hence p”|a”. 

(b) 1234 = 2- 617, 10140 = 27-3 -5- 137, 

36,000 = 2°. 37-53. 

(a) 17 and 257. (b) 7, 31, 127, and 8191. 


Because 3p = (a + 1)(a — 1), with p prime, either 
3\a +1 or 3ja — 1. But 3|a + 1 leads to a 
contradiction. If a — 1 = 3k for some k, then 

3p = Bk + 2)(34) or p = (34 + 2)k; it follows that 
k=1and p=S. 

6=13-7= 19-13 =29—-23 =37-31=.---. 
Assume that ,/p is rational; that is, /p = a/b for 
relatively prime integers a and b. Squaring gives 

a’ = pb* whence p divides a”. Because p is prime, 
this is impossible unless p also divides a; and thus p? 
divides a”. As b* = a*/p with a” divisible by p’, it 
follows that p divides b?. But then p will divide b as 
well as a, a contradiction. 


SECTION 4.4, p. 192 


2. 


Because 4B = ADsinB, AC = ADsina, 

BD = ADsin(90° — B) and CD = ADsin(90° — a), 
Ptolemy’s theorem becomes 

ADsina- AD sin(90° — B) = 

ADsinB - ADsin(90° — a) + BC- AD. 


. Ptolemy’s theorem implies that 


AB-PC=PA-BC+PB.:-AC, 
where AB = BC = AC. 


17 
. x 3—— = 3.14166.... 
(b) x 120 6 


(c) V3 © 1.73205... 


6. Take d = 0 to get k = /s(s —a)(s — b\(s —c). 
8. AX = XP=XD. 


SECTION 4.5, p. 208 


1. 


(a) If, is the radius of the base of the cylinder and 
h is its height, then its surface area equals 
s = 2nrh = rx’, where 
h x 


x 2r’ 
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(c) If, is the radius of the base of the cone and s is 
its slant height, then 


5nr)s e 
mr 
(e) If, is the radius of the sphere, then its volume 
equals 
Ko= ae = qt Jr =4/. 
3 3 


. If the sphere has a radius 7, then 


(a) Ve = (r?)(2r) = ; ($2) = =h, 
(b) Ac = Qmr)(2r) + 2Gtr’) = (4x7?) = 54. 


3. AB+ BF=DB+BF=DF=FC. 


6. The area of the “shoemaker’s knife” is 


10. 


11. 


45 4 = Co Lee = 
8 8 8 


(4B? =A =~CRy= = PC. 


. Because PRCS is a parallelogram, its diagonals PC 


and RC bisect each other. 


. Because CD is bisected at O, it follows that 


AD? + AC? = 2(CO* + AO?) and 
PQ=AO+OD=AD. 

Therefore, 

AB? + CD? = 4(AO? + CO?) = 
2(AD? + AC?) = 2(P QO? + AC’). 

This implies that the area of the “salt cellar” is 


1 


A= 4B? +2 cD? 2(Z4c) =p 2 
~ 8 8 8 ~ 4 o. 


. Note that 


4a°x3 


3 


Let LPOA = B, so that P = (af, 8). Then 


QO= (4. s) and R= (=. p). The circles with 


center O and radii OQ and OR meet the spiral in the 


2 2 
points V = (4. *) and U = (=. ). Hence, 


1 2n iT 2n 
2 2 
a=5/ r-dd=-— (a0) dé = 
2 Jo 2 Jo 


B 
LVOA = =. 
5) 


Wa 


The point P = (=. =), while O = (0, 0), hence 


ma 
OP=—. 
2 


Solutions to Selected Problems 


12. In polar coordinates, the equation of the tangent to the 
spiral r = aé at the point A = (27a, 27) is 


r(2x cos@ — sin@) = 47a. 


3 
The tangent intersects the line 6 = = at the point 
3 
B= (470 =), 
2 


SECTION 5.3, p. 231 
1. 15, 5, and 25. 
2. 30, 25, and 35. 


ir 23\7 
5. If a = 3, then the squares are (Z) and (>) 5 


35 
6. If 12+ x? = (x — 4)’, then the number is a 


1 
7. If x7 + 4x +2 = (x — 2), then the numbers are z 
d-. 
an ri 
8. If 10x + 54 = 64, then the numbers are 1, 7, and 9. 


9. If x + (4x + 4) = (4x — 5)’, then the numbers are 9, 
328, and 73. 


10. 8 and 2. 


11. If 8x(x? — 1) + (x? — 1) = (2x — 1), then the 
27 


11 
numbers are cre and 69" 


12. If (x? + 4) — 4x = 64 = 4°, then the triangle has sides 
40, 96, and 104. 


14. (a) x =20+91, y=—15—7t. 
(c) x =176435t, y= —1111 — 2212. 


15. (a) 
(c) No positive solutions. 


x=ly=6. 


17. 28 pieces is one answer. 

19. One answer is 1 man, 5 women, and 14 children. 
20. 56 and 44. 

21. 59 is one answer. 

22. 119 is one answer. 


23. 1103 is one answer. 


SECTION 5.4, p. 237 


1. (a) Because r(ar? + brs + cs*) = —ds’, it follows 
that r|ds*; hence, r|d. 


Solutions to Selected Problems 


2. 


@ -3. 
I J 1 
aS ar 


(e) No rational solutions. 


. (a) The equation 8x3 — 6x — 1 = 0 has no rational 


roots, hence cos 20° is not a constructible real 
number; it follows that an angle of 20° is not 
constructible. 


SECTION 5.5, p. 263 


1. 


na & YB NY 


20. 


(a) x*+8x =9 implies that x? + 8x + 16 = 25; 
hence (x + 4)? = 25 orx + 4=5, yielding 
cS, 

(d) If 3x*+ 10x = 32, then 9x” + 30x = 96 and so 
y? + 10y = 96, where y = 3x. It follows that 
y? + 10y + 25 = 121 or (v +5) = 121. Hence 
y+5=11, yielding y = 6 and x = 2. 


- (a) 12. (b) 24. (c) 7 and 3. 

. (a) 7 and 4. (b) 6 and 4. 

. 17,296 = 24. 23 - 47; 18,416 = 24-1151. 

. For one way, use three corner triangles and the 
leftover square. 

. Two men. 

.16. 1.4 and6. 12.3 or 2. 

. 2 and 8. 

. (a) 666 = paces. (b) 250 paces. 


. (a) 9 people, 70 wen. 


(b) 353 ounces gold, 294 ounces silver. 


. 60 days. 


. (a) side is 60 pu, diameter is 20 pu. 


(b) sides are 40 and 5 pu, diameter is 100 pu. 


(c) 268. 21. 123. 


SECTION 6.2, p. 285 


1. 


3% 


4. 


1 
18 and 32 feet. 2. NS denarii. 


The system x +23 =2y 
y+23 =3z 
z+23 = 4x 


has x = 9, y = 16, z = 13 denarii as a solution. 


(2n + 5° = (2n +37 + [(2n +3) — (2n + 1)7]4+ 8. 


10. 


14. 


15. 


16. 
17. 


7169 


. (a) Let x = 5 and y be any square such that x + y 


and x — y are also squares. 


. (a) Ifa’, b?, and c? are three squares in arithmetic 


progression, with common difference d, then 
2 


>I oe is a solution. 


(b) 324+424 12? = 132. 


. 481 = 157+ 167 = 20° + 9?. 
-(a) Ifc?+d* =k’, then 


(a? + b’)(c? + d*) = u? + v? implies that 
2,42_ (4¥\ v\? 
ore oe +). 
39 2 
b) 61=({— =). 
m o=(3) +) 


If u = 7, then t = 47; hence, the amounts would be 
33, 13, and 1. 


x = x(a + Ja? — 4(ca)/b) = 8, 
y = 4a — Ja? — 4(ca)/b) = 2. 
x= 3-1) + V@b+1I)-a@)]=7, 
y=t(a+)—- JQb+1—a@] =3. 


x =(ab—c)/(b+1)=4, y=(at+o)/(64+)=8. 
x = JVabe = 12, y = ./(ab)/c = 3. 


SECTION 6.3, p. 292 


1. 
2. 


50= f+ Fo) 4+ Po, 75 = F3+ F5+ Fo 4+ Fro. 


(b) 
Pot Pate- + Foy = (Fit fot Fy +--+ + Fan) 


— (Fi + F3 + +++ + Fan-1) 
= (Fong2 — Fon) — 1 = Fangs — 1. 


3. 7| Fs, 11 Fo. 
4. Ifd = ged (Fy, Fy-1), then d|(Fi41Fn-1 — F?) or 


d\(-1)". 


5. ged (Fis, Fao) = S. 
6. If Fi, |Fin, then ged (Fi, Fm) = Fi; by problem 5, 


n= gcd (n,m), hence n|m. 


.(a) If2\|F,, then F3|F,, and so 3|n. 


(c) If4|F,, then 3|n; since 2|n also it follows that 
6|n. 


SECTION 7.3, p. 326 


1. 


(a) 1,2,3. (c) —2, -2+4+ 4/2, —2 —4V2. 
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» (a) 2/4 - 22. (0) 2+ 925- V5. (©) 8. 
2, -2+4/11, —2— 4/11. 

6,4. 6. —48, 52. 

44 a = 16 — /16. 

. 14— 26/11 — 2/13 — 2/11. 

x = —44 190 + 35,757 + /190 — /35,757. 


pew 2107 peu 2107 
10. x 


3 og. J1960 g 413 | /1960 


oS ee 


11. x 


12. Ifx = 2 — y, then x3 + 81x = 702 becomes 
yet 7023 = 27°, with y = 3 as one solution. 
Ify= oe z, then y? + 6y = 7 becomes 
Zo 4 123 = 8, with z = 1 as one solution. 

13. 6. 


14. (a) —6. eee 


1 
(c) y = 7 has y = 3 as one solution. 


SECTION 7.4, p. 334 


1. (a) [2D Venlo ky Ves 


() 34/75 34/7 
Cc a aa 
2 2 
‘ 14 /-3 -34/-7 
=. 


es ee ee 


5. 10 (5+ - f3). 
6. 2 


SECTION 8.1, p. 361 

1. (a) 32 ducats. (b) 375 paces. (c) 32 days. 
2. 80 a 

3. 24,000 men. 
4. 28 beggars, 220 lire. 
5. (a) 36 years. (b) 312 orins. 
9. 9,999,995.11111. 
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SECTION 8.2, p. 380 
. The triangles ABC and DBE are similar, so that 


BA B 1 b 
= o or -= ; hence, BE = ab. 
BD BE a BE 


. The equation 


xt4+x2743x4t+1l=(4+ la? —2x?4+2x+1)=0 


has no positive roots, which implies that 


x? — x? + 2x +1 has no positive roots. 


. The equation x° + x° + 2x4+x3-1= 


(x + 1)? + 2x3 — x7 +x — 1) = 0 has just one 
positive root. 


. Because f(x) = x?” — 1 has one variation in sign 


and f(—x) = x" — 1 also has one sign variation, 

the equation 76 (x) = 0 cannot have more than one 
positive or more than one negative root. But 1 and —1 
are clearly roots, so that there are 2n — 2 complex 
roots. 


- (a) f(x) =x7+3x+4+7 has no variations in sign 


and therefore f(x) = 0 has no positive root. 
Now — f(—x) = x3 + 3x — 7 has just one sign 
variation, hence f(x) = 0 may have one 
negative root. It must therefore have two 
complex roots. 


. (a) Because there are no variations of sign in either 


f(x) or f(—x), there are no real roots of 


f(x) =0. 


.(a) xt — 3x? + 6x — 2 = (x? + 2x — 1)(x?- 


2x + 2), hence the roots of the quartic are 
-14J72,14/—1. 

(b)  x*— 2x? — 8x — 3 = (x? — 2x — 1)(x?4+ 
2x + 3), hence the roots of the quartic are 


1472, -1+ /—2. 


SECTION 8.3, p. 408 
1. (a) Substitute x = 1 in the given series, recalling 


that tan(z/4) = 1. 
(c) The 10 terms produce z ~ 3.1471.... 


3. (a) The values of the ratios may be written as 


iss bie lee tas 
4°44 84 12°4 16° ° 
which implies that 

1342394 334+...4+n3 


L= lim ——.—.—— 
noo WM+ntnit.--n3 


oe Gee 1 
= lm{-+—)=-. 
noo \4 " An 4 
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3 5 


(b) Ifthe interval [0, a] is divided into n equal 3. 6=142-24+1=(4+v-3) 
: a 
subintervals, each of length a then 42 fi + V3) — 3) + (1 —- V=3) 
2 
by OOP (a GY Gy ey = (Vi+v=34 y1- v3) ; 
a‘ eS e+aea+a+---+a3 
1+) 
— 841423 4---+73 1 5. tog (=*) = topct +x) —tog(t =») 
= in ———_._ = -. l-x 
nom +4+n4---+n3 4 
ye 3 x 
; fe rai =(*-5+5-F+") 
4 | Jidx =aya- | xdx = ——. 
0 0 
2 3 4 
5. The volume of the solid is Paes caine ee 
; sake , 2 3 4 
: at ya a a na\4 a 
r= im |=(2) +a (4) Q+- EO See 


14424*+-.-+n4 wad 


= (a>) lim ———————. = —.. day 
a a a ow ae 


2! 
This follows from the limit: —1\(—2)(—3 

ae x DF coos 

n(n + 1)(2n + 1)Gn? + 3n — 1) 
pace 30(n + Int 4 ee CN (—1)°3!, 
rae eo Ore ae 1 2! 3! 
= lim — = =_ st )j=<-. 

noo 30 n nn 5 =—l—-x+3x?7—3x34..., 


Si alae Aa I eo 
2n—1 2n+1 3! 

2 y} 2 l i. ees 
“Ts "5-9 Gee oaaed) aio aa” * 3st Po 
— 2 + 2 aa 1 n 
=@-Me+1) @=16e-1) 4 ir (;) 

9. Note that 1 + = + Ae ae seillt = = 
a er ry 2° 2 ae (| 
(4n — 3)(4n — 1) = ls 
1 
l l 2(1-5) <2 
—2 ————d §—$—— ee 7 ; 
lsstteat 4 
11. x, = 2, x9 = 2.1, x3 = 2.094568... 
La (7 a | 12. (by xj = 1, x2 = 1.53393... 


— 1,26388...,x4 = 1.25993... 
2. 2P, —(n—1)P,-, =2n!—(n—1)(n— D)! 1 "4 


= (n — 1)![2n — (n — 1)] 
=(n—1)![14+ l]=n!4+ (1-1)! 
n(n + 1) 


=P,+ P,-. 1. (a) Woop epi t= en 


SECTION 9.2, p. 467 


xope.. 


772 


14243+---+n4+(+41) 
_ arnt I) 
a) 
n 
n+ (5 +1) 
(c) IF1-242-3+4---+2@40 


_ nv +DM+2) F 
a 


+(n+1) 


_ (nt In +2) 
=. 


hen 


1-242-34---tntn+)4(n4 Da +2) 


= OO Ft Nn +2) 


n 
(n+ In +.2)(5 i 1) 
— at DM + 2)(n + 3) 
ee 


(ec) If14+2-243-274.--.4n2"! =(—1) 
2" + 1, then 


142-243-2?4---+n2"'! 4 (n+1)2” 
= (n— 1)27+14+(n + 1)2” 
=2"[(n-1)+(v4+)]4+1 
=2n-2"4+1=n2"' 41, 
3. (c) Note that 


qnt2 ae 52ntl — 4(47+! + Sen) al 52n-1 (52 -_ 4); 
hence if 21|4"+! + 5?"-!, then 21/4742 4 52"41, 


aS 


. If 1 +22!) +--+ +n!) = (n + 1)! — 1, then 
14.22!) +--+ n(n!) + (n+ ln 4 1)! 
=(n+ DI!-14+(nt+ lint Dd! 
=(n+ DIL+@41]-1 

=(n+2)!—1. 


5. (a) If 2" <n!, then 
art 2.2" <2-nl<(n+ Dn! =(n4+ 1). 


6. (a) On _ 1-2-3.--(Qn) 

n nin! 
1-3-5---(2n — 1)n!2”" 
n!n! 
1-3-5---(Qn—-1 
Ee Ne 


n! 


Solutions to Selected Problems 


n 
n+1 n+1 n+1 
("o')#Cr ert) 


10. (c) (; +()+(Q)+- 


n! n! 
1. (a) —  — if and only if 
Gai aaonee 
(n—r—-DICF+)! <Mm—r)'r + Is; that is, 


if and only if (7 + 1) <(n—r). 


SECTION 9.3, p. 493 


1. Pr[6] = = 
. 1 ]= 36° 
2. Pr[9] = 22, Pr[10] = 2. 
216 216 
1 
3. 360° 


7 30 20 10 = 1000 
(35) (35) (az) = sia 


Solutions to Selected Problems 


1 1 1 
n+1 n n+1 


implies that 


1 a 1 ieee — 1” 
1-2 2-3 n(in+1) n+l n+l 
9. (a) ° (c) a); 
- (a) —. _ . 
Cie 
276 111,088 
11. (a) ——. (c)———. 
7716 117,649 


3 
12. 3. 13. (a) 33 dollars. 
14. (a) Fair. 
15 9 
15. When n = 4, Pr[A] = 4 and Pr[B] = —. 


16. Note that 


SECTION 10.1, p. 511 
2. o(m) = 2160(2"' — 1) 4 2048(2!! — 1). 


pl 
= 
p-l 


3. (a) Because l+p+p?+---+prt= 


k k 
p-l<p', 

ky 2 k-1 k k 
o(p’)=l+pt+pt---+p + p* <2p". 


(6) o(~qgy=1+pt+q+pq <2pq. 


7. Any even perfect number greater than 6 can be 
expressed as the sum of consecutive odd cubes; in 
fact, 27k (Q2k+1 = 1) = 3 + 33 + 53 Sone ote (Cam = 1 
for all k > 0. 


8. Because n? = 274-7 p*, where p = 2* — 1 is prime, 
a(n?) = (14242? +---+2* 7) + pt p’) 
= (Ve = 1)(27* = gk+l me ok ae 1). 
Thus, o(n?) + 1 = 2*‘N. 


9. If m and m form an amicable pair, then 
o(n) =n+m=o(m). 
The relation m = o(n) —n yields 
o(o(n) —n) = o(n). It follows that 
o(s(n)) = s(n) +n, or s(s(n)) = n. Conversely, if this 


773 


condition holds, then n and s(n) = o(n) — n form an 
amicable pair. 
10. o(1184) = (1+ 242? 4234244251 + 37) 
= 63 - 38 = 2394 = 1184+ 1210. 


11. Assume the contrary that p and n form an amicable 
pair. Then the condition 1+ p=o(p)=p+n 
implies that n = 1. Buto(1) =14p4+1=o(p). 


SECTION 10.2, p. 537 
2n? 
2. (a) Because ao 


i > 2, it follows that 


n n 

> 2, whence 
n—-1 n+l 
n n 
— 41+ > 3. 
n—-1 n+1 


3. et? = er? /2 _ (e/2)i, 


5. (b)  an(x —11)\(xX —1r2)-++(% — Tn) = 


ae 


nV 112 ++ +1, (—1)" (1 
1 x 
(ra) 

7. (c) (S)(48)(2) [ee 
2:2)\4-4) \6-6 In -2n 
ol 1.27.3) (3-4-5 
 (2"n!?? (= 1- *) (5 <3) 


- [ — 1)(2n)2(2n + | 


2n-n-n 


_ 1 Qn)!2n)\(2n +1) 
~ (2"nlp (2"n!)?? 


SECTION 10.3, p. 561 


1. (a) Because a = b (mod n), it follows that 
a—b=kn for some k. If m|n, then n = rm for 
some r. Hence a — b = k(rm) = (kr)m, and so 
a=b(modm). 

(c) Suppose a = b (mod n), so that a — b = kn for 
some k. Ifa =rd,b=sd, and n = td for some 
r, s, and tf, then (rd) — (sd) = k(td) or 
r—s =kt. Hence, r =s (mod f); that is, 
a/d = b/d (mod n/d). 
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2. 5° = 1? (mod 8), but 5 4 1 (mod 8). 


4. If a = | (mod 8), then a? = 1? = 1 (mod 8); ifa =3 
(mod 8), then a” = 9 = 1 (mod 8); if a = 5 (mod 8), 
then a? = 25 = | (mod 8); if a = 7 (mod 8), then 
a> =49 = | (mod 8). 


Ill 
i 


5. (a) Ifa =0 (mod 3), then a? = 0 (mod 3); if a 
(mod 3), then a* = 1 (mod 3); 
if a = 2 (mod 3), then a? = 1| (mod 3). 


6. 5° — 1 = (25)'8 —1 = (-1)8 -1=1-1=0 
(mod 13), hence 13]5*° — 1. 


7. (a) 10 +53 = 10(100)4 + (—2)3 = 3(2)4 - 1 
3(8)8 — 1 = 3 — 1 =2 (mod 7). 


8. 23 — 1 = 252° — 1 = (—15)5? — 1 = (-3)57 -1= 
(—3)31 — 1 = 0 (mod 47). 


9. (c) 52 43-2572 = 4r 4.3. 25-143 = 
4" 43-4157. 4! = 0 (mod 7). 
(ec) 25741 4 5742 — 232)" + 25(5)" = 
2-5" + (—2)5" = 0 (mod 27). 


10. If & is of the form 47 + 1, then 
2k-1 (Qk _ 1) = 22 . gan _ 1) = 4"(2 F Aen _ 1) = 
4(2-4— 1) = 4 (mod 6). 
A similar argument holds if & is of the form 4n + 3. 


13. (a) 9[/113,058 since 9|((1+1+3+0+5-4 8). 
14.x=9. 


16. (a) x =6, 13, and 20 (mod 21). 
(c) No solutions, since ged (13, 52) / 27. 


SECTION 11.1, p. 581 


1. Bolyai’s assumption that there exists a circle passing 
through any three noncollinear points is equivalent to 
assuming Euclid’s parallel postulate. 


2. Legendre’s assumption that every line through a point 
in the interior of an angle must meet one of the sides 
is equivalent to assuming Euclid’s parallel postulate. 


4. (a) Because triangles BAE and CDE are congruent, 
BE = CE. This implies that triangles BFE and 
CFE are congruent, whence BFE = LCFE = 
90°. 
(c) The base and summit have a common 
perpendicular, hence are parallel by Euclid’s 
Proposition 27 on alternate interior angles. 


5. (a) Because ABED is a Saccheri quadrilateral, it 
follows that ZADC > ZADE = LBED > BCD. 


10. 


11. 


Solutions to Selected Problems 


(c) Either 4D < BC, AD > BC, or AD= BC. If 
AD < BC, then ZC < ZD by part (a), a 
contradiction; if 4D > BC, then ZC > ZD,a 
contradiction. Thus, 4D = BC. 


. Triangles ACD and XZW are congruent, so that 


LCAD = LZXW; hence, BAC = LY XZ. 
This implies that triangles ABC and XYZ are 
congruent by the side-angle-side theorem. 


. (b) Consider a triangle POR, which contains no 


right angle. Because it cannot have more than 
one obtuse angle, it contains at least two acute 
angles, say, at vertices P and Q. Drop a 
perpendicular RS from R to the side PQ. Apply 
part (a) to the resulting right triangles PRS and 
ORS. 


Use the gure in Problem 4. Since 

LB < LBFE = 90°, Problem 5(b) implies that in 
quadrilateral AEFB, one has AB > EF; similarly, in 
quadrilateral DEFC, it follows that CD > EF. 

If P lies outside of triangle ADC, point D lies on the 
extension of AB beyond B, and point E lies between 


A and C, then one cannot obtain 4B = AC from the 
earlier equations by either addition or subtraction. 


SECTION 11.3, p. 625 


1. 


0: 


(a) Series converges (r = 3). 
(b) No conclusion (r = 1). 
(c) Series diverges (r = 3). 
(d) Series converges (r = +). 


us cos(2n n4| SINNX 
10-52 +e iy! 


(2n — 


n=l 


. (b) Substitute x = > into the given series. 


. (b) Substitute x = 0 into the given series. 


ao a 
3. 6 8 


= 4 2 
= 5 log2. 


The sus a the rows of the array are 
2,1,4,4,4,...; when added, these yield 


Solutions to Selected Problems 


2 4 8 
10. (a) Cage ssinee Ss 
n+1 2n 
1 1 1 1 
Ont ha) 
(b) For any e > 0,n > + 
a 1 1 1 
implies that s,, — S, < <7 <6. 
nom on 


11. (a) Given any ¢« > 0, take 6 = €; then |x — 0| <6 


implies that 
[f(x) — 0] < |x| <e. 
(b) Let x9 and L be xed. If 


. fIZ| |L-ZLI . 
€=min cs 5) , then no 6 > 0 exists 


for which |x — xo| < 6 implies that 


lf@) = LI <<. 


SECTION 11.4, p. 650 


2.(c) Ifig=—b+ai —dj +ck =—b+ai+dj— 
ck = qi, then —d = d and c = —c, so that 
c=d=0. 

7. (b) Suppose 


a bl] _fa2?+bce ab+bd]_ [0 1 
c d| |actdce cd+d?}~ |0 O}° 
Then (a + d)c = 0. Either c = 0, whence 


a =d=0, which implies that 0 = 1; or else 
a = —d, again yielding 0 = 1. 


(d) Use induction on n. If the result holds for n = k, 


rep Tb y- 
LT eft 


Because 4A~! = J = 47! A, taking transposes 


gives (A~!)' A! = J = A'(A~')', and so 
(47!) is the inverse of A’. 


8. (c) 


SECTION 12.1, p. 672 
1. 48 scholars 
2. 12, 16, 24, and 48 dollars 


ey Oe 
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65 geese 

120, 130, and 250 dollars 
father is 60 years, son 40 
4 children 

20 days 


30 miles 


SECTION 12.2, p. 697 


1. 


10. 


11. 


(a) {2,27,23,...} ~ N using the function f de ned 
by f(2") =n. 

(b) {5,10,15,...} ~ N using the function f de ned 
by f(5n) =n. 


. If Z+ denotes the set of positive odd integers, then 


Z* ~ N using the function f(2n — 1) =n; similarly, 
if Z> denotes the set of negative odd integers, then 
Z, ~ N using the function g(—(2n — 1) =n. Now, 
Z, = Z}UZ>z, hence Z, is countable. 


. The prime numbers form an in nite subset of the 


countable set N and so are countable themselves. 


. If A is countable, then A x {b} is countable for each 


b € B. Hence, A x B = Usen(A x {b}) is a countable 
union of countable sets, which makes 4A x B 
countable. 


. If ¢ is a right triangle having sides of integral lengths 


a, b, and c, dene the function f by f(t) = (a, b,c). 
Then S~ f(S) C Z x Z x Z. Since Z x Z x Z is 
countable, so is the set f(.S) and, in its turn, the set S. 


(a) The set of numbers of the form m/2” is an 
in nite subset of the countable set QO, hence is a 
countable set. 

(c) Let € be the line through the origin and the point 
(r,s) where r, s € Q. Then & has a rational 
slope t. Identify @ with the ordered triple 
(r,s,the OX Ox. 

(e) Consider any in nite set S of nonoverlapping 
intervals. For each interval J € S, select a single 
rational number (in lowest terms) 7; in J. A 
one-to-one function f : S— Q is dened by 
letting fU/) =7;. Then S~ f(S) CQ. 


Note that L ~ f(L) = (0, 1], where the interval (0, 1] 
is uncountable. 


SECTION 12.3, p. 715 


2. 


By Cantor’s theorem, 0(P(R)) > o(R) = c > Xo. 
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3. 


(a) Suppose that 4 C B. If C € P(A), then 
CCACB,; hence, C C P(B). This shows that 
P(A) C P(B). 

(c) Let C € P(AN B), so that CC AN B. Then 
C C A, which means C € P(A); and C C B, 
which means C € P(B). Thus, 

C € P(A) P(B), implying that 
P(AN B)C P(A)N P(B). 


. If A and B are in one-to-one correspondence via the 


mapping f: A— B, then f*: P(A) > P() will 
also be one-to-one. For suppose that C 4 C’, where 
C, C’ € P(A); say, there is some element x € C with 
x ¢C’. Then f(x) € f(C), but f(x) ¢ f(C’); for if 
F(x) = f(x’), where x’ € C’, then the one-to-one 
nature of f would imply that x = x’ € C’,a 
contradiction. Thus, f*(C) 4 f*(C’). 


. (a) Suppose to the contrary that the set of countable 


subsets of N can be arranged in a sequence 
A, Ao, A3,.... By a diagonal argument, 
construct a subset 4 of N, which is different 
from each 4,,. 


.(a) Ifx =o(A), then x +0 =0(A U9) = 0(A) =x. 


Also, x - 1 = o(A x {1}) = 0o(A) = x. 
(c) Ro: Xo = O(N x N)=No. 
(e) c+c=o([0, 00) U(—o0, 0)) = o(R) = c. 
(g) c-c=0(Rx R)=0(R)=c. 


. Because the natural number n is de ne d in no more 


than 36 words, all of them taken from our lexicon, 
is contained in the set S. On the other hand, on 
account of its de nition, m cannot be contained in S. 
This leads to a formal contradiction. 


. Because the sentence contains a nite number of 


words, the real number 7, which it describes will be in 
the set S. But, owing to its very de nition, r differs 
from each member of S in at least one decimal place; 
hence this number is not in S. 


10. 


11. 


Solutions to Selected Problems 


v2. , F Pa : 
The number ./2~~ is either rational or irrational. If it 
is rational, then we have our example. If it is 


irrational, put x = wa and y = V2, so that 


w= (/2)v2 = (/2) = 2, which is certainly 
rational. 


(a) To decide on the value of n, we must either nd 
a sequence of digits 0123456789 in z or 
demonstrate that no such sequence can exist. At 
present, there is no method that would enable us 
to do either. From the intuitionist point of view, 
only constructible entities exist, so they would 
accept no statement regarding n. 


SECTION 13.2, p. 735 


1. 


(a) d(x, y) = 0 does not necessarily imply that 
x=y. 

(b) x =~y does not necessarily imply that 
d(x, y)=0. 


. The only way in which d(x, y) < d(x, z)+d(z, y) 


could fail to hold is if d(x, y) = 1 and 
d(x, z) = d(z, y) = 0. This is impossible, because it 
would imply that x = z = y, whence d(x, y) = 0. 


. Taking x = y gives 0 =d(y, y) < d(z, y) +d(z, y) 


so that d(z, y) > 0 for all z and y in_X. Ifz = y, then 
d(x, y) < d(y, x) + d(y, y) = d(y, x). Thus, 

d(x, y) < d(y, x) for all x and y in X. Interchanging 
x and y yields d(y, x) < d(x y), implying that 

d(x, y) = d(y, x). 


. Suppose that € = d(x, vy) > 0. Now d(x, x) < €/2 


and d(x,, v) < €/2 when n > no for some no. Thus 
d(x, y) < d(x, x,) + dan, y) < €/2+€/2 =e, 
which is a contradiction. Hence d(x, vy) = 0, and so 
x=y, 


8. (b) [0, 00) is a closed set. 
9. The sets (a) {(x, y)|x = 1} and (c) 


{(x, y)|x? + y? < 1} are both closed. 
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d’Alembert and, 605, 606 


differential (see differential calculus) 


Euler and, 531, 605-6 
history of, 338 
in nites imal, 418, 419, 472, 525 
integral (see integral calculus) 
Lagrange and, 606, 607-8 
Leibniz and, 413-16, 420-24, 
453-54, 475-76, 626-27 
limits and, 416, 605, 606, 607-8 
Newton and, 391, 416-20, 422-26, 
626 
vector analysis, 663 
calendars 
Arab, 249 
Chinese, 28, 262 
Gregorian, 248, 306 
Jalalian, 248 
lunar, 3, 249 
Mayan, 7-8 
Persian, 248 
Regiomontanus and, 305-6 
Roman (Julian), 305 
tally marks for, 3 
caliphs, 238 
Cambridge Mathemetical Journal, 646 
Cambyses (d. 522 B.c.), 90 


Campanus of Novara (d. 1296), 275, 305 


cancellation of terms, in congruence 
theory, 555 
Cantor-Dedekind theory, 620-21 
Cantor, Georg (1845-1918) 
Beitrage zur Begrtindung der 
Trans niten Mengenlehre, 677 
biographical information, 683-84 
continuum hypothesis, 694-97 
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A Contribution to Manifold Theory, 
694 
Contributions to the Founding of the 
Theory of Trans nite Numbers, 
683 
diagonal argument, 683, 688-89, 697 
Foundations of a General Manifold 
Theory, 702 
Hilbert on, 709 
illustration of, 677 
in nite sets, 676, 677-81, 684-88, 
689, 707 
Kronecker and, 682-83 
number theory, 620-21, 680 
paradox, 699-701, 703 
On a Property of the Collection of All 
Real Algebraic Numbers, 684 
real numbers, 620 
set theory, 677-81, 682, 683, 684-89, 
692, 694-97, 699-702, 702, 729 
transcendental numbers, 690 
trigonometric series, 677 
Uber eine Eigenshaft des Inbegriffes 
aller reellen algebraischen 
Zahlen, 677 
Uber unendliche lineare 
Punktmannichfaltigkeiten, 695-96 
Weierstrass and, 616 
Carcavi, Pierre de (1600-1684), 465 
card playing, 444 
Cardan, Girolamo (1501-1576) 
algebraic notation, 348 
Ars Magna, 320, 321, 325, 326-27, 
328, 329, 330-31, 334 
biographical information, 319-20 
De Malo Recentiorum Medicorum 
Medendi Usu Libellus, 319 
illustration of, 322 
Liber de Ludo Aleae, 320, 338, 444, 
445, 493 
Liber de Vita Propria, 320 
Opus Novum de Proportionibus, 457 
Practica Arithmeticae, 320, 445 
probability and, 320, 338, 445-46 
quartic equations and, 328 
tule of signs, 372 
Cardan’s formula, 323-24, 325, 326, 327 
cardinal numbers, 680-81, 689, 695-96, 
700-701 
Carlitz, L., 519 
Carlyle, Thomas (1795-1881), 576 
Carnot, Lazare (1753-1823), 544-45 
Carolingian miniscule, 270 
Carolingian Renaissance, 270-72 
Carr, George S. (1837-1914), 727 
Cartan, Henri (1904-2008), 748 
Cartesian parabolas, 380 
Carthage, 213 
cartouches, 35-36 
Cartwright, Mary L. (1900-1998), 745 
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Cassiodorus, Flavius Magnus Aurelius 
(c. 490-585), 236-37 
Cataldi, Pietro (1548-1626), 506-7 
catenary curve problem, 474 
cathedral schools, 271-72, 307-8 
Catherine I (1684-1727), 528 
Catherine the Great (1729-1796), 529 
cattle problem, 223-24 
Cauchy, Augustin-Louis (1789-1857) 
calculus development, 608-9, 619 
Cours d’analyse de |’Ecole Royale 
Polytechnique, 608, 609, 620 
Exercises d’analyse mathématiques et 
de physique, 607 
Exercises de mathématiques, 607 
Galois and, 332 
on geometric continuity, 379 
Legons sur le calcul différentiel, 608 
limit concept, 606, 608, 620 
permutation groups, 643 
Résumé des lecons sur le calcul 
in nitesimal , 608 
standard of rigor, 614 
Cauchy convergence criterion, 609 
Cauchy’s theorem, 643 
Cavalieri, Francesco Bonaventura 
(1598-1647), 338, 384, 426 
Cayley, Arthur (1821-1895) 
biographical information, 639-41 
Collected Mathematical Papers, 641 
Fawcett and, 666 
four-color conjecture and, 749 
group theory, 643-44 
illustration of, 639 
London Mathematical Society and, 
650 
Mathematical Tripos and, 721, 722 
matrix theory, 641-43, 651 
A Memoir on the Theory of Matrices, 
642, 651 
on Peirce, 663 
theory of invariants, 641 
Treatise on Elliptic Functions, 641 
Cayley-Hamilton theorem, 642-43 
Cayley tables, 644 
celestial mechanics, 386, 399, 432, 
480-83, 675 
central limit theorem, 492 
centrifugal force law, 397 
Ceres, 549, 573 
Ceulen, Ludolph van (1540-1610), 202, 
251 
chains, exible, 474 
challenges. See mathematical challenges 
Champollion, Jean-Francois 
(1790-1832), 35, 37 
Chang Ch’iu-chien (6th cent.), 229 
Chang Heng (78-139), 202 
Chao Yu-chin, 255 
characteristic triangles, 415-16 


characteristica universalis, 410 
Charlemagne (742-814), 270, 271, 277 
Charles Edward Stuart (1720-1788), 526 
Charles I (1600-1649), 381, 383 
Charles IT (1630-1685), 381, 382, 383, 
388, 389, 392, 442, 501 
Charles Martel (c. 688-741), 272 
Charles X (1757-1836), 606 
Chauvenet, William (1820-1870), 576 
Chebyshev, Pafnuty (1821-1894), 492, 
572-73 
check, origin of term, 4 
Cheops (. c. 2680 B.c.), 57 
Chevalley, Claude (1909-1984), 748 
Chia Hsien (c. 1010-1070), 457 
Ch’in Chu-shao (c. 1202—1261), 259-60 
Chinese mathematics, 251-63 
algebra, 228-30, 251, 252-56, 
257-58, 266 
Arithmetic Classic of the Gnomon 
and the Circular Paths of Heaven, 
106, 252, 262 
brush numerals, 29-30 
calendars, 28, 262 
counting rods, 28-29, 258, 260, 261 
geometry, 252, 253, 254-55, 256-57, 
259, 264-65 
history of, 26-28, 251-52, 262 
number system, 28-30 
paper invention, 27-28, 304 
x value, 201-2 
Pythagorean theorem proof, 106 
triangle of binomial coef cients, 457 
Western in uences , 262-63 
See also Nine Chapters on the 
Mathematical Art 
chords of angles, 192, 209, 273 
Christian Church 
barren period and, 284—85 
Crusades, 273, 274 
divine grace controversy, 450-51 
education and, 235-37, 236-37, 
269-72, 307 
Galileo investigation, 342-44, 500 
games of chance and, 444 
geocentric vs. heliocentric theory, 
340-45, 364, 375 
Gregorian calendar, 248, 306 
Italian academies and, 498 
Jesuit missionaries, 28, 262—63 
rise of, 215-16, 217, 233 
Christina (queen of Sweden) 
(1626-1689), 364-65 
Chu Shih-chieh (. 1280-1303), 261-62, 
457 
Chuquet, Nicolas (c. 1445-1500), 361 
Cicero (106-43 B.c.), 197, 215 
cipherization, 15—16, 29 
circles 
area of, 54-56, 61, 199 
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circumference, 55 
inscribed in triangles, 256-57 
problem of Apollonius, 208 
triangle circumscribed by, 76 
See also quadrature of the circle 
circular cones, 207-8 
circular orbits, 188-89 
circumference, of a circle, 55, 186-88, 
202-3 
Clairaut, Alexis-Claude (1713-1765), 
431, 432, 480 
Claudius I (10 B.c—54 a.p.), 444 
Clavius, Christoph (1537-1612), 262, 
306, 350, 361, 410 
Cleopatra I (c. 204-176 B.c.), 36 
Cleopatra VII (69-30 B.c.), 213-14, 234 
Clifford, William Kingdon (1845-1879), 
595 
clocks, pendulum, 470 
closed sets, 729 
COBOL computer language, 746 
Codex Vigilanus, 279-80 
Cohen, Paul (1934-2007), 697, 704 
coin-tossing, 489-91 
Colburn, Zerah (1804-1840), 634 
Cole, Frank Nelson (1861-1926), 508, 
668 
Cole Prize, 668 
Collins, John (1625-1683) 
De Analysi and, 417, 418 
Oldenburg and, 420, 421, 427, 429 
Colson, John (1680-1760), 418, 430 
Columbus, Christopher (1451—1506), 
191, 306 
comets, 403, 432, 573 
common divisor, greatest, 173, 289, 293 
Communications for Pure and Applied 
Mathematics, 742 
communitas, 307 
commutative law, 631, 638, 639, 642, 
738 
compactness, 732 
compass constructions, 545 
See also straightedge/compass 
constructions 
compilers, 746 
completion of squares method, 240-41, 
263 
complex numbers 
Argand and, 632 
Bombelli and, 325 
Cardan and, 321 
Euler and, 630-31 
fundamental theorem of algebra and, 
548 
Gauss and, 632-33 
Girard and, 350 
Hamilton and, 635-39, 650 
Wallis and, 631 
Wessel and, 631-32 
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complex numbers—Cont. 
zeta function and, 724 
composite numbers, 172, 178-79, 181, 
713 
Comptes rendus, 607, 730 
computers 
AIS, 743 
ASCC (Mark I), 745-46 
Babbage and, 627-28 
ENIAC, 746 
hardware development, 745—47 
MANIAC, 743 
x calculation, 694, 746 
proofs using, 748, 749-51 
UNIVAC, 742, 746 
von Neumann and, 743 
Condorcet, Marie-Jean Caritat, Marquis 
de (1743-1794), 530, 541 
cones, 61, 207-8 
congruence theorem, 88, 115 
congruence theory, 551—58, 561, 572, 
622 
congruent triangles, 115, 149-51 
conic sections, 206-8, 248-49, 327, 
448-49 
consistency of mathematics, 600-601, 
623-24, 708-10 
constant, as term, 613 
constant descent problem, 474 
Constantine the Great (272-337), 216, 
217 
Constantinople, 197, 206, 217, 238, 303 
constructible real numbers, 237 
construction problems. See duplication of 
the cube; quadrature of the circle; 
trisection of an angle 
continuity principle, 356, 379, 386, 622 
continuous functions, 608, 619-20 
continuum hypothesis, 694-97, 704, 751 
convergence, 102, 609-10, 625 
coordinate system, oblique, 368-69 
Copernicus, Nicolaus (1473-1543) 
De Revolutionibus Orbium 
Coelestium, 276, 306, 342 
epicycles, 189 
Galileo and, 340 
illustration of, 343, 344 
Kepler and, 355, 356 
Recorde and, 314 
Cordoba, Spain, 247, 273 
cords, knotted, 4, 6-7 
cosines 
Euler identity, 531 
in nite series expansion, 532-33, 
538 
law of, 244 
cosmology 
Aristotle, 339, 340, 341 
Copernicus, 340 
Descartes, 364, 375-77 


Galileo, 339-40 
Kepler, 356—57, 358 
Laplace, 482-83 
Ptolemy, 340, 342 
Pythagoreans, 93 
cossic art, 314 
Cotes, Roger (1682-1716), 403, 526, 
531 
countable sets, 684-88, 689, 715 
counting, primitive, 1-9, 13 
counting rods, 28-29, 258, 260, 261 
Courant, Richard (1888-1972), 741, 742 
Cox, Elbert (1895-1969), 666 
Cramer, Gabriel (1704-1752), 495 
Crelle, August Leopold (1780-1855), 
332 
Crelle’s Journal, 332, 594, 613, 615, 
677, 694 
Cromwell, Oliver (1599-1658), 381, 386 
cross products, 639, 650-51 
Crusades, 273, 274, 444 
cryptography, 383 
Ctesibius of Chalcis, 142 
cubes of numbers 
Diophantus problems with, 223, 231 
sums of, 98-99, 247, 462-63 
syncopated algebra symbols, 219 
See also duplication of the cube 
cubic equations 
Archimedes and, 198-99 
Babylonian, 63 
Bombelli and, 324-26 
Cardan solution, 321 
del Ferro solution, 317 
Diophantus and, 315 
Fibonacci and, 282-83 
irreducible case, 324 
Italian Renaissance and, 315-16, 317 
Jordanus and, 284 
Omar Khayyam and, 248-49, 264, 
327 
rational roots, 237 
reduced form, 322—24, 326, 327 
resolvent, 330-31 
Tartaglia solution, 319, 321 
cuneiform writing, 20-23, 66 
Cunningham, Allan Joseph (1842-1928), 
509 
curves 
area under, 384-85, 417-18, 453 
brachistochrome problem, 475 
catenary, 474 
cycloids, 452-54, 470, 475 
Descartes and, 369-71 
isochronous, 474 
tautochronous, 470 
versed sine, 430, 434 
Cusa, Nicholas (1401-1465), 285 
cyclic quadrilaterals, 225 
cycloids, 452-54, 470, 475 
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cyclotomic integers, 518, 519 
Cyril of Alexandria (c. 375-444), 234 


da Bisticci, Vespasiano, 311 
Daily Courant, 521 
d’Alembert, Jean Le Rond (1717-1783) 
Encyclopédie, 416, 522 
Frederick the Great and, 540 
fundamental theorem of algebra and, 
548 
illustration of, 522 
Laplace and, 478, 481 
limit concept, 416, 605, 606 
Opuscles Mathématiques, 625 
overview, 604—5 
on parallel postulate, 580 
St. Petersburg paradox and, 490 
Traité de dynamique, 605 
Traité des uides , 605 
d’Alembert’s principle, 605 
Damascus, 238, 249, 250, 304 
Damon and Pythias story, 92 
Danse Macabre, 302 
Darius (Persian king) (550-486 B.c.), 4, 
6, 22, 85 
Dark Ages, 269 
Davies, Charles (1798-1879), 576 
de Maupertuis, Pierre-Louis 
(1698-1759), 431 
de Méré, Antoine Gombaud, Chevalier 
(1607-1684), 454-56 
De Moivre, Abraham (1667-1754), 427, 
477-78, 525, 532 
De Morgan, Augustus (1806-1871) 
biographical information, 649 
Budget of Paradoxes, 529, 580, 649 
on calculus controversy, 421—22, 
423-24 
The Differential and Integral 
Calculus, 649 
Formal Logic, 647, 650 
four-color conjecture and, 749 
on Halley, 400 
induction term, 466 
on Laplace, 483 
Sylvester and, 665 
symbolic logic, 629, 648, 649-50 
De Morgan medal, 745 
De Morgan’s rules, 648 
de Saron, Jean-Baptiste, 541 
decagons, 165, 166 
decimal fractions, 348—50 
decimal point, 349-50 
decimal systems, 13-15, 26, 278 
decomposition, in ring theory, 739 
Dedekind, Richard (1831-1916) 
Cantor and, 681, 694, 695 
Continuity and Irrational Numbers, 
168, 678 
Dirichlet and, 559 
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Galois theory use, 333 

induction term, 466 

number theory, 620-21, 679, 682, 
738-39 

real lines, 707 

real numbers, 620 

ring theory and, 737, 752 

Stetigkeit und die Irrationalzahlen, 
684 

Was sind und was sollen die Zahlen, 
679 


Devanagari numerals, 279 

diagonal numbers, 111—16, 118 
Diamond Sutra, 253 

dice play, 443-44, 445, 454-56, 488 


Dickson, Leonard Eugene (1874-1954), 


668, 669, 671, 752 
Diderot, Denis (1713-1784), 429, 522, 
529, 605 
Dieudonné, Jean (1906-1992), 748 
Difference Engine, 628 
differential calculus 


Earth to moon, 208 
Sea Island Mathematical Manual 
and, 258-59 

ships at sea, 88-89 

topology and, 730-31 
distinguishable, as set attribute, 678 
distributive law, 738 
dividers, 25 
divination rites, 443 
division 

Babylonian, 64 
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Dedekind rings, 739 

deferents, 189 

de nite, as set attribute, 678 

del Ferro, Scipione (1465-1526), 317, 


Bernoulli and, 475 

catenary curve problem and, 474 
Lagrange and, 541, 607 

Leibniz and, 413-16, 422, 423, 472 


of differentials, 414-15 

Egyptian, 38-39, 42-43 

Euclidean divisibility properties, 
170-73 
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Delambre, Jean Baptiste (1749-1822), 
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Delian problem. See duplication of the 
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Delos oracle, 125 

Democritus (c. 460-370 B.c.), 54 
demography, Laplace and, 487-88 
demotic writing, 15, 35, 36 

dense sets, 729 

derived sets, 729 

Desargues, Girard (1591-1661), 338, 


377-79, 500 


Desargues’ theorem, 378 
Descartes, René (1596-1650), 362—77 


amicable numbers, 510 

analytic geometry development, 
367-65, 367-75 

biographical information, 362-65 

Desargues and, 379 

Discours de la Méthode, 347-48, 
364, 365-67, 468 

illustration of, 363 

imaginary numbers, 372-73, 630 

La Dioptrique, 367 

La Géométrie, 362, 366, 367-75, 
388, 513, 630 

Le Monde, 364 

Les Météores, 367 

mechanical universe concept, 364, 
375-77 

on Pascal, 449 

philosophy of systematic doubt, 
365-67 

polyhedron relation, 539 

Principia Philosophiae, 364, 375-77, 
401 

tule of signs, 372 

vortex theory, 375—76, 377, 386, 401, 
402 

brief citations, 338, 347, 362, 378, 
380 


Description de l’Egypte, 34 


L’Hospital and, 475 
Newton and, 391, 416-20, 422-26, 
626 
differential equations, 474, 531, 611, 
675, 706 
differential triangles, 389-90 
dimension, continuum hypothesis and, 
695 
Dinostratus (c. 350 B.c.), 132-33, 137 
Diocletian (c. 245-313), 216 
Diogenes the Cynic (c. 412-423 B.c.), 
102 
diophantine equations 
de ned, 226 
Fermat and, 512, 516, 558 
Fibonacci and, 280-81 
Hilbert’s tenth problem and, 744 
Lagrange and, 542 
linear congruences and, 556, 557 
overview, 226-32 
Diophantus of Alexandria (. A.D. 250) 
algebra and, 239 


Arithmetica, 219-23, 225, 230, 231, 
233, 293, 325, 349, 512, 513, 516 


biographical information, 217—18 
cubic equations, 315 
indeterminate equations, 226 
The Porisms, 219 


Pythagorean problem solution, 108-9 


Renaissance and, 313 
right triangle formulas, 74 
symbols use, 219-20, 348 
directrix, 208 
Dirichlet, Peter Gustav Lejeune 
(1805-1859) 
Fermat’s last theorem proof, 517 
function concept, 613-14 
Gauss and, 558, 598, 616 
Kronecker and, 681 
Lectures on Number Theory, 738 
number theory, 558, 559 
overview, 558-59 


Vorlesungen tiber Zahlentheorie, 559 


division symbol (+), 314, 346 
division theorem of Euclid, 173-74 
divisors 
aliquot, 503 
greatest common, 173, 289, 293 
of integers, 170, 171, 172-73 
sum of divisors function, 509 
dodecahedrons, 539 
Doomsday Book, 439 
dot products, 639 
double false position method, 48 
double refraction, 469-70 
double tally sticks, 4 
doubling, Egyptian method, 37-39, 
41-42, 49 
du Bois-Reymond, Paul (1831-1889), 
620 
duality principle, 379-80, 648 
duplication of the cube 
Apollonius and, 128 
Archytas and, 116 
as early construction problem, 
124-26, 315 
Eratosthenes and, 184—85 
Hippocrates and, 125-26 
Menaechmus and, 128 
Newton and, 129-30 
Nicomedes and, 129 
Pappus and, 232-33 
Plato and, 128 
using parabolas, 128, 327 
duplication of the square, 124 
dynamics, 345, 605, 675 


e (natural log base) 
Euler and, 530, 531-32 
irrationality of, 538-39, 571, 692 
mysterious formula using, 663 
Napier’s logarithms and, 354, 361 
transcendence of, 571, 692 

Earth 
circumference of, 186-88, 202-3, 

397, 398 


Dirichlet’s theorem, 559 
distance measurements 


distance from moon, 208 
ellipticity of, 403 


descriptive geometry, 543 
determinants, Jacobian, 560 
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Earth—Cont. 
magnetic force of, 550 
meter length and, 545 
polar vs. equatorial diameter, 431 
eccentricity of orbits, 481 
eclipses, 89-90 
Ecole des Ponts et Chaussées, 606 
Ecole Militaire, 478, 571 
Ecole Normale, 484, 540, 541, 543, 610, 
673 
Ecole Polytechnique, 379, 491, 540, 541, 
544, 558, 571, 606, 607, 608, 
610, 611, 673 
Edgerton, Winifred (1862-1951), 665 
education 
cathedral schools, 271-72, 307-8 
Christian, 235-37, 236-37, 269-72 
Greek, 84-85 
monastery schools, 236-37, 269-71, 
307 
Renaissance and, 306-10, 311 
See also universities and schools; 
speci c organizations 
Edward III (1312-1377), 444 
Edward VI (1537-1553), 319-20 
Egypt 
Arab conquest, 238 
as center of mathematics, 141 
history of, 10-11, 30-31, 610 
Roman rule, 213-14 
Rosetta Stone, 34, 35-37 
See also Alexandria 
Egyptian mathematics 
algebra, 47, 49, 78-79 
arithmetic, 13-15, 37-43, 49-53, 61 
geometry, 33, 53-61 
hieratic writing, 15-16, 34 
hieroglyphics, 12-15, 18 
Hounds and Jackals game, 443 
Mathematical Leather Scroll, 35, 58 
a value, 55, 61 
Einstein, Albert (1879-1955), 741 
El Madschriti of Madrid, 510 
Eleatic school, 101-2 
Electronic Numerical Integrator and 
Calculator (ENIAC), 746 
elefuga, 150 
Elements (Euclid) 
Book I, 147-59, 168, 169, 563, 567, 
568, 569, 579, 600, 730 
Book II, 134, 159-64, 167-68 
Book IV, 165 
Book IX, 170, 177-78, 180-82, 504 
Book VI, 158-59, 161, 163 
Book VII, 170-77, 178-79 
Book VIII, 170 
axioms, 117, 145-47, 565, 596, 600, 
621, 623 
common notions, 145, 146 
de nitions , 145, 147, 170-73 


divisibility properties, 170-73 

English vs. French use of, 574-75 

fundamental theorem of arithmetic, 
170, 177-78 

geometric algebra of, 159-64, 167 

history of, 143-45 

illustration of, 151, 171 

Lobachevsky and, 596 

logical aws , 146-48, 149, 152-53, 
159 

Newton and, 388 

vs. Nicomachus, 94 

Pascal and, 447-48 

as textbook, 575, 576 

translations and commentaries, 244, 
248, 250, 262, 272, 275-76, 305, 
383 

brief citations, 105, 108, 109, 197, 
204, 225, 236, 238, 240, 241, 
282, 293, 314, 410, 456, 513, 
565, 567 

See also parallel postulate 

Elizabeth (Queen Mother) (1900-2002), 
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elliptic functions, 560, 574, 585, 614, 
615 

elliptical orbits, 189, 337, 359, 360, 398, 
403 


Encyclopedia Britannica, 522 
Encyclopédie, 416, 522-23, 529, 605 
England 
Black Death, 301-2 
calculus controversy effects, 429-30 
coinage reform, 406 
eighteenth century period, 526, 527 
Enlightenment period, 523 
French Revolution and, 542 
games of chance in, 444 
Great Plague of London, 301, 391 
Gregorian calendar, 306 
Hundred Year’s War, 302 
nineteenth century period, 626, 627 
Renaissance mathematics, 314 
seventeenth century period, 381-82 
See also Royal Society of London 
Enlightenment, 429, 520-24 
epicycles, 189-90, 250 
Epsilon, as nickname, 208 
equality 
vs. congruence, 552—53 
symbols for, 219, 345 
equant points, 189, 246 
equations 
cubic (see cubic equations) 
differential, 474, 531, 611, 675, 706 
diophantine, 226-32, 280-81, 512, 
516, 542, 556, 557, 558, 744 
indeterminate, 225, 226, 228-30, 535 
linear, 47-48, 160, 226-27, 228, 
229-30, 257-58, 265 
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Pell’s, 224, 230, 535 
quadratic (see quadratic equations) 
quartic, 328, 329-31, 334, 374, 381 
quintic, 331-32 
reunion and reduction operations, 239 
theory of, 347, 350, 533, 644 
equilateral triangles, 126, 147-48, 286 
equivalent sets, 679 
Eratosthenes of Cyrene (. 230 B.C.) 
Archimedes’ Method, 206 
biographical information, 183-84 
cattle problem, 223 
duplication of the cube, 184-85 
Geographica, 183-84 
map of habitable world, 184 
sieve of, 185-86 
Erdés, Paul (1913-1996), 741, 747 
Emest Augustus (1629-1698), 424 
error estimates, in sampling, 488 
Esau and Jacob, 510 
ether, 375, 393 
Euclid (. 300 B.c.) 
algorithm of, 173-77, 182, 228, 289 
biographical information, 143-44, 
145 
Conic Sections, 143 
Data, 143 
Galileo and, 339 
illustration of, 144 
Kant on, 587 
method of exhaustion, 204 
number theory, 170-81, 504-5, 506 
Porisms, 143, 219 
Renaissance and, 313 
See also Elements (Euclid) 
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Dedekind, 620-21, 679, 682, 738-39 

Dickson, 668 

Dirichlet, 558, 559 

Eratosthenes, 185-86 

Erdés, 747 

Euclid, 170-81, 504—5, 506 

Euler, 505, 507-8, 535-37, 723-24 

Fermat, 338, 512, 513-16, 519, 528, 
535 

Gauss, 547, 548-49, 550-51, 632-33 

Hamilton, 634-39 

Hardy, 723, 726, 728 

Jacobi, 560 

Lagrange, 541-42 

Lamé, 517, 518 

Legendre, 510, 517, 572 

Littlewood, 726 

Mersenne, 502, 507 

Nicomachus, 503-4 

Noether, 739 

Paganini, 510 

Pythagorean, 93-94, 107, 116 

Ramanujan, 728 

Weierstrass, 620 


numbers 


algebraic, 517, 690-93, 738, 739 

alphabetic representation, 16-18, 19, 
20, 95 

amicable, 244, 263, 509-10, 513 

cardinal, 680-81, 689, 695-96, 
700-701 

complex (see complex numbers) 

composite, 172, 178-79, 181, 713 

diagonal, 111-16, 118 

even, 94, 505, 537 

Fibonacci, 288-93 

gurative, 95-101, 103-5 

harmonic, 468 

ideal, 517-19, 738-39 

imaginary (see imaginary numbers) 

irrational (see irrational numbers) 

Liouville, 690 

natural, 170, 292, 542, 679, 685, 705, 
726 
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negative (see negative numbers) 

oblong, 103 

odd, 94, 96-97, 505-6 

perfect, 170, 502—9, 503, 507-8, 
513-14, 536, 554-55 

plane, 170 

polygonal, 95 

prime (see prime numbers) 

rational, 43-46, 109, 111—15, 237, 
686-87, 689 

real, 237, 608, 620 

side, 111-16, 118 

solid, 170 

square, 95-97, 103-5, 220-23, 231, 
285 

tetrahedral, 104—5 

transcendental, 122, 571, 689-93, 
690, 692-93, 724 

trans nite, 677, 681, 689, 696 

triangular, 95—96, 98-99, 103-5, 412, 
464 

numerus surdus, 168 


oblong numbers, 103 
obtuse angle hypothesis, 567-69, 570, 
600 
octahedrons, 539 
octave (music), 93 
Octavian (Emperor Augustus) 
(63 B.c.14 a.p.), 213-14 
octonions, 638, 639 
odd numbers, 94, 96-97, 505-6 
odd order theorem, 744 
Ohm, Georg (1789-1854), 558 
Olbers, Heinrich (1758-1840), 573, 585 
Oldenburg, Henry (c. 1618-1677) 
Collins and, 420, 421, 427 
in nite series, 533 
Leibniz and, 412, 420-21, 424 
at Leiden, 469 
Newton’s binomial theorem and, 393, 
433, 473 
omicron, used for zero, 25 
one (1), as number, 93 
one-to-one correspondence, 679 
optics 
Descartes work, 367 
Hamilton work, 634 
Huygens work, 469-70 
Kepler work, 464 
Maurolico work, 464 
Newton work, 391, 392-93 
wave-particle duality of light, 393 
wave theory, 469 
orbits 
Ceres, 549, 573 
circular, 188-89 
elliptical, 189, 337, 359, 360, 398, 
403 
Gauss’s theory of, 549 
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orbits—Cont. 
Mars, 208, 359, 360 
moon, 396-97 
periodicity and eccentricity of, 481 
order, in group theory, 622—23, 744 
ordered sets, 683, 702 
ordinate, as term, 369 
Oresme, Nicole (1323-1349), 284-85 
Orestes (d. 476), 233-34 
Osgood, William Fogg (1864-1943), 669 
Oskar II (1829-1907), 675 
Ostrogradski, Mikhail (1801-1862), 594, 
617 
Ottoman Turks, 303 
Oughtred, William (1574-1660) 
Clavis Mathematicae, 345, 382, 383, 
384, 530 
mathematical notation, 345, 346 
slide rule invention, 355 


Beltrami and, 602 

Bolyai, John and, 589-90 

Bolyai, Wolfgang and, 581, 588-89 
de ned, 146, 563 

equivalent statements, 564-65 
Euclid’s, 563, 569, 580-81, 595, 602 
Gauss and, 585-86, 588 

Lambert and, 569-70 

Legendre and, 576-80, 581-82 
Lobachevsky and, 593-94, 596-98 
Omar Khayyam and, 248 

Playfair’s axiom and, 563-64, 565 
Proclus’s, 563, 565 

Riemann and, 599-600, 602 
Saccheri and, 566-69 

signi cance of, 156 
Thabit-ibn-Qurrah and, 246 

as theorem vs. assumption, 146—47 
Wallis and, 566 
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Peacock, George (1791-1853), 626-27, 
629-30 
Peano, Guiseppe (1858-1932) 
Arithmetices principia nova methodo 
exposita, 705 
axioms, 702, 708 
Calcolo geometrico secondo 
l’Ausdenungslehre di H. 
Grassman, 646 
continuous functions, 620 
Formulaire de mathématiques, 705-6 
I Principii di geometria logicamente 
espositi, 621 
Russell and, 707 
Peirce, Benjamin (1809-1880), 659, 661, 


Wallis and, 384 


Pacic Journal of Mathematics, 744 
Pacioli, Luca (c. 1445-1514) 


fair stakes and, 445-46 
illustration of, 317 
Summa de Arithmetica, Geometria, 


parallel rank, 458 

parallelism, 378, 622 
parallelograms, 145, 169, 291-92 
parchment, 304 

Paris Academy, 517, 518, 519 
partitions, theory of, 728 

Pascal, Blaise (1623-1662), 446-54 


662-63, 664 

Peirce, Charles Sanders (1839-1914), 
648, 665 

Pell, John (1611-1685), 224, 500, 507, 
535 


Pell’s equation, 224, 230, 535 
pendulum motion, 339, 397, 402, 470 
Penny Cyclopedia, 466 

pentagons, 165-66 

pentagrams, 92 

Pentathis nickname, 183 


Proportioni et Proportionalita, 
306, 309, 316-17, 445 
Padoa, Alessandro (1868-1937), 621 
Paganini, Nicolo I., 263, 510 
palimpsests, 206 
palindromes, 561 
paper, 27-28, 304 
Pappus of Alexandria (. c. 320) 
harmonic mean, 119 
locus problems, 368, 380 
Mathematical Collection, 119, 132, 
169, 232-33 
proposition 5 proof, 150 
on quadrature of the circle, 132, 133, 
134 
Renaissance and, 313 
papyrus, 15 
papyrus scrolls 
Akhmin, 51 
Cairo Mathematical, 77-79 
Moscow, 34, 56-57, 61-62 
Rhind (see Rhind Papyrus) 
parabolas 
area under curve, 384-85 
Cartesian, 380 
duplication of the cube and, 327 
equation for, 198-99, 207-8 
method of tangents and, 371 
Omar Khayyam and, 248-49, 327 
quadrature of segment, 206-7 
parabolic segments, area of, 246 
parallel lines, 153-56, 378, 563, 564 
parallel postulate, 563-84 
al-Tus* and, 250 


biographical information, 446-48 

calculating machine, 412, 449-50, 
452 

characteristic triangles, 415 

cycloid and, 452-54 

on Desargues, 378-79 

Essay pour les Coniques, 379, 
448-49, 451, 500 

Fermat and, 472, 512, 519 

History of the Cycloid, 453 

illustration of, 447 

Leibniz and, 413 

Lettres Provinciales, 446 

mathematical induction use, 465-66 

mystic hexagon theorem, 448, 449 

Pensées de M. Pascal sur la Religion, 
446, 452 

probability theory, 439, 446, 454-56 

religious work, 450-52 

Traité des sinus du quart de circle, 
454 

Traité du Triangle Arithmétique, 450, 
456, 457-58, 465 

Treatise on Figurative Numbers, 98 

brief citations, 338, 363, 442, 500 


Pascal, Etienne (1588-1640), 137, 447, 


449, 450, 500 


Pascaline, 449-50, 452 
Pascal’s triangle, 246, 261, 318, 456-61, 
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Pasch, Moritz (1843-1930), 621 
Pasch’s postulate, 623 

Pax Romana, 214 

Peace of Westphalia (1648), 411 


Pepys, Samuel (1633-1703), 391, 400 
Perelman, Grigori, 674 
perfect numbers 
de ned, 503 
Euclid and, 170, 536 
Euler and, 507-8, 536 
Fermat and, 513-14 
nal digits of, 554-55 
history of, 502—9 
perfect proportion, 119 
Pergamon, 206 
Pergamon, Library of, 142-43, 234 
perimeter, 66-67, 200-201 
periodicity of orbits, 481 
permutation groups, 643-44 
perpendicular rank, 458 
Persian calendar, 248 
perspective, projective geometry and, 
377, 378 
Pervusin, J., 508 
Peter Lombard (c. 1095-1160), 218 
Petrie, William Matthew Flinders 
(1853-1942), 60 
Pfaff, Johann Friedrich (1765-1825), 
548, 550 
phi function (g(n)), 535 
Philip of Macedonia (382-336 B.c.), 85, 
141 
Philosophical Magazine, 521 
Philosophical Transactions (Royal 
Society of London), 426-27, 428, 
502, 521, 627, 647 
Phoenician alphabet, 16, 84 
m (pi) 
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al-Kash’ value, 251 sophists and, 130-31 

Archimedes value, 200-202, 254 on Thales of Miletus, 87 

Aryabhata value, 201-2 Platonic Academy, 83, 134-37, 135, 136, 
Babylonian value, 55, 61 234 

Chinese value, 254-55, 262 Platonic solids, 539 

computer-calculated value, 694, 746 Playfair, John (1748-1829) 

Egyptian value, 55, 61 Elements of Geometry, 563-64 
Leibniz series for, 408, 413, 432, 537 on Laplace, 481 population size, estimation of, 487-88 
Ludolphine number, 202, 251 parallel postulate, 563-64 Porta, Giambattista della (1535-1615), 
Machin computation, 408 Playfair’s axiom, 563-64, 565, 566, 623 498 

mysterious formula using, 663 plenum, 375 positional number systems 

needle problem and, 487 Plimpton 322 tablet, 72-76 Babylonian, 23-26 

Newton series for, 433-34 Pliny the Elder (23-79 A. D.), 188 Chinese, 28-29 

number of places, 201-2, 693, 694 plus-or-minus symbol (+), 382 Mayan, 7—8, 30-31 

Ramanujan series for, 728 Plutarch (c. 46-120) See also Hindu-Arabic numerals 
Shanks value, 678, 693 on Archimedes, 194, 195, 197 positive integers. See natural numbers 


Pompadour, Jeanne-Antoinette Poisson, 
Marquise de (1721-1754), 540 

Poncelet, Jean-Victor (1788-1867), 
379-80, 544 

pons asinorum, 150 

Poor Man’s Bible, 304 

Pope, Alexander (1688-1744), 1, 404 


symbol origin, 200, 530 
transcendence of, 122, 571, 692-93 
Wallis formula, 538 


Convivium, 88 
The Life of Marcellus, 194, 195 


power 
of 7, as signi cant, 49-50 
Babylonian system, 23—24 


on Plato, 122 

on Thales of Miletus, 88 

on triangular numbers, 103 
Poincaré conjecture, 674 
Poincare, Henri (1854-1912) 

Analysis Situs, 674 

Birkhoff and, 670 

Hill and, 662 

illustration of, 700 

intuitionism and, 713 

Les Méthodes Nouvelles de la 

Meéchanique Celeste, 675 
overview, 673-76 
on set theory, 699, 701 


Piazzi, Giuseppi (1746-1826), 549 
Picard, Jean (1620-1682), 398 
pictographs, 21 
Pieri, Mario (1860-1904), 621, 622, 623 
Pierpont, James (1866-1938), 669 
Pisa, Leaning Tower of, 339 
place value, 7, 23 

See also positional number systems 
planar surfaces, 621 
Planck, Max (1858-1947), 616 
plane geometry, 125, 148-52 
plane numbers, 170 
planes, 378, 622, 632-33 
planetary motion 


Bombelli notation, 348 
Egyptian system, 13, 15 
exponential notation, 347-48 
Greek system, 17 
power series expansion, 526 
power sets, 699 
Powers, R. E., 508, 509 
pressure experiments, 450, 499, 500 
prime factorization, 180, 507 
See also fundamental theorem of 
arithmetic 
prime number theorem, 673, 747 
prime numbers 
Bertrand’s conjecture, 492 


Descartes and, 375-76 

Kepler’s laws, 189, 337, 359, 360, 
395-96 

Maragha astronomers and, 250 

Poincaré and, 675 


planetoids, 549 
planets 


Jupiter, 339, 357, 481, 549 
Mars, 357, 359, 360, 549 
Mercury, 357 

Neptune, 549 

Saturn, 357, 421 

Uranus, 549 

Venus, 357 

See also Earth 


Plato (429-348 B.c.) 


biographical information, 134-37 

duplication of the cube and, 125 

Eudoxus and, 117 

golden ratio/section and, 166 

Hippias Major/Minor, 131 

illustration of, 122 

inscription at Academy, 83 

irrational numbers and, 116 

Pythagorean problem solution, 108, 
117 

quadrature of the circle and, 121, 122 


three-body problem, 675, 676 
Poincaré’s last geometric theorem, 676 
point-set topology, 729-32 
points 

accumulation, 696 

Euclid’s de nition, 147 

Hilbert’s relations, 622 

at in nity, 378, 448 

notion of, 621 

problem of, 445-46 
pointwise convergence, 609 
Poisson, Siméon-Denis (1781-1840), 

333, 473, 491 
Polish School of mathematics, 733-34 
Polya, George (1887-1985), 741 
Polybius (c. 207-125 B.c.), 65, 85, 194 
Polycrates (d. 522 B.c.), 90 
polygonal numbers, 95 

See also gurative numbers 
polygons 

construction of, 165-66, 547 

am value and, 200-201, 202, 254-55 

See also speci c types 
polyhedrons, 539 
polynomials, 246-47, 260, 261, 371, 

372-75, 642-43 
polyphony, in cuneiform writing, 23 


dened, 172 

distribution of, 572-73 

Euler and, 535 

Fermat and, 535 

Fibonacci numbers and, 288-89 

fundamental theorem of arithmetic 

and, 170, 172, 177-80 

irregular, 519 

Mersenne, 507-9 

perfect numbers and, 506-8 

regular, 519 

relatively prime integers, 176 

sieve of Eratosthenes and, 185-86 

See also in nitude of primes 
primitive Pythagorean triples, 294-98 
primum mobile, 340, 341 
principle of duality, 379-80, 648 
principle of superposition, 149 
printing 

of cards, 444 

in China, 253, 262 

of Euclid’s Elements, 275-76 

Hindu-Arabic numerals and, 280 

invention of, 303—5 

in the Netherlands, 468 

Regiomontanus and, 313 

symbols and notation, 346 
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privatdozents, 558, 587, 598, 619, 658, 
737 
probability, 439-95 
addition and multiplication theorems, 
485 
Bernoulli family and, 471—77 
Cardan and, 320, 338, 445-46 
Chebyshev and, 492 
de Méré and, 454-56 
De Moivre and, 477-78 
de ned, 445, 484-85 
Fermat and, 338, 439, 454 
games of chance and, 443-46, 473 
Graunt and, 440-42 
Huygens and, 442, 469-70 
Kolmogorov and, 492-93 
Laplace and, 483-89 
law of large numbers, 473, 492 
Markov chains, 492 
origins of, 439-40 
Pascal and, 338, 439, 446, 454-56, 
462, 465-66 
Poisson and, 491 
St. Petersburg paradox, 489-91, 495 
problem of points, 445—46 
Proclus (410-485) 
on Archimedes, 194 
biographical information, 234 
Commentary on the First Book of 
Euclid’s Elements, 33, 87, 160, 
207, 232, 234, 565 
on Euclid, 144, 156 
Eudemian Summary, 234 
on Eudoxus, 166 
on Hippocrates, 121 
on Pappus, 232 
parallel postulate, 563, 565 
on Plato, 136 
on Thales of Miletus, 87, 88, 89 
product rule, 415 
progressions. See arithmetic 
progressions; geometric 
progressions 
projective geometry, 338, 377-80, 448, 
544, 671 
projective planes, 378 
proofs 
computer assistance and, 750-51 
mathematical induction and, 461—67 
Thales contributions, 87 
theory of, 711 
proportion, theory of, 116, 117 
Protagoras of Abdera (c. 585-501 B.c.), 
131 
Prussia, 657, 676 
See also Germany 
pseudospheres, 601—2 
Ptolemy, Claudius (100-170 a.p.) 
biographical information, 188-89 
Geographike Syntaxis, 190-92 


illustration of, 189-90, 344 
map of habitable world, 190, 191 
omicron use, 25 
sexagesimal system use, 26 
theorem of, 192 
brief citations, 233, 313, 340, 342, 
565 
See also Almagest 
Ptolemy I (c. 364-283 B.c.), 136, 141, 
142, 144 
Ptolemy III (d. 221 B.c.), 142, 183, 185 
Ptolemy V (c. 210-180 B.c.), 36 
Ptolemy VII (d. 144 B.c.), 213 
Punic War, Second (218-201 B.c.), 194, 
195 
pyramid inch, 60 
pyramid mystics (pyramidiots), 58, 60 
pyramids, 56-57, 57-61, 87-88 
Pythagoras (c. 585-501 B.c.) 
biographical information, 90-94, 105 
illustration of, 91, 235 
Pythagorean problem solution, 107-8, 
117 
Pythagorean problem, 107-9, 117, 293 
Pythagorean theorem 
Babylonian use, 72-74, 76-77 
Bhaskara proof, 106-7, 252 
Cairo Mathematical Papyrus and, 
77-19 
Chinese mathematics and, 252 
Euclid’s proof, 156-59 
Pappus generalization, 169, 232 
proofs, 105-7, 119 
Thabit-ibn-Qurrah generalization and 
proofs, 24446, 263 
Pythagorean triples, 107-9, 117-18, 
293-98 
Pythagoreans 
amicable numbers, 510 
background information, 90-94, 144 
duplication of the square, 124 
vs. Eleatic school, 101, 102 
everything-is-a-number, 167, 345 
gurative numbers, 95—97 
Hippocrates and, 121 
irrationality and, 109-10, 116, 
167-68 
mathemata term use, | 
pentagram symbol, 92, 166 
perfect numbers, 503, 504 
Pythias and Damon story, 92 


quadratic equations 

application of areas method, 160-64, 
167, 207 

Arab solutions, 240-44, 263 

Babylonian solutions, 64-71, 167 

completion of squares method, 
240-41 

Fibonacci and, 283, 293 


Index 


Hindu solutions, 225 
indeterminate, 230 
Viéta solution, 362 
quadratic reciprocity law, 572 
quadratrix 
invention of, 122, 130-31 
Pappus and, 232 
in polar coordinates, 137 
for quadrature of the circle, 132-34 
for trisection of an angle, 131-32 
quadrature of the circle 
Dinostratus and, 132-33, 137 
Hippocrates and, 121-24 
modern approach, 751 
Pappus and, 232-33 
quadratrix for, 132-34 
spiral of Archimedes and, 210 
transcendence of z and, 693 
quadrature of the parabolic segment, 
206-7 
quadrature of the quarter-circle, 137 
quadrilaterals 
area, 54-56, 62, 193, 225 
Saccheri, 246, 567-69 
quadrivium 
Boethius and, 236 
Charlemagne and, 270, 271 
vs. humanities, 311 
Pythagoras school, 90 
universities and, 310 
quanti cation theory, 705 
quarter-circle, quadrature of, 137 
Quarterly Journal of Mathematics, 641 
quartic equations 
Descartes’ solution, 374, 381 
Ferrari’s solution, 328, 329-31, 334 
Vieta’s solution, 334 
quaternions, 637-39, 650, 663 
quintic equations, 331-32 
quipus, 6—7 
quotient, de ned, 173 
quotient rule, 415 


rabbit problem of Fibonacci, 287-88 
Rademacher, Hans (1892-1969), 741 
radius, of circle circumscribing triangle, 
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Rahn, Johann Heinrich (1510-1588), 
346, 507 

Ramanujan, Srinivasa (1887-1920), 
726-29 


Ramus, Peter, 383 
rank, in Pascal’s triangle, 458 
rational numbers 
countability of sets of, 686-87, 689 
cubic equation solutions, 237 
dened, 109 
irrational number approximation and, 
111-15 
unit fraction expression of, 43-46 
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Rawlinson, Henry Creswicke 
(1810-1895), 22-23 
real numbers, 237, 608, 620 
reasoning by recurrence, 463, 466 
See also mathematical induction 
Recensio, 428, 429 
reciprocals, 63-64, 532-33 
Recorde, Robert (1510-1558) 
false position method, 48 
The Grounde of Artes, 48, 314 
overview, 314 
The Pathwaie of Knowledge, 314 
The Whetstone of Witte, 314, 316, 
345, 361 
rectangles 
application of area and, 161—63 
area/perimeter relations, 66-67, 71 
geometric deception with, 291—92 
golden, 120 
rectangular equations, 65—67 
rectangular polyhedrons, 539 
recursive sequences, 288 
reduced form of cubic equations, 
322-24, 326, 327 
reductio ad absurdum arguments 
Legendre and, 575 
parallel line propositions, 155 
quadrature of the circle, 133 
quadrature of the parabolic segment, 
206 
Saccheri and, 567 
reduction, in Arab mathematics, 239, 
241, 245-46 
refraction, double, 469-70 
Regiomontanus (Johann Miiller) 
(1436-1476) 
arithmetic problem, 232 
calendar reform, 305-6 
classical revival and, 313 
De Triangulis Omnimodis, 305, 307 
on Diophantus’ Arithmetica, 512 
Ephemerides Astronomicae, 306 
Regius, Hudalrichus (. 1535), 506 
regular polyhedrons, 539 
regular prime numbers, 519 
Reign of Terror, 541, 544 
Reinhold, Erasmus (1511-1553), 306 
relatively prime integers, 176, 288, 289, 
295, 557 
remainder, de ned, 173 
remainder theorem, 229-30 
Remonstrance, 383 
Renaissance 
calendar revision, 305-6 
vs. Carolingian revival, 273-74 
vs. Islamic decline, 277 
overview, 301-5, 310-14, 337 
universities, establishment of, 306-10 
See also Italian Renaissance 
research ethic 


American universities, 663, 664-65, 
667-69, 670 
German universities, 658, 663-64 
resolvent cubics, 330-31 
reunion, in Arab mathematics, 239 
Rhaeticus (Georg Joachim) (1514-1576), 
313 
rhetorical algebra, 219, 239-40, 243, 
284, 348 
Rhind, A. Henry (1833-1863), 34, 35 
Rhind Papyrus 
arithmetic problems, 15, 37, 46-50, 
51-53 
closing prayer, 50 
decipherment of, 34-35 
geometric problems, 29, 54-56 
illustration of, 51 
unit fraction table, 40-43 
Ricci, Matteo (1552-1610), 28, 262 
Ricci, Ostilio, 339 
Richard’s paradox, 716 
Riemann, Bernhard (1826-1866) 
Habilitationsvortrag, 599 
illustration of, 599 
integral calculus, 612 
non-Euclidean geometry, 598-600 
parallel postulate, 599-600, 602 
Uber die Hypothesen welche der 
Geometrie zu Grunde liegen, 599 
zeta function, 723-24 
Riemann hypothesis, 724, 751 
Riemann surfaces, 598 
Riese, Adam (1492-1559), 361 
right angle hypothesis, 567, 568 
right angles, trisection of, 126 
right circular cones, 207-8 
right triangles, integral length sides, 74 
ring theory, 737-39, 742, 751-52 
Roberval, Gilles Personne de 
(1602-1675), 453, 500, 513 
Robinson, Julia Bowman (1919-1985), 
744-45 
Rockefeller, John D. (1839-1937), 668 
Roman calendar, 305 
Roman Empire 
Christianity in, 215—16, 235-37 
decline of mathematics and, 213-15 
preservation of heritage, 269, 270, 
277 
Punic Wars, 194, 195 
siege of Syracuse, 194, 195—96 
Roman mathematics, 235-37 
Roman numerals, 20, 280, 352, 506 
rope-stretchers, 54, 77 
Rosetta Stone, 34, 35-37 
roulette, mathematical expectation and, 
488-89 
Rousseau, Jean-Jacques (1712-1778), 
522, 523, 524 
Routh, Edward (1831-1907), 721 
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Royal Academy of Sciences of Denmark, 
631 
Royal Astronomical Society, 627, 628 
Royal Irish Academy, 634 
Royal Military Academy in Woolwich, 
641, 665 
Royal Society of London, 383, 388, 392, 
400, 406, 407, 409, 412, 421-22, 
426-7, 428, 429, 442, 442, 469, 
477, 497, 501-2, 527, 575, 627, 
649, 661, 683, 745 
Rudolf Il (emperor) (1552-1612), 358, 
359 
Rudolff, Christoff (c. 1500-1545), 231, 
346, 347 
Ruf ni, Paolo (1765-1822), 331-32, 643 
tule of signs, 372, 380 
Runge, Carl (1856-1927), 616 
Russell, Bertrand (1872-1970) 
axiom of reducibility, 707-8 
on Boole, 647 
on Descartes, 364, 680-81 
Introduction to the Philosophy of 
Mathematics, 708 
logistic school, 704, 706-8 
on Mathematical Tripos, 721 
paradox, 698-99, 701, 703, 705, 715 
Peano and, 706 
philosophical work, 708 
Principia Mathematica, 624, 707-8, 
710 
Principles of Mathematics, 680, 698, 
705, 706-7 
on superposition, 149 
theory of types, 707-8 
Wiener and, 670 
Russian multiplication, 38 
Russian university system, 591—92, 617, 
618 
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Some Important Historical Names, Dates,* and Events 


MATHEMATICAL 


GENERAL 


Early Beginnings (Before the Sixth Century B.C. ) 


B.C. 30,000 
8000 
2500 
1900 
1850 
1650 


Notched wolf bone 
Ishango bone 


Table tablets from Nippur 


Plimpton 322 
Moscow Papyrus 
Rhind Papyrus 


B.C. 3300 
2600 

2100 

1500 

1200 

700 


Menes unites Egypt 
Great Pyramid at Gizeh 
Code of Hammurabi 
Phoenician alphabet 
Trojan War 

Homer: The Odyssey 


Classical Period (Sixth Century B.C. to Fifth Century) 


B.C. 622-547 
585-500 

ca. 470 

460-380 

ca. 420 

408-355 

323-285 

287-212 

262-190 

ca. 240 

ca. 230 

A.D. ca. 75 
ca. 100 

85-160 

ca. 250 

ca. 260 

ca. 300 

365-395 

d. 415 

410-485 


*Most dates before 600 B.C. are only approximate. 


Thales of Miletus 
Pythagoras of Samos 
Theodorus of Cyrene 
Hippocrates of Chios 
Hippias of Elis 
Euxodus of Cnidos 
Euclid 

Archimedes 
Apollonius of Perga 
Nicomedes 
Eratosthenes of Cyrene 
Heron of Alexandria 
Nicomachus of Gerasa 
Claudius Ptolemy 
Diophantus 

Liu Hui 

Pappus of Alexandria 
Theon of Alexandria 
Hypatia 

Proclus 


B.C. 558-486 
485-430 

480 

469-399 

431 

388 

356-323 

ca. 370 

331 

213 

212 

195 

106-43 

44 

27 

AD. 100 
272-337 

286 

324 

455 


Darius the Great 

Herodotus 

Battle of Thermopylae 
Socrates 

Peloponnesian War 

Plato founds Academy 
Alexander the Great 
Eudemus of Rhodes 
Foundation of Alexandria 
Books burned in China 
Fall of Syracuse to Romans 
Rosetta Stone engraved 
Cicero 

Assassination of Caesar 
Beginning of Roman Empire 
Paper made in China 
Constantine the Great 
Division of the Empire 
Constantinople founded 
Vandals sack Rome 


MATHEMATICAL 


GENERAL 


Medieval and Renaissance Periods (Sixth Century to Sixteenth Century) 


ca. 475-524 


ca. 476-550 
ca. 625 

ca. 826-901 
735-804 


ca. 850-930 
ca. 1050-1130 
1114-1185 
1114-1187 
ca. 1120 
ca. 1175-1250 
1436-1476 
1445-1514 
1473-1543 
1494-1575 
1495-1552 
1500-1557 
1501-1576 
1510-1558 
1522-1565 
1526-1572 
1540-1603 
1546-1601 
1548-1626 
1550-1617 
1564-1642 
1571-1630 


Boethius 


Aryabhata 
Brahmagupta 
Thabit-ibn-Korra 
Alcuin of York 


Abt Kamil 

Omar Khayyam 
Bhaskara 

Gerard of Cremona 
Adelhard of Bath 
Leonardo of Pisa 
Regiomontanus 
Luca Pacioli 
Nicolas Copernicus 
Francesco Maurolico 
Peter Apian 
Niccolo Tartaglia 
Girolamo Cardan 
Robert Recorde 
Ludovico Ferrari 
Raphael Bombelli 
Francois Vieta 
Tycho Brache 
Pietro Cataldi 

John Napier 
Galileo Galilei 
Johannes Kepler 


636 


641 
732 
787 
800 


ca. 1000 
1066 
1096-1099 
1200 

1209 

1215 

1258 
1271-1295 
1293 
1337-1453 
1348 

1440 

1447 

1451 

1453 
1472-1514 
1478 

1492 

1513 

1517 
1564-1616 
1588 


Death of the Prophet 
Mohammed 

Library at Alexandria burned 

Arabs defeated at Tours 

Start of the Viking Invasions 

Charlemagne crowned 
Emperor 

Leif Eriksson sails to America 

Battle of Hastings 

The First Crusade 

University of Paris chartered 

Cambridge University founded 

Magna Carta signed 

Oxford University founded 

Travels of Marco Polo 

Paper produced in Bologna 

Hundred Years’ War 

Black Death in Europe 

Invention of Printing 

Founding of Vatican Library 

Death of Joan of Arc 

Turks capture Constantinople 

Leonardo da Vinci 

Treviso Arithmetic 

Columbus reaches West Indies 

Balboa discovers the Paci c 

Luther’s 95 Theses 

William Shakespeare 

Defeat of Spanish Armada 


MATHEMATICAL 


GENERAL 


Early Modern Period (Seventeenth and Eighteenth Centuries) 


1588-1648 
1591-1661 
1596-1650 
1601-1665 
1608-1647 


1616-1703 
1623-1662 
1629-1695 
1630-1677 
1635-1703 
1642-1727 
1646-1716 


1654-1705 
1656-1742 
1661-1704 
1667-1733 
1667-1748 
1667-1754 
1685-1731 
1690-1764 
1707-1783 
1717-1783 
1718-1799 
1728-1777 
1736-1813 
1749-1827 
1752-1833 
1768-1830 
1777-1855 
1781-1848 


Marin Mersenne 
Gérard Desargues 
René Descartes 
Pierre de Fermat 
Evangelista Torricelli 


John Wallis 

Blaise Pascal 
Christiaan Huygens 
Isaac Barrow 
Robert Hooke 
Isaac Newton 
Gottfried Leibniz 


James Bernoulli 
Edmond Halley 
Marquis de |’Hospital 
Girolamo Saccheri 
John Bernoulli 
Abraham DeMoivre 
Brook Taylor 

Christian Goldbach 
Leonhard Euler 

Jean le Rond d’Alembert 
Maria Agnesi 

Johann Lambert 
Joseph Louis Lagrange 
Pierre Simon Laplace 
Adrien-Marie Legendre 
Joseph Fourier 

Carl Friedrich Gauss 
Bernhard Bolzano 


1607 
1608 
1611 
1618-1648 
1619 


1620 
1632-1723 
1636 
1642-1649 
1658 
1662 
1663 


1666 
1682 
1683 
1687 
1694-1778 
1712-1786 
1737-1794 
1751 
1769 
1769-1821 
1770-1827 
1776-1783 
1789 
1789 
1798 
1798 
1798 
1799 


Jamestown founded 
Telescope invented 
King James Bible 
Thirty Years’ War 
Savilian Professorship 
(Oxford) 
Landing of Pilgrims 
Christopher Wren 
Harvard College founded 
English Civil War 
Death of Cromwell 
Royal Society of London 
Lucasian Professorship 
(Cambridge) 
Académie des Sciences 
Acta Eruditorum 
Turks defeated at Vienna 
Newton’s Principia 
Voltaire 
Frederick the Great 
Edward Gibbon 
Diderot’s Encyclopédie 
James Watt’s steam engine 
Napoleon Bonaparte 
Ludwig van Beethoven 
American Revolution 
Washington President 
French Revolution 
Eli Whitney’s cotton gin 
Ecole Normale founded 
Bonaparte in Egypt 
Rosetta Stone 


MATHEMATICAL 


Modern Period (Nineteenth and Twentieth Centuries) 


1789-1857 
1793-1856 
1802-1829 
1802-1860 
1804-1851 
1805-1865 
1805-1859 
1809-1882 
1810-1893 
1811-1832 
1814-1897 
1815-1897 
1821-1895 
1823-1891 


1826-1866 
1831-1916 
1845-1918 


1848-1925 
1849-1925 


1850-1891 
1852-1939 
1858-1932 
1862-1943 
1868-1942 
1872-1970 
1877-1947 
1882-1966 
1882-1935 
1887-1920 
1903-1957 
1906-1978 


Augustine Louis Cauchy 
Nicolai Lobachevsky 
Niels Henrik Abel 

John Bolyai 

Carl Gustav Jacobi 
William Rowan Hamilton 
P. G. Lejune Dirichlet 
Joseph Liouville 

Ernst Eduard Kummer 
Evariste Galois 

James Joseph Sylvester 
Karl Weierstrass 

Arthur Cayley 

Leopold Kronecker 


Bernhard Riemann 
Richard Dedekind 
Georg Cantor 


Gottlob Frege 
Felix Klein 


Sonya Kovalesky 
Ferdinand Lindemann 
Giuseppe Peano 
David Hilbert 

Felix Hausdorff 
Bertrand Russell 
Godfrey Harold Hardy 
L. E. J. Brouwer 
Amalie Emmy Noether 
Srinivasa Ramanujan 
John von Neumann 
Kurt Gédel 


GENERAL 


1801 
1809-1882 
1812-1814 
1812-1870 
1818-1883 
1820-1910 

1823 

1825 

1826 

1836 

1846 

1858 
1861-1865 

1869 


1871 
1876 
1878 


1879 
1894 


1895 
1903 
1914 
1914-1918 
1917 
1927 
1929 
1933 
1939-1945 
1963 
1969 
1989 


Ceres discovered 

Charles Darwin 

War of 1812 

Charles Dickens 

Karl Marx 

Florence Nightingale 

Monroe Doctrine 

Erie Canal opens 

Crelle’s Journal 

First telegraph 

Discovery of Neptune 

Transatlantic cable 

American Civil War 

American transcontinental 
railway 

German empire 

Bell’s telephone 

American Journal of 
Mathematics 

Edison’s electric lamp 

American Mathematical 
Society 

Discovery of X-rays 

First powered air ight 


Completion of Panama Canal 


First World War 
Bolshevik Revolution 
Lindberg’s ight to Paris 
The Great Depression 
Hitler becomes Chancellor 
Second World War 
Kennedy Assassinated 
Landing on the moon 
Berlin Wall dismantled 


